1. Download the Latest Links CSV file from Google Webmaster Tools.
[imgur.com...]
2. Open your preferred text editor and do some search and replace. It will help you split the data in the next step.
I use the pipe character (|), e.g.:
.com/ -> .com|/
.net/ -> .net|/
.org/ -> .org|/
.info/ -> .info|/
.biz/ -> .biz|/
.us/ -> .us|/
.de/ -> .de|/
.ru/ -> .ru|/
.nl/ -> .nl|/
.fr/ -> .fr|/
etc (a lot of TLDs)
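If you'd rather not list every TLD by hand, the same split can be scripted. This is only a rough Python sketch, assuming the export has one full URL per line starting with http:// or https://; the function name split_link is made up for illustration.

```python
from urllib.parse import urlsplit


def split_link(url):
    """Return 'scheme://domain|path', like the manual search and replace."""
    parts = urlsplit(url.strip())
    # Treat a bare domain as having the root path.
    path = parts.path or "/"
    # Keep the query string with the path so the URL structure stays visible.
    if parts.query:
        path += "?" + parts.query
    return f"{parts.scheme}://{parts.netloc}|{path}"


print(split_link("http://xxx.com/1234"))  # http://xxx.com|/1234
```

Run it over every line of the CSV and save the result; the import in the next step then works the same way as with the hand-edited file.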
3. Open a new Excel workbook and import the data.
Data - Get External Data: import the CSV file and select the pipe (|) as the separator.
[imgur.com...]
Now you will have the domain in the first column and the URL structure in the second column.
Select the second column (B) and sort alphabetically!
[imgur.com...]
[imgur.com...]
Now you can easily spot the domains that share the same URL structure. [u]Use search and clean to exclude, one by one, the domains that you add to the disavow file.[/u]
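The clean-up can be sketched in Python too. This assumes the rows are already (domain, path) pairs and that you want whole-domain entries in the "domain:" form the disavow file accepts; the function name disavow and the sample rows are hypothetical.

```python
def disavow(rows, spam_domains):
    """Split rows into what is left to review and the disavow entries.

    rows: list of (domain, path) pairs
    spam_domains: domains you have decided to disavow
    """
    spam = set(spam_domains)
    # Drop every row that belongs to a domain already flagged.
    remaining = [(d, p) for d, p in rows if d not in spam]
    # One 'domain:' line per flagged domain, sorted for easy diffing.
    lines = [f"domain:{d}" for d in sorted(spam)]
    return remaining, lines


rows = [("xxx.com", "/1234"), ("xerc.info", "/1234"), ("example.org", "/about")]
remaining, lines = disavow(rows, ["xxx.com", "xerc.info"])
print(lines)      # ['domain:xerc.info', 'domain:xxx.com']
print(remaining)  # [('example.org', '/about')]
```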
I used this technique to spot tons of spam domains that had only one link to me, so it was like a needle in a haystack, but I spotted them by the URL structure.
http://xxx.com/1234
http://xerc.info/1234
http://dfge.com/1234
etc
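
The sort-and-eyeball step can also be approximated with a small script. Again just a sketch, assuming one full URL per line; domains_by_path is a made-up name and example.org/about is a made-up row standing in for a normal link.

```python
from collections import defaultdict
from urllib.parse import urlsplit


def domains_by_path(urls):
    """Map each URL path to the domains using it, keeping only shared paths."""
    groups = defaultdict(set)
    for url in urls:
        parts = urlsplit(url.strip())
        groups[parts.path or "/"].add(parts.netloc)
    # A path that shows up on more than one domain is worth a closer look.
    return {path: sorted(doms) for path, doms in groups.items() if len(doms) > 1}


links = [
    "http://xxx.com/1234",
    "http://xerc.info/1234",
    "http://dfge.com/1234",
    "http://example.org/about",
]
print(domains_by_path(links))
# {'/1234': ['dfge.com', 'xerc.info', 'xxx.com']}
```

A shared path is only a hint, not proof of spam, so still check each domain before adding it to the disavow file.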