This is what I use now (4th round of cleaning).
I would really love to be able to put:
domain:*.info
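Until something like that is possible, a rough workaround is to generate one domain: line per host for a given TLD. This is only a sketch: the function name disavow_lines_for_tld and the sample list are made up for illustration, and it assumes every link is a full URL with a scheme.

```python
from urllib.parse import urlsplit

def disavow_lines_for_tld(urls, tld):
    """Return sorted 'domain:' lines for every host in urls that ends in tld."""
    suffix = "." + tld.lstrip(".")
    hosts = set()
    for url in urls:
        host = urlsplit(url.strip()).hostname  # None if the URL has no scheme
        if host and host.endswith(suffix):
            hosts.add(host)
    return ["domain:" + host for host in sorted(hosts)]

# Hypothetical sample data, not real export contents.
sample_links = [
    "http://xerc.info/1234xxx",
    "http://xxx.com/1234xxx",
    "http://xerc.info/other",
]
print("\n".join(disavow_lines_for_tld(sample_links, "info")))
```

Note that the host is used as-is, so a link on www.xerc.info would produce its own line.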
===
1. Download the Latest Links CSV file from Google Webmaster Tools.
[imgur.com...]
2. Open your preferred text editor and do some search and replace. It will help you split the data in the next step.
I use the pipe character |
ex:
.com/ -> .com|/
.net/ -> .net|/
.org/ -> .org|/
.info/ -> .info|/
.biz/ -> .biz|/
.us/ -> .us|/
.de/ -> .de|/
.ru/ -> .ru|/
.nl/ -> .nl|/
.fr/ -> .fr|/
etc (a lot of TLDs)
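If the list of TLDs gets too long to replace by hand, the same split can be sketched in a few lines of Python. This is an assumption-laden example, not part of the original workflow: to_piped is a made-up helper name, and it assumes each line is a single full URL.

```python
from urllib.parse import urlsplit

def to_piped(url):
    """Insert a pipe between the domain and the rest of the URL, for any TLD."""
    url = url.strip()
    parts = urlsplit(url)
    if not parts.netloc:
        return url  # not a full URL, leave the line untouched
    path = parts.path or "/"
    if parts.query:
        path += "?" + parts.query
    return f"{parts.scheme}://{parts.netloc}|{path}"

# Hypothetical sample lines.
for line in ["http://xxx.com/1234xxx", "http://xerc.info/1234xxx"]:
    print(to_piped(line))
```

The output has the same shape as the manual replacements above, so it can be imported with the pipe as separator in the same way.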
3. Open a new Excel workbook and import the data (Data - Get External Data).
Import the CSV file and select the pipe | as the separator.
[imgur.com...]
Now you will have the domain in the first column and the URL structure in the second column.
Select the second column (B) and sort alphabetically!
[imgur.com...]
[imgur.com...]
Now you can easily spot the domains that share the same URL structure.
[u]Use search and clean to remove, one by one, the domains that you add to the disavow file.[/u]
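The clean-up step can also be sketched as a filter that drops every link whose domain is already in the disavow list. The helper name remaining_links and the sample data are hypothetical, and the match is on the exact host only.

```python
from urllib.parse import urlsplit

def remaining_links(urls, disavowed_domains):
    """Keep only the links whose host is not already disavowed."""
    blocked = {d.lower() for d in disavowed_domains}
    kept = []
    for url in urls:
        host = urlsplit(url.strip()).hostname or ""  # hostname is lowercased
        if host not in blocked:
            kept.append(url)
    return kept

# Hypothetical sample data.
links = ["http://xxx.com/1234xxx", "http://example.org/about"]
print(remaining_links(links, ["xxx.com"]))
```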
I used this technique to spot tons of spam domains that each had only one link to me, so it was like looking for a needle in a haystack, but I spotted them by the URL structure.
http://xxx.com/1234xxx
http://xerc.info/1234xxx
http://dfge.com/1234xxx
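The same pattern can be found without sorting by eye. Here is a minimal sketch that groups links by URL path and reports the paths shared by several domains; shared_paths and the min_domains threshold are names I made up, and example.org is only there as a non-matching entry.

```python
from collections import defaultdict
from urllib.parse import urlsplit

def shared_paths(urls, min_domains=2):
    """Map each URL path to the sorted hosts using it, if enough hosts share it."""
    hosts_by_path = defaultdict(set)
    for url in urls:
        parts = urlsplit(url.strip())
        if parts.hostname:
            hosts_by_path[parts.path or "/"].add(parts.hostname)
    return {
        path: sorted(hosts)
        for path, hosts in sorted(hosts_by_path.items())
        if len(hosts) >= min_domains
    }

links = [
    "http://xxx.com/1234xxx",
    "http://xerc.info/1234xxx",
    "http://dfge.com/1234xxx",
    "http://example.org/about",
]
for path, hosts in shared_paths(links).items():
    print(path, hosts)
```

Only exact path matches are grouped here; near-identical structures would still need the sorted view in the spreadsheet.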