JeffOstroff - 2:30 am on Nov 27, 2012 (gmt 0)
Here's a great and very powerful method that I have been using since 2005 to beat these scraper scammers at their own game, I never see people mention it, but this strategy involves not only shutting down the scraper sites, but also getting them booted out of Google. The problem is folks, Google wants us webmasters to do their dirty work and scrub the toilet bowl for them.
Now I too have a high quality site since 1999 that constantly has 300 or more scrapers, almost exactly what Frost_Angel reported here. it's very simple folks, here's what you do, here’s the recipe for revenge:
1) Identify the scraper site
2) Perform a whois lookup on the site to get the abuse email of the web host
3) Send a properly formatted DMCA (Digital Millennium Copyright Act) Notice to web host with screen shot of offending data.
4) Web host shuts down the offending web site, leaves a 404 Error message in its place
Here's my extra spice to this recipe:
5) Submit the dead URL to Google’s URL Removal Tool immediately while dead scraper site is still 404!
6) By the next day check the status in the URL removal tool, it will either say the site was REMOVED or Denied. The tool keeps a history of all the sites you have submitted for you to see.
By performing these steps above, you really put the screws to the scammer, because not only did you shut down his MFA, you removed them from Google’s index. That’s the important part that people skip over.
Also, Waste no time submitting the dead URLs to Google’s URL Removal Tool, as the scammers can pop back up tomorrow on another host.
So the very second I get a DMCA response from a web host, I instantly submit that dead URL to Google’s URL Removal tool. Done! It’s Miller time.
Another great benefit is that as you search Google for your stolen content, you are sure to find as I do, many URLS that are already dead, simply because their domain expired, or they got shut down for other reasons, but they still appear in Google. Feel free to submit those to the tool also, that saves you time.
Now it does take a couple of weeks for Google to re-index and flush them out, but it’s important to get these shuttered sites into the queue to be removed from Google.
If the webhost takes down the offending page, but it’s not returning a real 404 header error (maybe a splash page instead), Google’s tool will whine at you that it can still access the page. But, it offers to remove that page from Google’s cache, which to me is the next best thing.
But it asks you to enter a word that you know that was on the offending URL page before but is not there now. So you have to know a unique word from your content that was on the page but is no longer there. Enter that word into the form, and submit, check it next day. 75% of the time they remove the page from the cache. Other times it just says “DENIED”. I can’t tell why they deny some of them, which is why I specify to the web host to give me a 404 error page, because 100% of the time, with a 404 page, Google will remove it from the main index, and I would imagine the cache too.
I hope this helps many of you who were unfamiliar with this.
Happy shooting and looting!