JeffOstroff - 2:08 pm on Dec 26, 2012 (gmt 0)
You guys are completely missing the point here. I'm not talking about backing up my site. We back everything up regularly. I have backups.
I'm not talking about using wayback machine as a backup, I'm talking about using it as proof that our site was online with the content before the scraper site had it.
It's like playing whack a mole, you'll never stop them from coming onto your site, even if you update your robots.txt daily, because they keep changing names to fool your robots.tx file. Furthermore, many of the scrapers who steal our content are regular jos, and businesses who grabbed a paragraph. that has nothing to do with robots, when they come and manually cut and paste.
Furthermore, as web hosts are switching to automated DMACAs, they are requiring that you supply them with a URL that has your copyright content, as well as the url of the offending site. If you cannot supply them with a URL that shows your content that matches what the offending site has, they kick it back to you.
I don't know how you guys turned this into a backing up your data issue and my lack of being a responsible backup person, and then start talking about multiple hard drives.
This whole thread was about using the Wayback to PROVE, understand me, prove that I had the content up first. We have a popular auto related web site, which many people come and manually steal stuff, and there is also robot scraped but I doubt the threat from wayback is as big as the conspiracy theory people are suggesting. They are coming to our site and grabbing stuff BEFORE it appears on wayback. Our problem has not been with wayback. Wayback has been the solution for us.