Forum Moderators: Robert Charlton & goodroi
luke175 - where would the original publication date come from? If this is a databased date of publication, it would be easily forged. If it is a server timestamp - what happens when you change the file, or change it's name? What if you have a hard drive failure, and had to reupload an entire site?
I can't see how they could get a reliable 'publish' date from anything.
The only thing that could be of use (that I can see) would be an index date as recorded in google's database, even though that still has it's faults.
Solution: Only allow the original content to be crawled by Google for 48hrs. eg: cloak it - then release it to the general public after it shows in Google index.
In many cases Google is ranking the thief higher than the original source.
tedster wrote:
But it is not so trivial to spoof an IP address and as far as I know, it's impossible to spoof your way through this process. See How To Verify Googlebot [webmasterworld.com]
You can't tell that by just looking for snippets.
[edited by: Robert_Charlton at 7:37 pm (utc) on Apr 12, 2011]
'Unless of course it's ehow doing the copying, then save us all some time and don't bother filing...', so maybe people need to file more complaints when they think their work has been 'borrow' by the big guys?
Before Panda scrapers did not outrank my original stories. Now they do. That is all that matters. Google lost a functionality.
Additionally, the domain of the scraped content was not even a registered or indexed when Google first indexed my content- so why would Google index them higher than me for the same content?
[edited by: tedster at 1:24 am (utc) on May 2, 2011]
[edit reason] moved from another location [/edit]