Forum Moderators: not2easy

Message Too Old, No Replies

Stealing from search engine cache

         

my2yen

11:02 am on Sep 2, 2007 (gmt 0)

10+ Year Member



Hello,

I noticed that some people are using a tool to steal others contents from search engine cache in Japan.

This tool scraps contents from search engine cache and create a new page. Often obnoxious banner ad is placed on the top of the page, followed by stolen contents. Any links contained in the stolen contents are replaced by links to the thief's site that is filled with affiliate links.

I am wondering if anyone know similar tools that were used in the past? If so, what did people do to stop someone stealing from cache? Is using "noarchive" an option?

Quadrille

2:44 pm on Sep 2, 2007 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member



Content theft is a fact of life; if a page can be found, it can be stolen.

I don't see any advantage to the thief of stealing from the cache, rather than the site itself, but theft is theft.

The only way to stop it is to close your site down.

But there are things you can do after it's happened ...

my2yen

4:39 pm on Sep 2, 2007 (gmt 0)

10+ Year Member



Quadrille,

Thanks for your comment.

I do agree that there are always some people who steal others' contents.

I am no expert on the tools that are used to scrap other people's contents, but I have seen some. I've seen some tools & tricks, used in English speaking countries, spread in Japan a year or two years later. I have not seen the one that stole from search engine cache though. I was wondering why they want to copy from cache as well.

I just wanted to gather as much information on the enemy...!

But, I guess it may be a waste of time if they can not be stopped anyway...

londrum

7:24 pm on Sep 2, 2007 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member Top Contributors Of The Month



there's a couple of reasons why they might want to lift it from a search engine's cache
1) they might already have been blocked from the site
2) they know they won't run into any bot-traps

noarchive is the only way to stop it from appearing in the cache. i don't see any real benefit in having it in the cache anyway.