Forum Moderators: skibum

Message Too Old, No Replies

Someone probably scraping my site.

         

StephenBauer

7:26 pm on May 17, 2005 (gmt 0)

10+ Year Member



However, it is just an Amazon AWS/ECS affiliate site where they could get the same information anyway (only a bit more work of course).

Should I add some code to allow me to "stop" these folks? Should I not worry about it since the information is readily available from Amazon anyway (again, with a bit more work though)? Should I be concerned about them copying these web pages and then just re-branding them?

Hmmm. Decisions.

thowze

8:55 pm on May 28, 2005 (gmt 0)

10+ Year Member



Don't know if you have read about Google's own site getting hijacked, but this is a good article to read about this:

[pandia.com...]

It's taking place on my site too. You're not by yourself.

roadhazard

11:25 am on Jun 2, 2005 (gmt 0)

10+ Year Member



well, I am very new at this, only one year, but I added the frame breaker code to all my pages. I do not think anyone has framed my pages yet, but now they can't. There are several free frame breaker codes online. Is framing the same as scraping?

StephenBauer

3:07 pm on Jun 2, 2005 (gmt 0)

10+ Year Member



It is similar in that they are stealing your content in one way or another. Scraping usually involves (automagically) copying your content for use on their page(s).

Other ways to accomplish similar things: framing your content. 302 redirecting to your content (although Google seems to be working on this quite a bit lately).

thowze

4:55 pm on Jun 3, 2005 (gmt 0)

10+ Year Member



It happened to my site too. Here is what I did to fight back.

Run a WHOIS to see if you can find a site owner

I was lucky enough to find that the domain's owner was in Los Angeles. I wrote emails to them explaining the problem and asking for their help first before escalating the issue to a lawyer. This helped to get the offending pages deleted within a week.

Help others to help you

More than likely this individual has scraped other sites besides yours. Go through the other scraped pages and see if there is a name, business product or event that may be found on other sites. If it is obvious that content was stolen from other sites, (I opened another window and used a search engine like Yahoo! to look for other sites using portions of the scraped content as a search term), email the webmaster/owner and let them know. Plus ask them to complain to the domain owner too. This gets others on your side quickly.

See if the scraped site has affiliate programs on it

Most of the instances of scraped sites I've seen have links to affiliate programs because the individual is too lazy to do the work needed for top ranking pages, but wants the income. They will have affiliate program links that can show up sometimes when you click on the webpages product offerings. The affiliate ID will generally be in the address bar and you will be taken to the company offering a product. Copy the affiliate ID and complain to the company. Do what you can to stop their money flow.

By doing this I ended up having the scraped pages deleted from the domain and am gaining my Google and Yahoo! rankings back. The domain owner was a reputable business who didn't know the pages were there. But that is still not enough, as I am doing everything I can to have the hijacker's affiliate agreements cancelled. No one wants to be sued behind an affiliates actions. Hope that helps!

StephenBauer

7:36 pm on Jun 3, 2005 (gmt 0)

10+ Year Member




I did the research previously and it does go back to some company but I couldn't find anything on the web linked to them. They have not come back although they did grab 500 pages (of tens of thousands of pages) before they left. If they come back, I will be more diligent (and block their IP). I don't want to impose any additional logic in the page until I have an actual issue that occurs repetitively. I am sure I will have to at some point though.