homepage Welcome to WebmasterWorld Guest from
register, free tools, login, search, pro membership, help, library, announcements, recent posts, open posts,
Pubcon Platinum Sponsor 2014
Home / Forums Index / Google / Google AdSense
Forum Library, Charter, Moderators: incrediBILL & jatar k & martinibuster

Google AdSense Forum

Examples of scrapper websites?
What are some examples?

 5:55 am on Jun 24, 2005 (gmt 0)

My sites are product/informational sites dealing with triathlons, diabetes & emergency preparedness. I've heard here in the forum about scrapper sites, what are they, how do they work, what are some examples and, finally how do they work with AdSense?



 6:28 am on Jun 24, 2005 (gmt 0)

Most likely you read from somebody who can't spell "scraper".
A scrapper is a person who is always getting into fist fights.
a Scraper is someone who copies content from other websites, usually without
credits or a link back, and almost always for Adsense (etc.) income.

Both activities (fisticuffs and plagiarism) are deeply frowned upon
here, perpetrators excepted of course. Its easy to spot the perps BTW.
When they aren't trying to justify or minimize the effects of scraping,
they try to accuse Google and Yahoo of it!

If G finds scraped pages, they often get dumped into the 'supplemental results'.
If Y finds scraped pages, or other signs of 'gaming the system', the whole
site might vanish from the SERPs never to be seen again. -Larry


 9:17 am on Jun 24, 2005 (gmt 0)

If G finds scraped pages, they often get dumped into the 'supplemental results'.

Well... after the recent Bourbon update, if you have a site with original content and you are scrapped (become a target for spammers with a bunch of Adsense ads) your site will be sent to the nobody-search-that-but-the-site-owner index (i.e.: your paragraph with quotes + your site name).


 9:46 am on Jun 24, 2005 (gmt 0)

Hello Mauricio: I'm sure you mean scraped, and not scrapped.
Something that is 'scrapped' is either thrown away, or salvaged for raw materials.

You are right about the problem though. Google does have trouble
telling the scraper from the originator of the content at times.
This could result in some very unjust things. I suppose you could say
that a site that gets scraped is in danger of getting scrapped. -Larry


 9:57 am on Jun 24, 2005 (gmt 0)

Despite larryhatch's emotional diatribe technically a scraper is a site that takes bits of other sites, puts them together as automated pages and allows SEs to find those pages and list them in SERPs. Technically, as has been trashed to death here, Google and Yahoo are scrapers with some differences: 1. They are welcome scrapers 2. They respect robots.txt 3. They actually make an attempt at listing sites in the order they think searchers will find most useful 4. They trawl sites, not SERPS. When people refer to scrapers here in WW they usually refer to those sites that scrape SERPs results and autogenerate that into pages - a practise that infuriates other webmasters, is borderline illegal, and clutters the internet with pages that are rarely of use to the visitor. The reason they exist is because they are lucrative, they run Adsense and a range of other monetisers and some can make several thousands dollars a day.


 10:45 am on Jun 24, 2005 (gmt 0)

A supposed guide for G spam spotters has xome examples of scraper and 'thin affiliate' sites.


 11:10 am on Jun 24, 2005 (gmt 0)

Utah: You asked for examples of scraper sites, but we can't give URLs here.
You might try Googling for <snip keywords>

That's scraping. Not just little snippets, the whole lengthy story.

I don't mind at all if somebody quotes a phrase or two from my stuff,
especially if they give a link back. If somebody copies a whole page,
or most of it without credits or a link, then I go ballistic. -Larry

[edited by: Jenstar at 1:37 am (utc) on June 27, 2005]
[edit reason] No keywords or specifics [/edit]


 6:36 am on Jun 25, 2005 (gmt 0)

I just got one of the scraper sites that had all the info from my index page of my smaller site posted on their index page! Went back and forth with the webhoster until they turned him off...getting a 404 from them now.

Now if I can figure out how to get the foreign country one shut down maybe Google will take mine out of number 51 where it went today from #40. You would think they would know the difference, been up since 2001 for crying out loud and it's a PR5? The site is completely focused on Numerology and the scrapers are all over everything else.

They don't sell adsense but they were selling a lot of other advertising. You would think someone had enough education to write their own copy?


And THAT is what a scraper site can do to you.


 6:48 am on Jun 25, 2005 (gmt 0)

Ann, sometimes you can't win.

A site of mine ranked #1 in Google for months. Then another genuine website asked if they could use a brief history on their site with credit and link to me. So I said yes with pleasure.

A few months later and that site is now ranked top ten consistently based on the strength of that brief history from one of my internal pages. I am stuuck at number 142.

If I tell them to remove it, I have no guarantee that I would regain my ranking so better to let them keep it and get some traffic from it. Or am I wrong?


 7:49 am on Jun 25, 2005 (gmt 0)

Right Joeking,

I would leave it. I guess I am going to have to do a complete redesign on it (now in 52nd place), and put some effort into writing content...too focused on the other site....

Oh, been up since 1998, redesign done in 2001. Of course it doesn't have a lot of content, they stole my biggest page LOL, didn't expect a small site to fight back. Didn't see the larger one I guess.

With my site still moving around does that signal another Google update? Still woozy from the last one.



 10:31 am on Jun 25, 2005 (gmt 0)

Hello Ann, Joe, all:

It looks like you see exactly the sort of thing I complain about, theft of contents.

<snip specifics>

That makes no difference to the scrapers. They mindlessly copy one another
with no respect for the facts. That is their contribution to the web.

There is no official definition of 'scraper' (note proper spelling, all you 'scrappers') ..
so you can argue that the search engines are scrapers if you wish.
I think its perfectly useless to do so however.

The real issues are the pond-scum that steal your content like a thief in the night,
for their own profit, often damaging the rightful sources.

I just checked my access_log file. Many thanks to any and all who visited,
having searched for <snip>! A few of you looked around a bit.
I strongly welcome suggestions for my site, please sticky if any.

Best wishes -Larry

[edited by: Jenstar at 1:36 am (utc) on June 27, 2005]
[edit reason] No "outting" of sites or references to your own [/edit]


 11:25 am on Jun 25, 2005 (gmt 0)

It's a bummer, especially when the scrapers are in a distant land in a galaxy far, far away..



 12:12 pm on Jun 25, 2005 (gmt 0)

Ann: Scrapers are a total bummer, but, that's the value of these WebmasterWorld forums;
lots of discussions about what can and cannot be done.
I don't know how much you have read here, but a lot of ideas have been kicked around.

Oddly enough, and especially in Europe, the mere discovery of scraping
is sometimes enough to get the perp to take his scrapes down.
(Some countries excepted: One in particular, well to the west, is a content bordello.)

In severe and damaging cases overseas where legal stuff is impractical,
I suggest getting all your ducks in a row and making complaints to the search engines.
It does the scraper no good if he is lost in the SERPs or delisted entirely.
If scraper is using Adsense or similar, you have TWO complaints you can make ..
One possibly more effective than the other. -Larry


 11:00 pm on Jun 25, 2005 (gmt 0)

It's a bummer, especially when the scrapers are in a distant land in a galaxy far, far away..

I'll repeat it one more time....

Where the scrapers exist doesn't matter as the search engines are where the good old DMCA is valid and G Y! and M$N have to abide by it. Whether the scraper stays online is one thing but if nobody can find him is another ;)


 12:45 am on Jun 26, 2005 (gmt 0)

Where the scrapers exist doesn't matter as the search engines are where the good old DMCA is valid and G Y! and M$N have to abide by it. Whether the scraper stays online is one thing but if nobody can find him is another ;)

No way you'll ever stop people from stealing your content. If it is available for legitimate users to view, it's vulnerable to theft, but I think incrediBill's pretty much got the solution nailed. Forget the thief, deal with the search engine he (she?) is gaming.

I mean, what would happen to your sites if the search engine traffic just... stopped?


 9:20 am on Jun 26, 2005 (gmt 0)

Agreed. Somebody could copy a whole site, but it doesn't matter much if nobody can find it.
If I can't find my own copied content, I take it nobody else can.
Go for the engines, if it comes to that. -Larry

Sweet Cognac

 1:01 pm on Jun 26, 2005 (gmt 0)

Larry, I appreciate you explaining the difference between scrappers and scrapers (I'm weary of seeing it mispelled, and have to reread what they are saying every time)

My sites have not been scraped, which I'm very grateful (yet I tend to wonder why)


 9:41 pm on Jun 26, 2005 (gmt 0)

Hi Cognac: You are lucky. Many scrapers look for high ranking sites in the SERPs
and just scrape those. Others seem to pick pages at random.

While I'm not always successful, I try to spell words properly because
so-called 'typos' distract me from the message being sent.
A 'typo' is a typographical error, a slip of the fingers BTW.
Why is it that people who can't spell have such slippery fingers?

Back on topic: Scraping isn't _always_ laziness and theft.
I suspect some people scrape because they are just incapable of composing decent copy.
One kid in Florida copied my index page whole, changing colors etc.
and making an awful mess of it. I emailed him and he took it down.
I wonder if he originally knew this was a no-no!

The real scrapers know damn well what they are doing though.
Whether they can spell or not is a separate matter. -Larry


 6:17 am on Jun 27, 2005 (gmt 0)

In severe and damaging cases overseas where legal stuff is impractical,

Well... I have no trouble with sites from my country (western Europe); I know the law and I know exactly what to do and how.

The US, well... the US are overseas.

Global Options:
 top home search open messages active posts  

Home / Forums Index / Google / Google AdSense
rss feed

All trademarks and copyrights held by respective owners. Member comments are owned by the poster.
Home ¦ Free Tools ¦ Terms of Service ¦ Privacy Policy ¦ Report Problem ¦ About ¦ Library ¦ Newsletter
WebmasterWorld is a Developer Shed Community owned by Jim Boykin.
© Webmaster World 1996-2014 all rights reserved