Forum Moderators: martinibuster
you could also create a 'black list' website
it would list but not link to the violator websites
as well as list the domain name owners etc
php-mysql
add to database-
but make sure there's an appeal process for people who feel they've been listed in error
if we create this and link to it, it will get seen-
make it easy for google to grab the data, and we shouldn't need too many connections
Clearly, that's not a cheater/scraper site, that's a helpful hub/webguide.
And it's very simple to distinguish: If it does not help users, and just cheats them and search engines, it's a scraper. If, on the contrary, it does help users to find the good stuff in the information jungle, it's an useful hub.
Frankly, I don't even think they can make that much from Adsense...You need alot of traffic to make $10,000/month
If they want to fill the gaps till "real useful quality" sites compete for the keywords....better than nothing?
That could be another reason why google doesn't care...
Or does google care? Has anyone asked?
Is there anyone with a "real" site getting killed by a scaper site....I would LOVE to see it....Please Please Please...
The best (worst) part is, the scrapper site is not even in english.
And I quote myself so you guys can get it through your head.
I as a user go to Google knowingly
I don't want to type a keyword on Google and be taking to another search engine,
Why can your people see the difference .
spaceylacie you for real, I got one site that should take first place, at list in my niche, lol
Qur1uS, can I pm you the site, mine is legit but this scraper site is on top in all my keywords lol, so yea there legit sites getting affected, one thing I think this mainly affects small sites, I think like mine which usually gets like 700 unique a day, although lately I am getting only 400 mainly due to this site which I email Google about but nothing happen.
Wonder what (G) will do if these scraper sites get enough data to sell lists of the high paying keyword and when they make enough money on the lists they won’t care if they get their account canceled or not
If you somehow manage to ban them from adsense then they will just move to another ppc company. There are companies out there who pay about half of what adsense pays, but they have no morales, and therefore you can put ads on any type of sites.
The blacklist wont do anything either.
I have reported sites and nothing gets done, and dont forget that these sites owners prob have numerous servers, numerous adsense accounts, numerous ips etc...
i have though about this a lot and i think the best thing i can do is spend my time improving my own sites and spend less time worrying about scraper sites.
Believe it or not you will make more money from working on your own sites instead of whinging about others... sad fact, but true.
the sad fact is you will never get rid of such sites, all you can do is make it harder for them.
If you somehow manage to ban them from adsense then they will just move to another ppc company. There are companies out there who pay about half of what adsense pays, but they have no morales, and therefore you can put ads on any type of sites.
Fine, banning them from AS would be a first step, and a big one. If scrapers have to move to another PPC company, they will at least stop draining AdSense advertisers' budgets - i.e. this money will then be spent for valid high-quality sites that provide a service (and presumably also high-quality clicks).
Advertisers will not follow the scrapers to the new PPC programs. In fact they would be happy to see the scrapers gone! Just watch the related threads over at the AdWords Forum. You will see that the advertisers demand this from G as well.
Truth is that AS is the most popular PPC program just BECAUSE it generates real money fast. Other PPC programs are less reliable and less attractive.
-- M.
IMO the only difference between "scraper sites" and Google, Yahoo and MSN is the search engines are welcome, even encouraged to come by and scrape our sites.
Let's look at a few characterstics of scrapers as defined in this thread:
1 - Scrapers rely almost entirely on automation to generate their pages. Google is the king of relying almost entirely on automation to generate their pages.
2 - Scrapers come by and extract snippets of content without first asking permission. Google does this on a daily basis.
3 - Scraper sites exist solely to make money. Let's face it, Google exists solely to make money. Remove Google's ability to earn revenue from advertising and it will disappear very quickly.
That being said, with the exception of Google, Yahoo and MSN, I don't like scrapers either for many of the reasons stated in this thread... but most of all because they clutter up the pages of my preferred scrapers: Google, Yahoo and MSN.
Let's be honest here - the criteria we use to form our own definitions of "scraper sites" are purely selfish ones. If a site helps us make money we like to say it isn't a scraper. If it helps someone else make money (but not us) it should be labeled a scraper. It's really that simple.
Google, Yahoo and MSN are scrapers in every rational sense of the term, but I love it when they crawl my sites and index my pages. Why? Because they help me make money.
The lower tier scraper sites that make my pages harder to find in the top tier scrapers get my goat. Why? Because they don't help me make any money.
To say I hate all scrapers would be intellectually dishonest because I happen to love Google, Yahoo and MSN. I simply hate the scrapers that don't benefit me and my family.
I have NOT invited one scraper site.And this betrays a fundamental misunderstanding of how the internet works. When you post a page on the internet that is not password protected you are inviting all comers. If you don't want them, require password access--that will stop them.
Every day dozens of unwanted bots visit my site for unknown reasons using my bandwidth. Not all are scrapers but they simply can't be stopped except by blocking IP's or user-agents. Fact of life, get over it.
A.
A SE creates an index through spidering the web, applying an algo and delivering results from the index based on user queries. A SE attempts to organize the entire web, regardless of how much Adsense is paying per click on a given topic.
A scraper copies information from search engines; they do not collect their own info and attempt to organize it. A scraper is only interested in money keywords. That's why scrapers post garbage pages about things like "carribean-travel-ringtones-texas-hold-em."
The only people who can't tell the difference between a SE and a scraper are scraper publishers who prevaricate on this board and say "I don't publish scrapers, but I love them!"
Little tip guys -- scrapers are a big pain in the rear, but they are not illegal. If you are going to publish garbage, I say, be proud! Admit you publish garbage and stop trying to tell me that I should love it cuz it smells and tastes like chicken.