Forum Moderators: martinibuster
Basically the entire site is contained in one page: index.php.
This script uses cURL to download the contents of a remote page and serve them from my server. The purpose is to let users browse anonymously and bypass content filters.
Example: index.php?google.com will load google.com's content entirely from my domain. Even images will be loaded from my site. At the top I can insert my own advertisements or simply a "back" button.
Can I put AdSense code on this index page? Can I make frames and put AdSense code in the top frame?
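For readers trying to picture the setup described above, here is a minimal sketch of the fetch-and-rewrite step. It is written in Python rather than the poster's PHP/cURL, and it is not the poster's actual script: the names `fetch`, `rewrite_links`, and `PROXY_PREFIX`, and the choice to URL-encode the target, are all assumptions made for illustration. The regex only handles quoted `href` and `src` attributes, so treat it as a rough idea of how images and links end up being served from the proxy's own domain.

```python
import re
from urllib.parse import quote, urljoin
from urllib.request import urlopen

# Hypothetical proxy entry point; the thread only says the site is index.php.
PROXY_PREFIX = "/index.php?"

# Matches href="..." or src="..." with single or double quotes.
ATTR_RE = re.compile(
    r'''(?P<attr>\b(?:href|src))=(?P<q>["'])(?P<url>.*?)(?P=q)''',
    re.IGNORECASE,
)


def fetch(url: str) -> str:
    """Download a remote page (stands in for the cURL call in the thread)."""
    with urlopen(url, timeout=10) as response:
        return response.read().decode("utf-8", errors="replace")


def rewrite_links(html: str, base_url: str, proxy_prefix: str = PROXY_PREFIX) -> str:
    """Point every quoted href/src in the page back at the proxy script."""

    def replace(match: re.Match) -> str:
        # Resolve relative URLs against the page they came from.
        absolute = urljoin(base_url, match.group("url"))
        # Encode the target so it survives as a single query string.
        proxied = proxy_prefix + quote(absolute, safe="")
        q = match.group("q")
        return f"{match.group('attr')}={q}{proxied}{q}"

    return ATTR_RE.sub(replace, html)
```

With these assumptions, `rewrite_links('<img src="/logo.png">', "http://example.com/page")` yields `<img src="/index.php?http%3A%2F%2Fexample.com%2Flogo.png">`, so the visitor's browser requests the image from the proxy domain and never contacts the remote site directly.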
I would expect this to cause technical problems that get you noticed and booted very quickly, especially in the case of the example you used.
The bot won't see anything but dynamic pages from your domain, and I'd expect the 1:1 ratio of page views to bot visits to trip an alarm inside the first hour.
That's a user-agent-based approach rather than an IP-based approach, though. It's a difficult problem: cURL, for instance, allows setting the user agent header to any arbitrary value, so if I see strangeness in my logs, I ban by IP at that point. I don't know of a scraper blacklist, although some people do block a certain heavily populated country in its entirety.
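As a rough illustration of the "ban by IP" idea above, here is a small Python sketch that counts requests per client address in an access log and ignores the user agent entirely, since that header can be spoofed. It assumes the client address is the first whitespace-separated field of each line, as in the Common Log Format. The function names and the threshold of 100 are hypothetical; the thread gives no numbers, and in practice the decision is a judgment call made while reading the logs.

```python
from collections import Counter

# Hypothetical threshold; the thread does not specify one.
MAX_REQUESTS = 100


def client_ip(log_line: str) -> str:
    """Return the first field of a log line, assumed to be the client address."""
    return log_line.split(None, 1)[0]


def ips_to_ban(log_lines, max_requests: int = MAX_REQUESTS) -> list:
    """List addresses that made more than max_requests requests.

    The user agent is deliberately not consulted, because a client
    such as cURL can set it to any value.
    """
    counts = Counter(client_ip(line) for line in log_lines if line.strip())
    return sorted(ip for ip, hits in counts.items() if hits > max_requests)
```

The resulting list could then be fed to whatever blocking mechanism the server uses, such as a firewall rule or a deny list.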
Some centralized database where one can nominate IPs and search among offenders... sounds like a project for Brett to add to this site :)
Other pages only show the firewall- and proxy-type ads that you would expect.