| Chinanet bot not playing nice: ignores the robots.txt file, needs to be whipped |
netchicken1

msg:3075760 | 4:41 am on Sep 8, 2006 (gmt 0) | Am I the only one having trouble with the CHINANET bot crawler? I have updated my robots file to cut out the non-text pages, yet this bot doesn't obey it. Is there something wrong with my file? Or is there some way to beat the bot into submission?
User-agent: *
User-agent: Mediapartners-Google*
User-Agent: Googlebot
Disallow:/post.php?
Disallow:/member.php?
Disallow:/misc.php?
|
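One way to rule out a syntax problem in the file is to run the quoted rules through a parser. The sketch below uses Python's standard `urllib.robotparser`. The bot name `ExampleBot` and the test URLs are made up for illustration, and the result only shows how this one parser reads the rules, not how any particular crawler behaves.

```python
from urllib.robotparser import RobotFileParser

# Rules as quoted in the post above, one directive per line.
RULES = """\
User-agent: *
User-agent: Mediapartners-Google*
User-Agent: Googlebot
Disallow:/post.php?
Disallow:/member.php?
Disallow:/misc.php?
"""


def build_parser(text):
    """Parse robots.txt text from a string instead of fetching it over HTTP."""
    parser = RobotFileParser()
    parser.parse(text.splitlines())
    return parser


parser = build_parser(RULES)

# "ExampleBot" is a hypothetical user agent; it falls under the "*" group.
for url in ("/post.php?action=reply", "/index.php"):
    print(url, parser.can_fetch("ExampleBot", url))
```

If a compliant parser blocks the forum scripts and allows the other pages, the file itself is probably not the problem, and the bot is simply not honouring it.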
Wlauzon

msg:3092261 | 10:38 pm on Sep 21, 2006 (gmt 0) | I put an IP block on all of china...
|
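As a rough illustration of an IP block done at the application level, the sketch below checks a client address against a list of denied networks using Python's standard `ipaddress` module. The networks shown are reserved documentation ranges used as placeholders, not real allocations for any country, and `is_blocked` is a hypothetical helper name.

```python
import ipaddress

# Placeholder networks from the reserved documentation ranges;
# substitute the real CIDR blocks you want to deny.
BLOCKED_NETWORKS = [
    ipaddress.ip_network("192.0.2.0/24"),
    ipaddress.ip_network("198.51.100.0/24"),
]


def is_blocked(addr):
    """Return True if the client address falls inside any denied network."""
    ip = ipaddress.ip_address(addr)
    return any(ip in net for net in BLOCKED_NETWORKS)


print(is_blocked("192.0.2.55"))
print(is_blocked("203.0.113.9"))
```

In practice the same list is usually enforced in the firewall or web server configuration, so that blocked requests never reach the forum software.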
netchicken1

msg:3092269 | 10:44 pm on Sep 21, 2006 (gmt 0) | I wondered about that. I also wonder what sort of legitimate traffic would be lost by doing that. BTW, I fixed the robots text above: got rid of the Google entries and the blank line. It makes no difference to the bots, though.
|
DamonHD

msg:3092277 | 10:49 pm on Sep 21, 2006 (gmt 0) | Hi, Block by IP address or (better) by behaviour. robots.txt is voluntary, and is ignored by bad bots, in the same way as a "please do not burgle this house" notice would be treated by a thief... Rgds Damon
|
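Blocking by behaviour can be sketched as a sliding-window request counter per client address. This is a minimal sketch under assumed thresholds, not a recommendation for specific values. The class name `RateTracker`, the limits, and the sample address are all hypothetical.

```python
from collections import defaultdict, deque


class RateTracker:
    """Flag clients that exceed max_hits requests within a window of seconds."""

    def __init__(self, max_hits=30, window=10.0):
        self.max_hits = max_hits
        self.window = window
        self.hits = defaultdict(deque)  # ip -> timestamps of recent requests

    def is_abusive(self, ip, now):
        """Record a request at time `now` and report whether the limit is exceeded."""
        recent = self.hits[ip]
        recent.append(now)
        # Drop timestamps that have fallen out of the window.
        while recent and now - recent[0] > self.window:
            recent.popleft()
        return len(recent) > self.max_hits


# Hypothetical limits: more than 3 requests in 10 seconds counts as abusive.
tracker = RateTracker(max_hits=3, window=10.0)
results = [tracker.is_abusive("192.0.2.1", t) for t in (0.0, 1.0, 2.0, 3.0)]
print(results)
```

An address flagged this way could then be added to a deny list for a while, which catches bots that ignore robots.txt regardless of where they come from.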