Welcome to WebmasterWorld Guest from 54.158.65.139

Forum Moderators: bakedjake

Chinanet bot not playing nice

ignores the robots.text file, needs to be whipped

   
4:41 am on Sep 8, 2006 (gmt 0)

5+ Year Member



Am I the only one having trouble with the CHINANET bot crawler?

I have updated my robots file to cut out the non text pages, yet this bot doesn't obey it. Is there something wrong with my file? Or is there some way to beat the bot into submission?

User-agent: *
User-agent: Mediapartners-Google*
User-Agent: Googlebot

Disallow:/post.php?
Disallow:/member.php?
Disallow:/misc.php?

10:38 pm on Sep 21, 2006 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member



I put an IP block on all of china...
10:44 pm on Sep 21, 2006 (gmt 0)

5+ Year Member



I wondered about that, I also wonder what sort of legitmate traffic would be lost doing that.

BTW I fixed the robots text above, got rid of the google entries and the blank line.

Makes no difference to the bots though

10:49 pm on Sep 21, 2006 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member



Hi,

Block by IP address or (better) by behaviour.

robots.txt is voluntary, and is ignored by bad bots, in the same way as a "please do not burgle this house" notice would be treated by a thief...

Rgds

Damon

 

Featured Threads

My Threads

Hot Threads This Week

Hot Threads This Month