Page is a not externally linkable
- Search Engines
-- Search Engine Spider and User Agent Identification
---- Baidu's IPs - Which Are Legit?


rowan194 - 12:38 pm on Aug 15, 2012 (gmt 0)


I've had a lot of problems with Baidu, so much so that I wrote a script that firewalls any c class that loads with a Baidu user-agent. Not a great long term solution, as anyone knowing this could perform a simple DoS - load a single page with a faked Baidu referer and the 256 IPs around you are quickly blocked - but I'd had it with them hitting my sites. It's the only time I've had to firewall a major crawler, rather than just blocking it with robots.txt (which doesn't seem to work.)

An interesting side effect is that crawlers purporting to be Baiduspider get blocked too. :)


Thread source:: http://www.webmasterworld.com/search_engine_spiders/4475767.htm
Brought to you by WebmasterWorld: http://www.webmasterworld.com