rowan194 - 12:38 pm on Aug 15, 2012 (gmt 0)
I've had a lot of problems with Baidu, so much so that I wrote a script that firewalls the whole class C (/24) of any IP that loads a page with a Baidu user-agent. It's not a great long-term solution, as anyone who knows about it could perform a simple DoS: load a single page with a faked Baidu user-agent and the 256 IPs around you are quickly blocked. But I'd had it with them hitting my sites. It's the only time I've had to firewall a major crawler rather than just block it with robots.txt (which doesn't seem to work).
An interesting side effect is that crawlers purporting to be Baiduspider get blocked too. :)
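For anyone wanting to try the same approach, here is a rough Python sketch of the idea, not the actual script. It assumes IPv4 addresses, a plain substring match on "baidu" in the user-agent, and iptables as the firewall; the function names are made up for illustration. It only builds the commands, it doesn't run them.

```python
import ipaddress


def is_baidu_user_agent(user_agent):
    # Assumption: a case-insensitive substring match is enough to spot
    # anything claiming to be Baidu (real or fake).
    return "baidu" in user_agent.lower()


def class_c_block(ip):
    # Return the /24 ("class C") network containing the given IPv4 address.
    return str(ipaddress.ip_network(ip + "/24", strict=False))


def firewall_rule(ip):
    # Build (but do not execute) an iptables command that drops the /24.
    return ["iptables", "-I", "INPUT", "-s", class_c_block(ip), "-j", "DROP"]


def rules_for_requests(requests):
    # requests: iterable of (ip, user_agent) pairs, e.g. parsed from a log.
    # Emits one rule per /24, skipping blocks that are already covered.
    blocked = set()
    rules = []
    for ip, user_agent in requests:
        if not is_baidu_user_agent(user_agent):
            continue
        block = class_c_block(ip)
        if block in blocked:
            continue
        blocked.add(block)
        rules.append(firewall_rule(ip))
    return rules
```

Since the match is on the user-agent string alone, this has exactly the weakness described above: one request with a spoofed Baidu user-agent gets the sender's whole /24 dropped.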