220.181.26.73 - - [29/Jul/2005:16:52:42 -0400] "GET / HTTP/1.1" 403 670 "-" "sohu-search"
220.181.26.73 - - [29/Jul/2005:16:52:42 -0400] "GET //robots.txt HTTP/1.1" 200 16078 "-" "sohu-search"
I would expect better from one of the top search engines in their market.
Jim
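For anyone who wants to spot this pattern in their own logs, here is a minimal sketch. It assumes the Apache "combined" log format shown in the two entries above; the names `LOG_PATTERN`, `parse_line` and `is_malformed_robots_request` are made up for illustration and are not from any particular tool.

```python
import re

# Assumed layout: Apache "combined" log format, matching the two quoted entries.
LOG_PATTERN = re.compile(
    r'(?P<ip>\S+) \S+ \S+ \[(?P<time>[^\]]+)\] '
    r'"(?P<method>\S+) (?P<path>\S+) (?P<protocol>[^"]+)" '
    r'(?P<status>\d{3}) (?P<size>\S+) '
    r'"(?P<referer>[^"]*)" "(?P<agent>[^"]*)"'
)

def parse_line(line):
    """Return the fields of one log line as a dict, or None if it does not match."""
    match = LOG_PATTERN.match(line)
    return match.groupdict() if match else None

def is_malformed_robots_request(entry):
    """True when the path points at robots.txt but is not exactly /robots.txt."""
    path = entry["path"]
    return path != "/robots.txt" and path.lstrip("/") == "robots.txt"

# The two entries quoted in the post above.
SAMPLE_LINES = [
    '220.181.26.73 - - [29/Jul/2005:16:52:42 -0400] "GET / HTTP/1.1" 403 670 "-" "sohu-search"',
    '220.181.26.73 - - [29/Jul/2005:16:52:42 -0400] "GET //robots.txt HTTP/1.1" 200 16078 "-" "sohu-search"',
]

entries = [parse_line(line) for line in SAMPLE_LINES]
for entry in entries:
    flag = "malformed robots.txt request" if is_malformed_robots_request(entry) else "ok"
    print(entry["ip"], entry["status"], entry["path"], flag)
```

The check only looks at the request path, so it would flag the doubled slash in the second entry but says nothing about the order of the requests (page first, robots.txt second), which is the other complaint here.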
This behaviour is like being forced off a property by the security personnel, and then going back to read the prominent "No Trespassing" sign on the gate.
Really, my main point is that even 'big' search companies often run 'mis-implemented' robots.
Jim
User-agent: *
Disallow: /
Jim
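As a quick sanity check of what the two-line robots.txt above tells a compliant robot, here is a small sketch using Python's standard `urllib.robotparser`. The helper name `is_allowed` and the user-agent strings passed to it are just illustrations.

```python
from urllib.robotparser import RobotFileParser

# The blanket rule quoted above: every robot, every path.
RULES = [
    "User-agent: *",
    "Disallow: /",
]

parser = RobotFileParser()
parser.parse(RULES)  # parse the rules directly instead of fetching a URL

def is_allowed(agent, path):
    """Ask the parser whether the given user-agent may fetch the given path."""
    return parser.can_fetch(agent, path)

for agent, path in [("sohu-search", "/"), ("SomeOtherBot", "/page.html")]:
    print(agent, path, "allowed" if is_allowed(agent, path) else "disallowed")
```

A robot that honours the file should request nothing beyond robots.txt itself. Whether a given robot actually behaves that way is a separate matter, as the log entries earlier in the thread show.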
The AP IP list you provided is still in use on the other site though... I'm not sure why they love it, but apparently that site's IP address used to belong to a site that they were very interested in...
Jim
netname: CNCGROUP-BJ
descr: CNCGROUP Beijing province network
descr: China Network Communications Group Corporation
descr: No.156,Fu-Xing-Men-Nei Street,
descr: Beijing 100031
country: CN
220.181.26.73 is assigned to:
netname: CHINANET-IDC-BJ
country: CN
descr: CHINANET Beijing province network
descr: China Telecom
descr: No.31,jingrong street
descr: Beijing 100032
As far as I recall, I haven't seen malformed requests for robots.txt.