Forum Moderators: DixonJones

Message Too Old, No Replies

"google", not "googlebot"

         

SlowMove

2:58 pm on May 9, 2004 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member



robot from 61.135.131.*** is craling my site. It just identifies itself as "google"

<added>it does check for robots.txt</added>

Stefan

3:24 pm on May 9, 2004 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member



61.135.131.xx - - [09/May/2004:09:06:18 -0600] "GET / HTTP/1.1" 200 3435 "-" "google"

Yeah, I have it too. It isn't Google.

inetnum: 61.135.0.X - 61.135.255.XXX
country: CN

[edited by: webdiversity at 8:28 pm (utc) on May 9, 2004]
[edit reason] No specific details [/edit]

Yidaki

3:35 pm on May 9, 2004 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member



eMail harvester - i have most of china banned.

dcrombie

3:12 pm on May 10, 2004 (gmt 0)



I agree with Yidaki - I've had that one blocked for months.

mars9820

2:55 pm on May 11, 2004 (gmt 0)

10+ Year Member



China blocks half of the world and the rest of the webmasters are blocking China :)

I see a partnership here. Anyone?

By the way I saw some code going over in some chinese usegroups about webcrawlers/emailharvesters that access google cache instead of the real website to gather their information in the event the page is down/not responding.

I don't know if it is working or what they were trying to do. However it looked quite smart.

volatilegx

3:53 pm on May 12, 2004 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member



China blocks half of the world and the rest of the webmasters are blocking China

the Great Firewall of China? :P