msnbot coming through Google IP address

Now that's weird.

         

volatilegx

2:21 pm on Mar 10, 2005 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member



UA: "msnbot/1.0 (+http://search.msn.com/msnbot.htm),gzip(gfe) (via translate.google.com)"

IP: 216.239.36.136

ARIN whois:

OrgName: Google Inc.
OrgID: GOGL
Address: 2400 E. Bayshore Parkway
City: Mountain View
StateProv: CA
PostalCode: 94043
Country: US

NetRange: 216.239.32.0 - 216.239.63.255
CIDR: 216.239.32.0/19
NetName: GOOGLE
NetHandle: NET-216-239-32-0-1
Parent: NET-216-0-0-0-0
NetType: Direct Allocation
NameServer: NS1.GOOGLE.COM
NameServer: NS2.GOOGLE.COM
NameServer: NS3.GOOGLE.COM
NameServer: NS4.GOOGLE.COM
Comment:
RegDate: 2000-11-22
Updated: 2001-05-11

TechHandle: ZG39-ARIN
TechName: Google Inc.
TechPhone: +1-650-318-0200
TechEmail: ************@google.com

OrgTechHandle: ZG39-ARIN
OrgTechName: Google Inc.
OrgTechPhone: +1-650-318-0200
OrgTechEmail: ************@google.com
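
For anyone who wants to double-check that the visiting IP really falls inside the Google block from the whois output above, here is a minimal sketch using Python's standard `ipaddress` module. The helper name `ip_in_network` is just an illustration, and the two values are copied from this post.

```python
import ipaddress

# Values copied from the log entry and the ARIN whois output in this post.
GOOGLE_NET = ipaddress.ip_network("216.239.32.0/19")
VISITOR_IP = ipaddress.ip_address("216.239.36.136")


def ip_in_network(ip, network):
    """Return True when the address lies inside the given CIDR block."""
    return ip in network


# First and last addresses of the block, to compare against the NetRange line.
first_addr = str(GOOGLE_NET[0])
last_addr = str(GOOGLE_NET[-1])

print(first_addr, "-", last_addr)
print(ip_in_network(VISITOR_IP, GOOGLE_NET))
```

This only confirms the request came from an address in Google's allocation; it says nothing about who was behind the request.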

idoc

1:52 am on Mar 11, 2005 (gmt 0)

10+ Year Member



Maybe the msnbot user agent is forged, and the user was going through Google Translate with the msnbot UA to try to detect cloaking? Or maybe somebody put a crafted URL into a link so that msnbot would spider your page via Google Translate?

volatilegx

4:32 pm on Mar 11, 2005 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member



gzip(gfe)

I wonder what that means?

idoc

5:36 pm on Mar 11, 2005 (gmt 0)

10+ Year Member



The gzip part is file compression to save bandwidth, which Apache can do as well. Hope the link is OK with everybody.

[httpd.apache.org...]

encyclo

6:12 pm on Mar 11, 2005 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member



I suppose it is a standard translate.google.com visit, but as [translate.google.com...] disallows everything, it can't (or shouldn't) be the real MSNBot. I guess that idoc's idea is the most likely: someone is spoofing their user agent and connecting through the translation service to try to get your cloaked code. Were any scripts or CSS files related to the page grabbed as well?
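
If you want to flag visits like this in your logs, one rough approach is to split the user agent string into the part the client claimed and the part that appears to have been appended along the way. The sketch below assumes the `(via ...)` and `,gzip(gfe)` pieces are suffixes added after the original UA, which is a guess based on the single log line in this thread; the function name `split_proxy_suffix` is made up for the example.

```python
import re

# User agent string exactly as quoted in the first post.
UA = ('msnbot/1.0 (+http://search.msn.com/msnbot.htm),gzip(gfe) '
      '(via translate.google.com)')

# Matches a trailing "(via some.host)" annotation.
VIA_RE = re.compile(r"\(via ([^)\s]+)\)\s*$")

# Assumption: this token is appended after the original UA, not part of it.
GZIP_SUFFIX = ",gzip(gfe)"


def split_proxy_suffix(user_agent):
    """Return (claimed_agent, via_host).

    via_host is None when the string has no trailing "(via ...)" part.
    """
    via_host = None
    match = VIA_RE.search(user_agent)
    if match:
        via_host = match.group(1)
        user_agent = user_agent[:match.start()].rstrip()
    if user_agent.endswith(GZIP_SUFFIX):
        user_agent = user_agent[:-len(GZIP_SUFFIX)]
    return user_agent, via_host


claimed, via = split_proxy_suffix(UA)
print(claimed)
print(via)
```

A claimed agent of msnbot combined with a non-empty `via` host would be a reasonable signal that the request is not a direct crawl, though the UA alone cannot prove who sent it.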