homepage Welcome to WebmasterWorld Guest from
register, free tools, login, search, pro membership, help, library, announcements, recent posts, open posts,
Become a Pro Member

Visit PubCon.com
Home / Forums Index / Search Engines / Search Engine Spider and User Agent Identification
Forum Library, Charter, Moderators: Ocean10000 & incrediBILL

Search Engine Spider and User Agent Identification Forum

msnbot/1.0 from CN
real or fake?

 5:48 pm on Oct 17, 2006 (gmt 0) and
(no ptr records)
UA "msnbot/1.0 (+http://search.msn.com/msnbot.htm)"
no referer
reads robots.txt and default main page

IP belongs to CHINANET beijing province network



 7:24 pm on Oct 18, 2006 (gmt 0)

Unfortunately no answers but the same question.

Saw something similar from:

Same UA and no ptr.


 8:01 pm on Oct 18, 2006 (gmt 0)

I have started seeing MSNBOT/1.0 returning a hostname like the following as of this last Monday. -->
livebot-207-46-98-73.search.live.com -->

Which seems to be following Googles lead by using dns lookups as a way to identify the real bot verses someone trying to spoof them.

This is only for MSNBOT/1.0 so far not the MSNBOT-media and other variants from Microsoft.

MSNBOT include an extra Header "From" with a value "msnbot(at)microsoft.com" which can also help weed out spoofers.


 2:05 am on Oct 25, 2006 (gmt 0)

I just seen this on one of my sites on this ip
everything else was identical.

Only checked Robots.txt and left (which it was baned) so it seems to be well behaved.


 1:48 pm on Oct 25, 2006 (gmt 0)

Although not entirely on-topic my inquiry for the same bot.

I seem to recall that 1.0 was the MS image bot?

As previously mentioned in another thread
( [webmasterworld.com...] ), I have severe problems of communication with the many MS bots.

Yesterday, I have 1.0 crawling with the same UA from three entirely different ARIN ranges. (denied)

previous days (APNIC denied)

msn-dude informed us a while back that MS was sorting out their bot identities and UA's however these inconsistecies seem to just add more confusion.

Global Options:
 top home search open messages active posts  

Home / Forums Index / Search Engines / Search Engine Spider and User Agent Identification
rss feed

All trademarks and copyrights held by respective owners. Member comments are owned by the poster.
Home ¦ Free Tools ¦ Terms of Service ¦ Privacy Policy ¦ Report Problem ¦ About ¦ Library ¦ Newsletter
WebmasterWorld is a Developer Shed Community owned by Jim Boykin.
© Webmaster World 1996-2014 all rights reserved