Is this bot friendly or not?

Relative newbie to website security and on a steep learning curve. Thanks in advance for any advice - I've learnt a lot here so far.

I have the following bot turning up regularly in my logs and am trying to work out whether it is a good or a bad one. It doesn't fall into my bot traps, but some of its requests seem unusual.
A WHOIS lookup on its IP address resolves to Microsoft Corp.

Here are a couple of access/error log entries:

65.55.212.*** - - [27/Oct/2007:08:33:53 +0100] "GET /%7Edomainname/rss.xml HTTP/1.0" 404 1232 "-" "msnbot-media/1.0 (+http://search.msn.com/msnbot.htm)"

[Sat Oct 27 08:33:53 2007] [error] [client 65.55.212.***] File does not exist: /var/www/vhosts/domainname.org.uk/web_users/domainname

Would a legitimate spider be looking for a non-existent directory called web_users? Or at least for one that isn't in any search index and is an Apache thing rather than something visible via ordinary links?
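
For what it's worth, decoding the request line suggests the bot never asked for web_users by name. A minimal sketch, assuming the server maps /~user/ URLs onto the web_users directory (that mapping is a guess based on the error log path above, and userdir_target is a hypothetical helper, not anything Apache provides):

```python
from urllib.parse import unquote

# Request path copied from the access log entry above.
requested = "/%7Edomainname/rss.xml"

# %7E is the URL-encoded form of "~".
decoded = unquote(requested)
print(decoded)

def userdir_target(path, base="/var/www/vhosts/domainname.org.uk/web_users"):
    """Hypothetical helper: map /~user/rest onto base/user/rest.

    The base directory is taken from the error log entry; the mapping
    itself is an assumption about how this server is configured.
    """
    if not path.startswith("/~"):
        return None
    user, _, rest = path[2:].partition("/")
    return f"{base}/{user}/{rest}" if rest else f"{base}/{user}"

print(userdir_target(decoded))
```

If that assumption holds, the web_users path in the error log comes from the server translating /~domainname/, not from anything the bot typed.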

I've checked MSN Search for entries relating to my domain, and there is an enormous amount of content still in their index that has been denied in robots.txt for several weeks. I can't see any tools to get it deleted (whereas Yahoo and Google do have such tools).

The WHOIS says:
OrgName: Microsoft Corp
OrgID: MSFT
Address: One Microsoft Way
City: Redmond
StateProv: WA
PostalCode: 98052
Country: US

NetRange: 65.52.0.0 - 65.55.255.255
CIDR: 65.52.0.0/14
NetName: MICROSOFT-1BLK
NetHandle: NET-65-52-0-0-1
Parent: NET-65-0-0-0-0
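
As a sanity check on the WHOIS output, here is a small sketch that confirms the logged address falls inside the listed block. Because the last octet is masked in the log, it tests the whole 65.55.212.0/24 subnet rather than a single address; the variable names are my own.

```python
import ipaddress

# CIDR block from the WHOIS record above.
msft_block = ipaddress.ip_network("65.52.0.0/14")

# The log masks the last octet, so cover every possible value of it.
client_subnet = ipaddress.ip_network("65.55.212.0/24")

# True if every address in client_subnet lies inside msft_block.
in_block = client_subnet.subnet_of(msft_block)
print(in_block)

# First and last address of the block, to compare with the NetRange line.
print(msft_block[0], "-", msft_block[-1])
```

This only shows that the address belongs to the block registered to Microsoft. It says nothing about whether the requests themselves are well behaved.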


