Forum Moderators: open
131.253.41.45 - - [26/Jun/2012:06:20:22 -0700] "GET /robots.txt HTTP/1.1" 200 533 "-" "msnbot-media/1.1 (+http://search.msn.com/msnbot.htm)"
131.253.41.45 - - [26/Jun/2012:06:20:22 -0700] "GET /hovercraft/images/kabloona.jpg HTTP/1.1" 200 44328 "-" "msnbot-media/1.1 (+http://search.msn.com/msnbot.htm)"
131.253.41.45 - - [26/Jun/2012:06:20:22 -0700] "GET /hovercraft/caribou.html HTTP/1.1" 200 10970 "-" "msnbot-media/1.1 (+http://search.msn.com/msnbot.htm)" 131.253.41.223 - - [26/Jun/2012:07:53:18 -0700] "GET /robots.txt HTTP/1.1" 200 533 "-" "msnbot-media/1.1 (+http://search.msn.com/msnbot.htm)"
131.253.41.223 - - [26/Jun/2012:07:53:18 -0700] "GET /hovercraft/images/yesno.jpg HTTP/1.1" 200 38878 "-" "msnbot-media/1.1 (+http://search.msn.com/msnbot.htm)"
131.253.41.223 - - [26/Jun/2012:07:53:19 -0700] "GET /hovercraft/caribou.html HTTP/1.1" 200 10970 "-" "msnbot-media/1.1 (+http://search.msn.com/msnbot.htm)" 207.46.199.163 - - [26/Jun/2012:08:50:38 -0700] "GET /robots.txt HTTP/1.1" 200 533 "-" "msnbot-media/1.1 (+http://search.msn.com/msnbot.htm)"
207.46.199.163 - - [26/Jun/2012:08:50:38 -0700] "GET /images/perez.jpg HTTP/1.1" 200 5781 "-" "msnbot-media/1.1 (+http://search.msn.com/msnbot.htm)"
207.46.199.163 - - [26/Jun/2012:08:50:38 -0700] "GET / HTTP/1.1" 200 2180 "-" "msnbot-media/1.1 (+http://search.msn.com/msnbot.htm)" 131.253.36.202 - - [07/Dec/2012:11:31:28 -0800] "GET /fonts/naamajut.html HTTP/1.1" 200 4774 "-" "Mozilla/4.0 (compatible; MSIE 7.0; Windows NT 5.2; SLCC1; .NET CLR 1.1.4325; .NET CLR 2.0.50727)"
131.253.36.202 - - [07/Dec/2012:11:31:29 -0800] "GET /piwik/piwik.js HTTP/1.1" 200 21927 "http://www.example.com/fonts/naamajut.html" {same}
131.253.36.202 - - [07/Dec/2012:11:31:30 -0800] "GET /sharedstyles.css HTTP/1.1" 200 2984 {et cetera}
131.253.36.206 - - [07/Dec/2012:11:31:31 -0800] "GET /fonts/fontstyles.css HTTP/1.1" 200 3191 {et cetera}
131.253.36.205 - - [07/Dec/2012:11:31:33 -0800] "GET /piwik/piwik.php?action_name=Naamajut& {et cetera} 131.253.26.244 - - [07/Dec/2012:13:08:06 -0800] "GET /fonts/legacy.html HTTP/1.1" 200 10247 "-" "Mozilla/4.0 (compatible; MSIE 7.0; Windows NT 5.2; SV1; .NET CLR 1.1.4322; .NET CLR 2.0.40607)"
131.253.26.244 - - [07/Dec/2012:13:08:06 -0800] "GET /piwik/piwik.js HTTP/1.1" 200 21927 "http://www.example.com/fonts/legacy.html" {same}
131.253.26.244 - - [07/Dec/2012:13:08:07 -0800] "GET /sharedstyles.css HTTP/1.1" 200 2984 {et cetera}
131.253.26.244 - - [07/Dec/2012:13:08:07 -0800] "GET /fonts/fontstyles.css HTTP/1.1" 200 3190 {et cetera}
65.55.212.65 - - [07/Dec/2012:13:08:13 -0800] "GET /piwik/piwik.php?action_name=Legacy%20Fonts& {et cetera} &res=800x600 It turns out that the traffic you're seeing isn't really the MSNBot search indexer - it's Bing Translator (AKA Microsoft Translator / Windows Live Translator).
If a user crawls your site and then translates the page into their local language through this tool then you will see the request coming from a 65.55 IP address which MAY (not always) reverse DNS to say "msnbot". However it's a real human requesting this page, and you should not really attempt to block it unless "msnbot" is in the user-agent string.
The translate server is proxying the request and you will therefore see the user's user-agent string - not the MSNBOT one.
It seems microsoft are repurposing IP addresses and not updating the reverse DNS names for them, so many translate server IP addresses reverse lookup to a MSN bot address.
There are several different ways of using their service - you can use Page > Translate with Live Search in IE8, or from the Windows Live Toolbar, or you can click "translate this page" from a bing.com search result screen.
Yet another reason to block all translators!
I've made non-English versions of my two most commonly translated pages... it basically means cutting each page's search numbers in half
Maybe you're just noticing the absence of those translation crawls for the two pages you made?
Give each language it's own unique address or put it in a sub directory, linked from the main page, then disallow indexing. Shouldn't change the SERP that way.
The only msnbot-media robots.txt request that fail are the failed headers checks