| 4:48 pm on Jun 11, 2009 (gmt 0)|
A two-letter word beginning with "N" and ending with "O" springs to mind.
| 9:09 pm on Jun 11, 2009 (gmt 0)|
It's very tempting to use those two letters to msn/live/bing/choose_next_name. I'm in the process of setting up a new logs anaylser and am about to add a rule to eliminate all the q=rubbish spam. I could do without that time-waster!
| 3:00 am on Jun 12, 2009 (gmt 0)|
And now a hit with no UA at all. Completely blank header. Guess what it got?
| 5:48 am on Jun 12, 2009 (gmt 0)|
Despite getting this robots.txt --
-- "adidxbot" went straight for a subdir content page. Whereupon it was 403'd because only "msnbot" bots are allowed (and only in certain dirs, and maybe not any for very much longer).
| 12:42 pm on Jun 12, 2009 (gmt 0)|
Yes I saw that too (UA adidxbot), but its rdns resolves to msnbot so I do not block it or anything. As of the robots.txt I don't rely on. You can send spiders right into traps with this, so for me is not a good way to do verification.
| 5:02 am on Jun 14, 2009 (gmt 0)|
|"adidxbot" went straight for a subdir content page. |
Same here this past week. It read robots.txt three times in a row and then took whatever it wanted. It was a small site though so it was gone before my bot blockers could kick in.
Different IP Address though:
adidxbot/1.1 ( [search.msn.com...]
| 10:13 pm on Jun 15, 2009 (gmt 0)|
Just got three hits from three MSN IPs with no rDNS and a UA of msnbot-webmaster.
All IPs were in the range: 22.214.171.124 - 126.96.36.199
UA: msnbot-webmaster/1.0 (+http://search.msn.com/msnbot.htm)
Robots: No idea, haven't checked.
It's possible this was in response to the site owner, who told me she was trying to get her site into bing local (probably on a loser: we're in the UK). Still doesn't mean they can come in like that.
| 12:30 am on Jun 16, 2009 (gmt 0)|
|UA: msnbot-webmaster/1.0 (+http://search.msn.com/msnbot.htm) |
It's my understanding this is their webmaster tools bot. It's been around since at least August 2008.
| 10:37 pm on Jun 16, 2009 (gmt 0)|
Then they could at least sort out the headers and get a proper rDNS for it.
In fact it was the site owner trying to get her site verified. It said the site was not valid. Actually, I KNOW the site is valid; whether msnbot can SEE it is different problem. :)
| 11:40 pm on Jun 19, 2009 (gmt 0)|
The adidxbot/1.1 (+http://search.msn.com/msnbot.htm) bot drove us crazy for the last week or so. It was crawling every SEM keyword URL we had configured in our MSN account. Sort of a "Welcome to Bing" I guess.
It was crawling at 25 pages/second so we bumped the crawl rate down to 1 but it took them about 8 hours to change their rate. What's really odd is that the adidxbot never actually read our robots.txt file! So it must get the info from another bot (MSNBot?). All the IPs were unique (i.e. they weren't the same as any other MSNBot). We still aren't sure what Microsoft is doing with this Bot. Anyone know?
| 6:54 pm on Jul 6, 2009 (gmt 0)|
This is our bot. We are currently working to fix the over-crawl issues you are experiencing. I'll update this thread as soon as I have more information.