Imaster

msg:1539872 | 6:08 am on May 6, 2004 (gmt 0) |
Its been hitting hard on my sites as well. In some cases, even higher than googlebot. I believe it's an experimental project which they plan to launch few months down the line, which could easily postpone to much more time than that. Useful Links: MSN - MSNBot [search.msn.com] Ballmer: MSN Search - Coming Next Year [webmasterworld.com] Google Results for 'MSNBOT' on WW [google.com]
|
uncle_bob

msg:1539873 | 9:09 am on May 6, 2004 (gmt 0) |
I've got msnbot deep searching a new site within days of it going live. Googlebot it still just getting the startpage .
|
pontifex

msg:1539874 | 12:21 pm on May 6, 2004 (gmt 0) |
This thing is much uglier behaving than G! I just opened a trouble ticket with my provider, because MSNBOT hammered the machine to death. For the next few hours 65.54.164.* is blocked! P!
|
jp_css

msg:1539875 | 2:21 pm on May 7, 2004 (gmt 0) |
Yeah, it was really hitting my site very quickly, but slowed down to a crawl now.
|
MikeBeverley

msg:1539876 | 8:37 pm on May 15, 2004 (gmt 0) |
Anyone seen this guy: msnbot/0.11 (+http://search.msn.com/msnbot.htm) he has hit my server 28,411 times in the past 14 days. That's alot of server load but I don't want to ban him as I'd like my site included in the Microsoft search engine when it comes out. Anyone know how to slow him down?
|
mars9820

msg:1539877 | 11:18 pm on May 15, 2004 (gmt 0) |
[search.msn.com...] | How often will MSNBot access a page from my web server? In general MSNBot should not try to access your site more than once every few seconds. MSNBot will also account for the time it takes to download a page from a site so tht if your site has a slower connection we will not access it as frequently. If you find that we are placing too high a load on your site please let us know by sending us e-mail at msnbot@microsoft.com. |
|
|
erthlng

msg:1539878 | 11:05 pm on May 18, 2004 (gmt 0) |
Seeing similar high activity over the past week. About 12,000 pages a day getting crawled.
|
Harry

msg:1539879 | 10:55 am on May 19, 2004 (gmt 0) |
And it cost me 2.28 gig in one day and used up my allowed bandwidth for the month.
|
dwilson

msg:1539880 | 11:01 am on May 19, 2004 (gmt 0) |
Those having this problem ... are you using dynamically generated pages? with?id=####? I wonder what similarities there are between sites w/ the difficulty. Perhaps something makes MSNBot think there are more pages than there are. Here's an idea ... If page.asp?item=1234&last=5432 has a link to page.asp?item=8897&last=1234 etc. ... then a nearly infinite combination of URL's would exist. Are your sites set up anything like that?
|
Harry

msg:1539881 | 11:30 am on May 19, 2004 (gmt 0) |
Except the message boards and the shopping cart, everything is plain html
|
caine

msg:1539882 | 9:50 am on May 20, 2004 (gmt 0) |
MSN is the biggest crawler in my logs at the moment - outdoing everyone together!
|
helenp

msg:1539883 | 12:48 pm on May 20, 2004 (gmt 0) |
Today seems have stopped spidering that heavily,
|
jasontromm

msg:1539884 | 7:15 pm on Jun 26, 2004 (gmt 0) |
I thought I had banned msnbot from my site, but it's come back. I put the following lines in my robots.txt file: User-agent: msnbot* Disallow: * And I even banned the IP address it seems to be coming from in my .htaccess file: deny from 65.54.164. What else can I do? It's hogging up my bandwidth and I don't care to be in Microsoft's directory. (I hate Microsoft in general) The majority of the world uses Google and people can find trommetter.com that way.
|
Oliver341

msg:1539885 | 9:46 pm on Jul 19, 2004 (gmt 0) |
Old thread, just came across it... | I thought I had banned msnbot from my site, but it's come back. I put the following lines in my robots.txt file: User-agent: msnbot* Disallow: * |
| Your syntax is all wrong. Use:
User-agent: msnbot Disallow: /
|
Receptional

msg:1539886 | 6:48 pm on Jul 21, 2004 (gmt 0) |
| Anyone know how to slow him down? |
| Absolutely - MSN built a special robots command "delay" into their msnbot for this purpose - launched in this very forum a couple of weeks ago :) [webmasterworld.com...] Dixon.
|
|