Welcome to WebmasterWorld Guest from 54.167.5.15

Forum Moderators: mack

Message Too Old, No Replies

MSN bot hitting 1000s of pages on my site

New to this forum. Is this just a test for MSN?

     

jp_css

2:28 am on May 6, 2004 (gmt 0)

10+ Year Member



MSN bot has been hitting my site on and off until now I have found that it has been indexing through over 1000 pages. Is MSN bot only an experiment for now, or is it something more?

Imaster

6:08 am on May 6, 2004 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member



Its been hitting hard on my sites as well. In some cases, even higher than googlebot.

I believe it's an experimental project which they plan to launch few months down the line, which could easily postpone to much more time than that.

Useful Links:
MSN - MSNBot [search.msn.com]
Ballmer: MSN Search - Coming Next Year [webmasterworld.com]
Google Results for 'MSNBOT' on WW [google.com]

uncle_bob

9:09 am on May 6, 2004 (gmt 0)

10+ Year Member



I've got msnbot deep searching a new site within days of it going live. Googlebot it still just getting the startpage .

pontifex

12:21 pm on May 6, 2004 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member



This thing is much uglier behaving than G!
I just opened a trouble ticket with my provider, because MSNBOT hammered the machine to death.
For the next few hours 65.54.164.* is blocked!
P!

jp_css

2:21 pm on May 7, 2004 (gmt 0)

10+ Year Member



Yeah, it was really hitting my site very quickly, but slowed down to a crawl now.

MikeBeverley

8:37 pm on May 15, 2004 (gmt 0)

10+ Year Member



Anyone seen this guy:

msnbot/0.11 (+http://search.msn.com/msnbot.htm)

he has hit my server 28,411 times in the past 14 days. That's alot of server load but I don't want to ban him as I'd like my site included in the Microsoft search engine when it comes out. Anyone know how to slow him down?

mars9820

11:18 pm on May 15, 2004 (gmt 0)

10+ Year Member



[search.msn.com...]

How often will MSNBot access a page from my web server?

In general MSNBot should not try to access your site more than once every few seconds. MSNBot will also account for the time it takes to download a page from a site so tht if your site has a slower connection we will not access it as frequently. If you find that we are placing too high a load on your site please let us know by sending us e-mail at msnbot@microsoft.com.

erthlng

11:05 pm on May 18, 2004 (gmt 0)

10+ Year Member



Seeing similar high activity over the past week. About 12,000 pages a day getting crawled.

Harry

10:55 am on May 19, 2004 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member



And it cost me 2.28 gig in one day and used up my allowed bandwidth for the month.

dwilson

11:01 am on May 19, 2004 (gmt 0)

10+ Year Member



Those having this problem ... are you using dynamically generated pages? with?id=####?

I wonder what similarities there are between sites w/ the difficulty. Perhaps something makes MSNBot think there are more pages than there are. Here's an idea ...

If page.asp?item=1234&last=5432 has a link to page.asp?item=8897&last=1234 etc. ... then a nearly infinite combination of URL's would exist.

Are your sites set up anything like that?

Harry

11:30 am on May 19, 2004 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member



Except the message boards and the shopping cart, everything is plain html

caine

9:50 am on May 20, 2004 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member



MSN is the biggest crawler in my logs at the moment - outdoing everyone together!

helenp

12:48 pm on May 20, 2004 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member



Today seems have stopped spidering that heavily,

jasontromm

7:15 pm on Jun 26, 2004 (gmt 0)



I thought I had banned msnbot from my site, but it's come back.

I put the following lines in my robots.txt file:

User-agent: msnbot*
Disallow: *

And I even banned the IP address it seems to be coming from in my .htaccess file:

deny from 65.54.164.

What else can I do? It's hogging up my bandwidth and I don't care to be in Microsoft's directory. (I hate Microsoft in general) The majority of the world uses Google and people can find trommetter.com that way.

Oliver341

9:46 pm on Jul 19, 2004 (gmt 0)

10+ Year Member



Old thread, just came across it...

I thought I had banned msnbot from my site, but it's come back.

I put the following lines in my robots.txt file:

User-agent: msnbot*
Disallow: *


Your syntax is all wrong.

Use:


User-agent: msnbot
Disallow: /

Receptional

6:48 pm on Jul 21, 2004 (gmt 0)



Anyone know how to slow him down?

Absolutely - MSN built a special robots command "delay" into their msnbot for this purpose - launched in this very forum a couple of weeks ago :)

[webmasterworld.com...]

Dixon.

 

Featured Threads

Hot Threads This Week

Hot Threads This Month