YahooSeeker is crashing my site

Any way to stop this without blocking it?

         

itisgene

3:42 am on Mar 24, 2004 (gmt 0)

10+ Year Member



I got hundreds of Slurp visits to some of my database-driven forums, and I had to restart the forums because of the crashes.
Is there any way I can prevent this without using robots.txt? The forum uses a single main file to serve every page, say widgets.asp, with all the parameters after the question mark (widgets.asp?forumID=1&category=7).
So if I block widgets.asp in robots.txt, it will block the whole forum, including the main page.

Is this YahooSeeker different from normal Slurp?

We rank #3 to #8 for two-word keywords with these forums on Yahoo and MSN, and I don't want to lose those listings.

Any advice?
Thanks,

Server Variables:
-----------------
ALL_HTTP = HTTP_ACCEPT:*/* HTTP_HOST:www.MYSITE.com HTTP_USER_AGENT:YahooSeeker/1.1 (compatible; Mozilla 4.0; MSIE 5.5;
htt*://help.yahoo.com/help/us/shop/merchant/) HTTP_IF_MODIFIED_SINCE:Sat, 21 Feb 2004 18:15:16 GMT HTTP_ACCEPT_ENCODING:gzip, x-gzip HTTP_FROM:slurp@inktomi.com
ALL_RAW = Accept: */*
Host: www.MYSITE.com
User-Agent: YahooSeeker/1.1 (compatible; Mozilla 4.0; MSIE 5.5; htt*://help.yahoo.com/help/us/shop/merchant/)
If-Modified-Since: Sat, 21 Feb 2004 18:15:16 GMT
Accept-Encoding: gzip, x-gzip
From: slurp@inktomi.com

itisgene

3:51 am on Mar 24, 2004 (gmt 0)

10+ Year Member



Wow,
I just looked at the Yahoo Shopping area, and Yahoo Shopping is listing our terms of service for sale at $7.96!

We had an advertisement from another website on our sites, and Yahoo Shopping (YahooSeeker) thinks we are selling something at $7.96.

Can I just block YahooSeeker and not the normal Slurp?

gkrishnan

2:13 pm on Mar 24, 2004 (gmt 0)



hi,

YahooSeeker is the user-agent used for crawling
only for Yahoo's vertical searches, such as shopping.
Blocking YahooSeeker will not affect your
web search rankings. In that sense, YahooSeeker
is different from Slurp.

[help.yahoo.com...]

To prevent YahooSeeker specifically from crawling your site, you can add these lines to the /robots.txt file on your server:

User-agent: YahooSeeker
Disallow: /
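
If you want to sanity-check the rule before deploying it, here is a minimal sketch using Python's standard urllib.robotparser. The host www.example.com and the helper name allowed are assumptions made up for illustration, and this only shows how Python's parser reads the rule; the real crawlers may interpret robots.txt slightly differently.

```python
from urllib.robotparser import RobotFileParser

# The two-line rule quoted above, as it would appear in /robots.txt.
ROBOTS_TXT = """\
User-agent: YahooSeeker
Disallow: /
"""

# Hypothetical forum URL; www.example.com stands in for the real host.
FORUM_URL = "http://www.example.com/widgets.asp?forumID=1&category=7"


def allowed(user_agent: str, url: str, robots_txt: str = ROBOTS_TXT) -> bool:
    """Return True if robots_txt permits user_agent to fetch url."""
    parser = RobotFileParser()
    parser.parse(robots_txt.splitlines())
    return parser.can_fetch(user_agent, url)


if __name__ == "__main__":
    # Compare the shopping crawler with the regular web search crawler.
    for agent in ("YahooSeeker", "Slurp"):
        print(agent, "allowed" if allowed(agent, FORUM_URL) else "blocked")
```

Since the file has no rule for Slurp and no wildcard entry, only YahooSeeker should come back as blocked, which matches the point that the regular web search crawler is not affected.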

Rumbas

2:44 pm on Mar 24, 2004 (gmt 0)

WebmasterWorld Administrator 10+ Year Member Top Contributors Of The Month



In Tim's Pubcon presentation he mentioned a robots.txt syntax you could use to slow the regular crawler down. I found that kind of neat, but I don't remember the exact robots.txt command.

Does anybody remember the correct syntax?

It might work on YahooSeeker too.

bcolflesh

2:49 pm on Mar 24, 2004 (gmt 0)

itisgene

7:35 pm on Mar 24, 2004 (gmt 0)

10+ Year Member



Thanks, guys.
I will use robots.txt

User-agent: YahooSeeker
Disallow: /

as gkrishnan suggested.

We don't sell anything, so there is nothing to lose.
We just need to stay listed in Yahoo's natural results.

Thanks,