Forum Moderators: open

Message Too Old, No Replies

spiders agent names

         

bignooz

10:31 am on Apr 11, 2003 (gmt 0)

10+ Year Member



If someone knows where I can find the full list of SE spiders agent names?
Many thanks.

Bignooz

pendanticist

2:37 pm on Apr 11, 2003 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member



Well, there's Search engine robots [jafsoft.com], Robot Watch [philsearch.de] (a bit slow loading today) and finally Search Engine Spiders Crawlers and Indexers [searchengineworld.com].

The other one http*//4webhelp.com/spiders/spidersl.shtml is dead.

Pendanticist.

Ralphonso

4:43 pm on Apr 11, 2003 (gmt 0)



All the good bots should be here:

[robotstxt.org...]

wilderness

5:28 pm on Apr 11, 2003 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member Top Contributors Of The Month



http ://www.psychedelix.com/agents.html
http ://www.jafsoft.com/misc/opinion/webbots.html
http ://www.icehousedesigns.com/useragents/
http ://joseluis.pellicer.org/ua/
http ://www.pgts.com.au/pgtsj/pgtsj0208c.html Very slow loading and large

Jaf

12:07 am on Apr 12, 2003 (gmt 0)

10+ Year Member



The jafsoft.com link listed above has moved (and will be redirected) to [jafsoft.com...]

This is, in fact, the same link as that labelled "search engine robots" in an earlier post in this thread.

wilderness

1:34 am on Apr 12, 2003 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member Top Contributors Of The Month



<snip>The jafsoft.com link listed above has moved</snip>

John
I just told you the other day in usenet that I hadn't used you pages in a while :(
That explains my incorrect link to your gracious effort.
Keep up the good work.
Don

jphorn

1:26 am on Apr 13, 2003 (gmt 0)

10+ Year Member



Is there a list with recommendations on which user agent/robot/spider/etc to allow or to disallow? The psychelix and jafsoft ones are very elaborate, but I wish there was an extra column whether or not to allow the UA.

- djr

pendanticist

1:35 am on Apr 13, 2003 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member



Welcome to WebmasterWorld jphorn [webmasterworld.com] :)

The Perfect .htaccess [webmasterworld.com] has some valuable information you might wish to peruse.

Pendanticist.

jphorn

1:01 pm on Apr 13, 2003 (gmt 0)

10+ Year Member



Hi Pendanticist,

Thanks for the welcome, although I've been lurking around for a few months ;)
I finally implemented the (almost perfect) .htaccess from the thread you mentioned. However, does this mean all bots not mentioned in that list are good bots?

- jp

pendanticist

1:44 pm on Apr 13, 2003 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member



However, does this mean all bots not mentioned in that list are good bots?

No. It only means that historically, those bots (listed) are known to have performed rather badly by stealing content, bandwidth hogging and etc. New ones pop-up all the time.

Have you ever seen snippet posts where a poster mentions a 'new' bot having scarfed their content and a few other posters chimming in? You can bet that in the background, there is a flurry of other webmasters/admins scurrying to add that particular one to their ban list before it steals their content.

(That list was ammended and tweaked many times to provide a good base from which you can add to, as dictated by your log files.)

Lastly, the range of bots in that list is broad. Some are image thieves that are of particular importance to site owners who have image rich content, while other non-image intensive sites may not be too concerned.

So, the significance of that ban list varies from site-to-site.

>Thanks for the welcome

My pleasure. :)

Pendanticist.

bignooz

10:37 am on Apr 14, 2003 (gmt 0)

10+ Year Member



Thanks to all, it helps a lot ;)