Forum Moderators: open

Message Too Old, No Replies

Longest UserAgent

why-dont-u-you-just-go-away-and-crawl-somewhere-else?

         

Josk

9:35 am on Jan 31, 2002 (gmt 0)

10+ Year Member



IP: 193.206.140.36
UA: parallelContextFocusCrawler1.1parallelContextFocusCrawler1.1

TallTroll

9:40 am on Jan 31, 2002 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member



In a diametrically opposed vein, I saw one with a UA of "Jack" the other day. Am now on the lookout for "Bob"

Josk

10:06 am on Jan 31, 2002 (gmt 0)

10+ Year Member



how spiders that don't set ua's?

volatilegx

4:38 pm on Jan 31, 2002 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member



How bout "Al"

wilderness

10:07 pm on Jan 31, 2002 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member Top Contributors Of The Month



"Jack"
of the UK is quite an honorable bot and search engine.

They do NOT take requests or submissions and it is a very detail oriented search engine.

Too bad I didn't find out the same until I denied Jack.
He hasn't returned since.

I have the URL somewhere.
If I find it I'll add it back later
Some times we can be too over bearing :-(

littleman

10:21 pm on Jan 31, 2002 (gmt 0)



There is a massive amount of UA pollution out on the web these days. I think it will continue to get worse as spiders become more common place. Also. browsers such as K-meleon [kmeleon.sourceforge.net] allow you to set the ua to anything you want.

brotherhood of LAN

10:37 pm on Jan 31, 2002 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member Top Contributors Of The Month



Can someone enlighten me about "UA"?

littleman

10:56 pm on Jan 31, 2002 (gmt 0)



UA = user agent
Typically browsers start with Mozilla.

wilderness

11:09 pm on Jan 31, 2002 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member Top Contributors Of The Month



Jack's FAQ
[domanova.co.uk...]

bird

11:07 pm on Feb 3, 2002 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member



There is a massive amount of UA pollution out on the web these days.

Most of the "minor" bots I see only take robots.txt and those pages that are listed in the ODP. There's a steadily growing number of link checkers out there, which isn't necessarily a bad thing.

wilderness

10:58 am on Feb 4, 2002 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member Top Contributors Of The Month



<snip>There's a steadily growing number of link checkers out there, which isn't necessarily a bad thing.>

Hey bird,
link checkers can be a useful tool when WE apply that use to our own websites.

Somebody else using a link checker to travel my site is IMO not desireable.
If only a link or two that is accepatable to me.

When I see a link checker at the beginning's of traveling my entire sites that offers a different consideration.

IMO extesnive link checking on my sites by another is no diifferent than the many research and other bots which have no clear definition on "intent of use."

bird

2:32 pm on Feb 4, 2002 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member



When I see a link checker at the beginning's of traveling my entire sites...

...then that's not what I call a link checker. The link checkers I was talking about try to make sure that their links really exist and serve a page. For that purpose they have to fetch those of your pages they link to.

Spiders that crawl full sites are a completely different story. I'm not saying that those don't exist, but at least in my stats they are a minority (by total numbers, not necessarily by numbers of accesses).

renewable

1:20 pm on Feb 9, 2002 (gmt 0)