Forum Moderators: open
[webmasterworld.com...]
See this thread and the last two lines of my reply
Anyone?
Also, why would an airline company harass like that?
65.104.122.18
is part of the backbone provider range
XO Communications
NetRange: 65.104.0.0 - 65.107.255.255
and could be anybody.
A simple beginning
[webmasterworld.com...]
LAX. Los Angeles, CA How's Arnie? lolol
ONe more thing, without programming experience, how do I set up a "spider-trap"?
"Anyone, anyone..." {think ben stein}
<off topic> "Join Ahhhnold." 'He vants to fondle Sa-cramen-Do. HE vants to govern for da people.'
Hey,better him than Bustmonte, and no more Davis. Maybe I will be able to afford to stay here afterall. :) </off topic>
wilderness, I see you ban those whose UA
BlueSky,
I don't recall when I began using it. It's been more than a year and in all that time I've pretty much been in agreement with the majority in advising that it should only be done in rare instances (deny blank UA, actually not only blank, includes "-" as well).)
Initially the only real SE bot it affected was Lycos and I didn't get much traffic from them anyway.
Until recently, the instances of an average visitor using a blank UA have been minimum. In most instances, the use of a blank UA is used by what might be termed non-desired visitors, at least as related to MY WEBSITES.
Recently, I've had a rash of HEAD (I've been considering for some time denying head requests) requests from AOL users (AOL users are around 30% of my traffic and subscribers) which contain both a DASH refer and dash UA.
Some other IP's as well.
It might be part of a software the visitors are using, which makes them aware of the UA option or the software may do it automatically. In any event, it's not appropiate for my sites.
I may have to change my outlook on this at some later date.
A while back I trialed denial of "Digital Ext."
Later relizing that somebody who had previously marked their browser to store pages for offline viewing and instituted automatic updates could have done so without being aware of the long range capability of what they were doing. In addition, I realize that a visitor would have to go through their accumulated Favorites/Bookmarks and change each one back to the default setting to remove the "Digital Ext."
I removed the deny.
Don
CA How's Arnie?
Rivethed: I don't think there's a good list of bad bot IPs. There is one for UAs which is a work in progress, but it's pretty good already. You can see it here: [webmasterworld.com...] There's a link to a spider-trap script in that thread written in Perl. You can do a search on this site to see if others have encountered nasty guys from the same IPs that are bothering you. There's an awful lot of bots running about. So, it's possible you may be getting some that others haven't seen.
Questionables in my logs, if you care to comment (replace my url wiht example, spaced the ip:
________________
65. 165. 198. 19--[01/Oct/2003:07:22:55+0800]GET /pdf /n03_body-components. pdf HTTP/1.020657577-contype
65. 165. 198. 19--[01/Oct/2003:07:22:55+0800]GET /pdf /n03_body-components. pdf HTTP/1.0200162513http://www .EXAMPLE .com/Mozilla/4.0 (compatible; MSIE 5.0; Windows NT; DigExt)
________________
200. 47. 155. 132--[01/Oct/2003:08:20:01+0800]GET / HTTP/1.120018706-IBSBand
________________
(found flaw in excel, will only open up about 3 hours of logs, I have a l;og file that is almost a gig, covering 8 days, any 'freeware' suggestions?)
[webmasterworld.com...]
but I still would like to know what the third 'hit' is.
contype
Rivethed,
I have plenty of PDF's online.
Currently and only for a few more days, nearly two hundred as part of two annual horse sales.
contype, comes up more than not after the initial non-html content load. Other times it doesn't come up at all.
The PDF plug-in to browsers along with slow internet connection speeds can create some unusual problems for plug-ins.
I've had some browsers load a PDF as many as 15-20 successive times.
My response to this reload and even though the the visitor may not be aware of, or even planned this excess intentionally, is to deny their IP range. They are responsible in any event.
This reaction is extremely overbearing for a webmaster. However as I've noted on many occassions, my websites are an exception from the norm in their very narrow market-desired visitor traffic.