Suddenly, scads of hits to "partial" filenames.

Since April 4, I've seen the strangest things in my logs, on just one site. And I'm stumped as to what to call this oddity other than what I've been calling the suspect visitors:

"Partials."

Not sexy, I know. But that pretty much sums up what are hundreds of hits to truncated -- or "partial" -- filenames.

For example, say I have these files:

/dir1/a01.Alphabet-List.html
/dir2/b02.Alphabetical.html
/dir3/c03.Alphabeta.html
/dir4/d04.Alphanumeric.html

All of the files are stable, the case-sensitive file names and file system haven't changed for five-plus years, neither has the server software, nor the DNS, IP, etc. But suddenly, ISPs via host names and IPs apparently just from the U.S. -- including a well-known federal agency, a national non-profit, a university, a culinary establishment, an international engineering consultancy, a state government, numerous national telcos and cablecos -- using non-Macintosh UAs (Hmmm... an MSIE 6.0; Windows NT 5.1 thing? See P.S.), giving no referer info -- ALL are hitting/erring these kinds of variations:

/dir1/a01.Alphabet-List.ht
/dir2/b02.Alph
/dir3/c03.Alphabeta.h
/dir4/d04.Alphanu

See what I mean by "partials"? The hits are to partial filenames. As of right now, maybe 10 or so files in multiple directories, each with maybe three truncated variations. And each suspect visitor only hits one partial filename one time. Other visitors -- thousands of them -- have no problems whatsoever.

Huh-wha?

At first I though some site coded a lot of links incorrectly. Really incorrectly. But then I realized there are too many hits, to too many filename variations, across too many different directories. And unlike most of my linkees, none of the partials ever come in on or stick around to go to ANY other pages. They hit my custom error page and they're gone.

Drat. So much for the Incoming Links Theory.

So after a week of watching the partials hit and run, I decided to refresh/redirect them to a special IP with instructions on how to e-me for access. I figured I'd get at least a couple of people touching base and then I could ask them from whence they came.

No such luck. Some partials follow the redirect, some don't. And those that do -- no e-mails.

So I'm left to wonder: Did some crawler make a mess of my URLs? Why are all these visitors suddenly hitting on similarly wrong URLs? What in the heck is going on?

Thoughts?

---
P.S.
For you UA sleuths, a sampling. Curiously, all are MSIE 6.0; Windows NT 5.1:

Mozilla/4.0 (compatible; MSIE 6.0; Windows NT 5.1)
Mozilla/4.0 (compatible; MSIE 6.0; Windows NT 5.1; NYU-2002; SV1)
Mozilla/4.0 (compatible; MSIE 6.0; Windows NT 5.1; SV1)
Mozilla/4.0 (compatible; MSIE 6.0; Windows NT 5.1; SV1; .NET CLR 1.1.4322)
Mozilla/4.0 (compatible; MSIE 6.0; Windows NT 5.1; SV1; .NET CLR 1.0.3705)
Mozilla/4.0 (compatible; MSIE 6.0; Windows NT 5.1; SV1; FunWebProducts; (R1 1.5); .NET CLR 1.1.4322)
Mozilla/4.0 (compatible; MSIE 6.0; Windows NT 5.1; FunWebProducts; .NET CLR 1.1.4322; .NET CLR 1.0.3705)
Mozilla/4.0 (compatible; MSIE 6.0; Windows NT 5.1; SV1; (R1 1.5); .NET CLR 1.0.3705; .NET CLR 1.1.4322)
Mozilla/4.0 (compatible; MSIE 6.0; Windows NT 5.1; SV1; .NET CLR 1.1.4322; InfoPath.1)
Mozilla/4.0 (compatible; MSIE 6.0; Windows NT 5.1; SV1; .NET CLR 1.1.4322; .NET CLR 2.0.50727)
Mozilla/4.0 (compatible; MSIE 6.0; Windows NT 5.1; SV1; FunWebProducts; InfoPath.1; .NET CLR 1.1.4322; .NET CLR 2.0.50727)

if (!defined $name{$url}) {
$name{$url} = $url;
if ($url =~ m,/([^/]+)$,) {
$_ = $1;
if (length ($_) > 40) {
# trim out spare stuff to keep it short.
s,^([^:]+://[^/]+)/.*/([^/]+$),$1/.../$2,i;
$name{$url} = $_;
} else {
$name{$url} = $_;

Suddenly, scads of hits to "partial" filenames.

Okay, sleuths. Who -- or what -- goes there?

Pfui

cgrantski

fiestagirl

Alex_Miles

Pfui

jdMorgan

Pfui

Pfui

Pfui

jomaxx

Pfui

jomaxx

Join The Conversation

Moderators and Top Contributors

Hot Threads This Week