Forum Moderators: DixonJones

Message Too Old, No Replies

What is Slurps problem anyway?

Files I never had, it requests. Someone 'Feeding' the bot mis-information?

         

pendanticist

6:45 pm on Mar 1, 2005 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member



66.196.65.55 - - [28/Feb/2005:22:43:17 -0800] "GET /haiderint/BlahBlah.htm HTTP/1.0" 404 2847 "-" "Mozilla/5.0 (compatible; Yahoo! Slurp; [help.yahoo.com...]
66.196.65.55 - - [28/Feb/2005:22:43:17 -0800] "GET /trickle/BlahBlah/Shine.htm HTTP/1.0" 404 2847 "-" "Mozilla/5.0 (compatible; Yahoo! Slurp; [help.yahoo.com...]

The Blahblahs are actual file names and these are only a few of the more outlandish requests.

However, the info ahead of and behind those files are ficticious at best. Never had any files like that.

Where do they come from?

I'm all for correcting any 404s the bots may encounter that I caused, but this is getting a tad rediculous. The more I do, the more I have to do...

Also, my files do not end in .htm.

Any Yahoo gurus running around in here that can adequately explain how Slurp finds something that has never been there?

Stefan

6:52 pm on Mar 1, 2005 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member



I'm not one of the gurus, but I'm familiar with what you're seeing. Occasionally Slurp, for at least a year, gets a little schiz and puts together file names for two different sites, then looks for them. It can ask for some very bizarre and mystifying pages, that exist only in its own digital mind. I like to think of it as Slurp having odd dreams, throwing bits and pieces together, trying to sort things out... it might be AI in the early stages.

pendanticist

7:30 pm on Mar 1, 2005 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member



So, we're talking about a dilusional bot then? Great. <chuckle> That's just what we need, a bot suffering from dilusions and probably on some mind-altering drug too.

No stems, no seeds that we don't neeeeeeed....

zivkovicp

7:41 pm on Mar 2, 2005 (gmt 0)

10+ Year Member



Is it possible that they are just checking for sites that generate dynamic content based on ANY request? Could it be possible that they penalize that sort of thing?

I have also seen this before but I have only static pages.

bull

9:16 pm on Mar 2, 2005 (gmt 0)

10+ Year Member



Ditto, zivkovicp
Happens on shared hosting?

mattglet

9:45 pm on Mar 2, 2005 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member



Is it possible that they are just checking for sites that generate dynamic content based on ANY request? Could it be possible that they penalize that sort of thing

What about a dynamically created 404 page? This would be penalized?

It's not often you see a C&C reference here, pendanticist ;)

Stefan

9:53 pm on Mar 2, 2005 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member



Is it possible that they are just checking for sites that generate dynamic content based on ANY request?

The bizarro requests that I get have parts of real files that exist on my site, appended to someone else's file names, (not sure if that is 100%, but it's often the case). Why would it bother doing that if it were just testing? It could use /testtest.htm or whatever.

It's likely just a temporarily corrupted database, (or little nervous breakdowns, or the thing twitching in its sleep, or perhaps it starting to gain awareness and it's fooling around like a kid with toys). The bizzaro factor is too high for it to be intentional.