Welcome to WebmasterWorld Guest from

Forum Moderators: open

Message Too Old, No Replies

Spidering pages that do not exist

12:49 pm on Feb 6, 2004 (gmt 0)

10+ Year Member

I just checked my logs and saw that Jeeves is spidering a whole bunch of pages that don't exist and never existed. What it's doing is taking a valid file name and adding a space and a 1 to it, something like "/filename.html 1" but it also took "/ 4" and "/ 19" which don't exist at all. It had no problem getting my robots.txt file though. Any reason why this would happen?


1:47 pm on Feb 7, 2004 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member

Hi Jennifer,

Is it definetly AJ or is teoma's crawlers, or could it be bogus!

8:04 pm on Feb 7, 2004 (gmt 0)

10+ Year Member

Unfortunately, this thing has dropped out of my error log, but if I see it again, I'll be sure to check the IP. Thanks for the suggestion.


8:57 pm on Feb 7, 2004 (gmt 0)

10+ Year Member

It was AJ for sure. I saw this in the logs this week also. I've seen almost every engine do this at one time or another. My conclusion is that they are testing 404s.
11:44 pm on Feb 7, 2004 (gmt 0)

10+ Year Member

Hmm, why would they want to test 404's?


12:15 am on Feb 8, 2004 (gmt 0)

10+ Year Member

We get lots of this kind "GET /example.html%201 HTTP/1.0", agent "Mozilla/2.0 (compatible; Ask Jeeves/Teoma)", IP points to UUnet.

Featured Threads

Hot Threads This Week

Hot Threads This Month