Spidering pages that do not exist

     

RammsteinNicCage

12:49 pm on Feb 6, 2004 (gmt 0)

10+ Year Member



I just checked my logs and saw that Jeeves is spidering a whole bunch of pages that don't exist and never existed. It takes a valid file name and adds a space and a 1 to it, something like "/filename.html 1", but it also requested "/ 4" and "/ 19", which don't exist at all. It had no problem getting my robots.txt file, though. Any reason why this would happen?

Jennifer

caine

1:47 pm on Feb 7, 2004 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member



Hi Jennifer,

Is it definitely AJ, or is it Teoma's crawlers, or could it be bogus?

RammsteinNicCage

8:04 pm on Feb 7, 2004 (gmt 0)

10+ Year Member



Unfortunately, this thing has dropped out of my error log, but if I see it again, I'll be sure to check the IP. Thanks for the suggestion.

Jennifer

fiestagirl

8:57 pm on Feb 7, 2004 (gmt 0)

10+ Year Member



It was AJ for sure. I saw this in the logs this week also. I've seen almost every engine do this at one time or another. My conclusion is that they are testing 404s.

RammsteinNicCage

11:44 pm on Feb 7, 2004 (gmt 0)

10+ Year Member



Hmm, why would they want to test 404s?

Jennifer

rytis

12:15 am on Feb 8, 2004 (gmt 0)

10+ Year Member



We get lots of requests of this kind: "GET /example.html%201 HTTP/1.0", agent "Mozilla/2.0 (compatible; Ask Jeeves/Teoma)". The IP points to UUnet.
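
For anyone who wants to pull these requests out of their own logs, here is a rough Python sketch. It assumes the request appears in the log line in quotes, as in the common Apache-style formats, and the function name find_numbered_requests is made up for illustration. It decodes %20 to a space and reports any path that ends in a space followed by a number, so it catches both "/example.html%201" and "/ 4".

```python
import re
from urllib.parse import unquote

# Request part of a log line, e.g. "GET /example.html%201 HTTP/1.0".
# Assumption: the request is wrapped in double quotes, as in Apache-style logs.
REQUEST_RE = re.compile(r'"(?:GET|HEAD) (?P<path>.*?) HTTP/\d\.\d"')

# A decoded path ending in a space plus digits, e.g. "/example.html 1" or "/ 4".
SUFFIX_RE = re.compile(r'^(?P<base>.*?) (?P<num>\d+)$')


def find_numbered_requests(log_lines):
    """Return (base_path, number) pairs for requests with a ' <digits>' suffix."""
    hits = []
    for line in log_lines:
        request = REQUEST_RE.search(line)
        if not request:
            continue  # not a GET/HEAD request line we recognise
        # Turn %20 (and any other percent-escapes) back into literal characters.
        path = unquote(request.group("path"))
        suffix = SUFFIX_RE.match(path)
        if suffix:
            hits.append((suffix.group("base"), int(suffix.group("num"))))
    return hits


if __name__ == "__main__":
    # Made-up sample lines modelled on the requests quoted in this thread.
    sample = [
        '"GET /example.html%201 HTTP/1.0" 404',
        '"GET / 4 HTTP/1.0" 404',
        '"GET /robots.txt HTTP/1.0" 200',
    ]
    for base, num in find_numbered_requests(sample):
        print(base, num)
```

Comparing the base paths it reports against your real file names, and the IPs on those lines against the crawler's known ranges, should help tell a genuine Ask Jeeves/Teoma visit from a spoofed user agent.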
 
