WebmasterWorld
Spidering pages that do not exist
RammsteinNicCage msg:30942 12:49 pm on Feb 6, 2004 (gmt 0)
I just checked my logs and saw that Jeeves is spidering a whole bunch of pages that don't exist and never existed. What it's doing is taking a valid file name and adding a space and a 1 to it, something like "/filename.html 1", but it also requested "/ 4" and "/ 19", which don't exist at all. It had no problem getting my robots.txt file, though. Any reason why this would happen?
caine msg:30943 1:47 pm on Feb 7, 2004 (gmt 0)
Is it definitely AJ, or is it Teoma's crawlers, or could it be bogus?
RammsteinNicCage msg:30944 8:04 pm on Feb 7, 2004 (gmt 0)
Unfortunately, this thing has dropped out of my error log, but if I see it again, I'll be sure to check the IP. Thanks for the suggestion.
fiestagirl msg:30945 8:57 pm on Feb 7, 2004 (gmt 0)
It was AJ for sure. I saw this in the logs this week also. I've seen almost every engine do this at one time or another. My conclusion is that they are testing 404s.
RammsteinNicCage msg:30946 11:44 pm on Feb 7, 2004 (gmt 0)
Hmm, why would they want to test 404s?
rytis msg:30947 12:15 am on Feb 8, 2004 (gmt 0)
We get lots of requests like this: "GET /example.html%201 HTTP/1.0", agent "Mozilla/2.0 (compatible; Ask Jeeves/Teoma)". The IP points to UUnet.
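For anyone who wants to pull these requests out of a log, here is a rough Python sketch. It assumes the request appears in the log line as a quoted `"GET <path> HTTP/x.y"` string, as in the example above. The function name `find_numbered_suffix` and the sample log line (including its IP address) are made up for illustration and are not from anyone's real logs. It decodes `%20` to a space and then looks for a path ending in a space plus a number, which covers both "/filename.html 1" and "/ 4".

```python
import re
from urllib.parse import unquote

# Request line inside a log entry, e.g. "GET /example.html%201 HTTP/1.0".
# Assumption: the log format quotes the request line this way.
REQUEST_RE = re.compile(r'"(?:GET|HEAD) (?P<path>.+?) HTTP/\d\.\d"')

# A decoded path that ends in a space followed by digits.
SUFFIX_RE = re.compile(r'^(?P<base>.*) (?P<num>\d+)$')


def find_numbered_suffix(log_line):
    """Return (base_path, number) if the requested path ends in
    a space plus a number, otherwise None."""
    request = REQUEST_RE.search(log_line)
    if request is None:
        return None
    # Turn %20 back into a literal space before matching.
    path = unquote(request.group("path"))
    suffix = SUFFIX_RE.match(path)
    if suffix is None:
        return None
    return suffix.group("base"), int(suffix.group("num"))


# Hypothetical log line built around the request quoted in this thread.
sample = ('192.0.2.1 - - [08/Feb/2004:00:15:00 +0000] '
          '"GET /example.html%201 HTTP/1.0" 404 - "-" '
          '"Mozilla/2.0 (compatible; Ask Jeeves/Teoma)"')
print(find_numbered_suffix(sample))
```

Running it over each line of an access log and collecting the non-None results should show which real file names the spider is appending numbers to. Adjust the request regex if your server logs in a different format.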