homepage Welcome to WebmasterWorld Guest from 54.227.25.58
register, free tools, login, search, subscribe, help, library, announcements, recent posts, open posts,
Subscribe to WebmasterWorld
Visit PubCon.com
Home / Forums Index / Search Engines / Ask - Teoma
Forum Library, Charter, Moderator: open

Ask - Teoma Forum

    
Spidering pages that do not exist
RammsteinNicCage




msg:30942
 12:49 pm on Feb 6, 2004 (gmt 0)

I just checked my logs and saw that Jeeves is spidering a whole bunch of pages that don't exist and never existed. What it's doing is taking a valid file name and adding a space and a 1 to it, something like "/filename.html 1" but it also took "/ 4" and "/ 19" which don't exist at all. It had no problem getting my robots.txt file though. Any reason why this would happen?

Jennifer

 

caine




msg:30943
 1:47 pm on Feb 7, 2004 (gmt 0)

Hi Jennifer,

Is it definetly AJ or is teoma's crawlers, or could it be bogus!

RammsteinNicCage




msg:30944
 8:04 pm on Feb 7, 2004 (gmt 0)

Unfortunately, this thing has dropped out of my error log, but if I see it again, I'll be sure to check the IP. Thanks for the suggestion.

Jennifer

fiestagirl




msg:30945
 8:57 pm on Feb 7, 2004 (gmt 0)

It was AJ for sure. I saw this in the logs this week also. I've seen almost every engine do this at one time or another. My conclusion is that they are testing 404s.

RammsteinNicCage




msg:30946
 11:44 pm on Feb 7, 2004 (gmt 0)

Hmm, why would they want to test 404's?

Jennifer

rytis




msg:30947
 12:15 am on Feb 8, 2004 (gmt 0)

We get lots of this kind "GET /example.html%201 HTTP/1.0", agent "Mozilla/2.0 (compatible; Ask Jeeves/Teoma)", IP points to UUnet.

Global Options:
 top home search open messages active posts  
 

Home / Forums Index / Search Engines / Ask - Teoma
rss feed

All trademarks and copyrights held by respective owners. Member comments are owned by the poster.
Home ¦ Free Tools ¦ Terms of Service ¦ Privacy Policy ¦ Report Problem ¦ About ¦ Library ¦ Newsletter
WebmasterWorld is a Developer Shed Community owned by Jim Boykin.
© Webmaster World 1996-2014 all rights reserved