Slurp Gone Wild

Took only disallowed files

         

GaryK

6:24 pm on Jul 12, 2009 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member



slurp, yahoo! slurp, slurp/2.0, inktomi slurp, slurp.so/1.0
72.30.161.222
llf531029.crawl.yahoo.net
-----
OrgName: Inktomi Corporation
OrgID: INKT
Address: 701 First Ave
City: Sunnyvale
StateProv: CA

robots.txt? YES. It then proceeded to take only the disallowed files!
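For anyone wanting to confirm this kind of behavior from their own logs, here is a minimal sketch using Python's standard `urllib.robotparser`. The robots.txt rules and the requested paths are made up for illustration; the actual file and log entries are not shown in this thread, and `disallowed_hits` is a hypothetical helper name.

```python
from urllib.robotparser import RobotFileParser

# Hypothetical robots.txt rules; the real file is not shown in this thread.
ROBOTS_TXT = """\
User-agent: *
Disallow: /private/
Disallow: /cgi-bin/
"""


def disallowed_hits(robots_txt, user_agent, requested_paths):
    """Return the requested paths that robots.txt forbids for user_agent."""
    parser = RobotFileParser()
    parser.parse(robots_txt.splitlines())
    return [p for p in requested_paths if not parser.can_fetch(user_agent, p)]


# Hypothetical paths pulled from an access log for the crawler's IP.
requested = ["/robots.txt", "/private/report.html", "/index.html", "/cgi-bin/form.pl"]
violations = disallowed_hits(ROBOTS_TXT, "Slurp", requested)
print(violations)
```

If every path the crawler requested after robots.txt ends up in `violations`, that matches the pattern described above.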

enigma1

9:43 am on Jul 15, 2009 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member



That's the reason I don't pay much attention to robots.txt and just leave it blank. Plus, a server can always send a redirect header that points spiders to a disallowed location, and they do follow it.
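To illustrate the redirect point, here is a rough sketch of the check a well-behaved spider could run before following a `Location` header. It is not how Slurp or any real crawler is known to work; `redirect_allowed`, the rules, and the example.com URLs are all assumptions for illustration.

```python
from urllib.parse import urljoin, urlsplit
from urllib.robotparser import RobotFileParser


def redirect_allowed(robots_txt, user_agent, request_url, location):
    """Check a redirect target against robots.txt before following it.

    Returns True or False for same-host targets, or None when the target
    is on another host, whose own robots.txt would have to be fetched first.
    """
    target = urljoin(request_url, location)  # Location may be relative
    if urlsplit(target).netloc != urlsplit(request_url).netloc:
        return None
    parser = RobotFileParser()
    parser.parse(robots_txt.splitlines())
    return parser.can_fetch(user_agent, target)


# Hypothetical rules and URLs, for illustration only.
rules = "User-agent: *\nDisallow: /private/\n"
print(redirect_allowed(rules, "Slurp", "http://example.com/page", "/private/secret.html"))
print(redirect_allowed(rules, "Slurp", "http://example.com/page", "/public.html"))
```

A spider that skips this step will end up in the disallowed area whenever the server redirects it there, which is the behavior described above.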