Welcome to WebmasterWorld Guest from 18.104.22.168 , register , free tools , login , search , subscribe , help , library , announcements , recent posts , open posts Subscribe to WebmasterWorld
Scooter 3.3 ignores robots.txt? this is ridiculous berli msg:1528124 11:13 pm on Jun 24, 2003 (gmt 0) Just found Scooter/3.3.vscooter grabbing a bunch of files disallowed to all ("*") in my robots.txt file. The only thing I can think of is that the directories were named things like img/ foo/images/
and I used a trailing slash. I saw here that apparently some browsers misinterpret that? Removing the slash somewhat bothers me, because what's stopping some *other* stupid robot from deciding that "foo/images" is a text file called images?
Does this mean I should block Scooter from certain directories in .htaccess? Let it chew 403's?
Or is this robots.txt-ignoring bot not really Scooter at all...?
IPs: 22.214.171.124, 126.96.36.199
outrun msg:1528125 11:23 pm on Jun 24, 2003 (gmt 0)
There should be a slash at the front instead of foo/images/ it should be this /foo/images/
Mark berli msg:1528126 12:39 pm on Jun 25, 2003 (gmt 0)
My mistake when I posted the example. The following is a real line from my robots.txt file: Disallow: /img/
Other spiders, such as Googlebot and -- get this -- ia_archiver have obeyed this directive.
mcavic msg:1528127 3:09 am on Jun 26, 2003 (gmt 0)
I'd try: Disallow: /img
That way, if it requests it with or without the ending slash, it should be denied.
stupid robot from deciding that "foo/images" is a text file called images?
As far as robots.txt is concerned, it doesn't matter what images is. It's supposed to just compare the disallow string with the beginning of the url.
Scooter seems to be obeying on my site.
cespedes msg:1528128 7:50 pm on Jun 26, 2003 (gmt 0)
I have EXACTLY the same problem. I have the following robots.txt:
Everybody obeys that except someone which identifies itself as "Scooter/3.3.vscooter" and makes connections from 188.8.131.52/24.
Does anyone know how to solve it and/or notify the culprits?
mcavic msg:1528129 2:39 am on Jun 27, 2003 (gmt 0)
Apparently, vscooter is Altavista's image indexer.
See here: [
and here: [
I don't know why it isn't obeying, but maybe try writing to Altavista?