Forum Moderators: open
195.154.174.164 - - [13/Sep/2002:15:52:49 -0400] "GET /robots.txt HTTP/1.0" 200 857 "-" "NG/1.0"
195.154.174.164 - - [13/Sep/2002:17:52:40 -0400] "GET / HTTP/1.0" 200 44134 "-" "NG/1.0"
195.154.174.164 - - [13/Sep/2002:15:52:50 -0400] "GET / HTTP/1.0" 200 44134 "-" "NG/1.0"
195.154.174.164 - - [14/Sep/2002:08:14:19 -0400] "GET /news.html HTTP/1.0" 200 34007 "-" "NG/1.0"
.
. (Fetched many allowed files)
.
195.154.174.164 - - [14/Sep/2002:08:14:05 -0400] "GET /common/officers.shtml HTTP/1.0" 403 838 "-" "NG/1.0"
403-Goodbye!
Jim
This is an English page with contact info [exalead.com].
It's a pretty impressive job Exalead is doing at AOL.fr.
If now they start spidering the whole web, regardless of language, I wonder what they are up to....
<answer>
After verification, it appears that our robot indeed rejected your
robots.txt file as malformed because of a missing terminal newline. Though this is technically incorrect with respect to the specification
(http://www.robotstxt.org/wc/norobots-rfc.html),
I reckon such a minor and unambiguous deviation should have been accepted. I will see that our robots behaviour is changed regarding this point.
</answer>
I think they are some of the good guys out there.
I reckon such a minor and unambiguous deviation should have been accepted.
Wow! I didn't know they spoke "Cowboy" in France. Tres bien!
Hmm... Now I'll have to go check for a terminal LF in my robots.txt and un-ban them if there isn't
one. Sounds like a minor upgrade to the Search Engine World robots.txt checker is needed, too.
Thanks for checking on this, weesnich.
Jim