Forum Moderators: open
81.154.39.137 - - [03/Mar/2005:15:06:54 -0800] "GET /robots.txt HTTP/1.1" 200 2004 "-" "Mozilla/4.0 (JemmaTheTourist;http://www.activtourist.com)"
81.154.39.137 - - [03/Mar/2005:15:06:56 -0800] "GET /Blah NOT_related_to_tourism.html HTTP/1.1" 200 11269 "-" "Mozilla/4.0 (JemmaTheTourist;http://www.activtourist.com)"Jemma is a web crawler that automatically crawls the web looking to add tourist information to our search index. We do this by looking for links within web sites that we can follow and index. Not every page we crawl is indexed, so below we have supplied some recommendations to ensure your pages are indexed.
If you can send me the URL where there is a robots.txt the JemmaTheTourist crawler is not obeying that would be great.
Thank you in advanced.
Damian