Forum Moderators: open

Message Too Old, No Replies

SnookBot

Spider for Small Business Advertising Network

         

incrediBILL

11:32 pm on Mar 16, 2010 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member Top Contributors Of The Month



IP is 75.101.188.x from the Amazon AMS cloud (aka mess)

User agent:

"SnookBot/2.9.1 (Small Business Advertising Network / SnookBot crawler; http://www.rentalboatcharters.com/; support [at] rentalboatcharters.com)"


This bot is used to populate a network of directories such as:
  • listholidayrentals.com
  • rentalboatcharters.com
  • listphotographers.com
  • lawyers.sban.com


Main site is: www.sban.com

Claims to follow robots.txt
Do you follow robots.txt protocol? What if I want my business to be excluded from your directory?

The robot exclusion standard, also known as the Robots Exclusion Protocol or robots.txt protocol, is a convention to prevent cooperating web spiders and other web robots from accessing all or part of a website which is otherwise publicly viewable.

We do respect and follow Robots Exclusion Protocol.
If you do not want to be listed in our directory, e-mail a request to our custumer support or use a robots.txt to disallow our robot to crawl our website content by adding following text to your robots.txt file:

User-agent: SnookBot
Disallow: /


Surprised nobody else has noticed this thing crawling as they've been very busy in the last couple of years.

keyplyr

12:45 am on Mar 17, 2010 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member Top Contributors Of The Month




Surprised nobody else has noticed this thing crawling as they've been very busy in the last couple of years.

Probably because most of us have banned that and all other AWS ranges. Even before that epiphany I white listed UA containing crawler, spider, etc. But thanks for the write up. They may change providers at some point.

incrediBILL

12:58 am on Mar 17, 2010 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member Top Contributors Of The Month



I had them banned on first contact too but I still log attempts ;)

tangor

5:29 am on Mar 17, 2010 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member Top Contributors Of The Month



I probably do not report as many as I should... but as long as they respect my robots.txt and THAT is all they take then I'm not concerned. But you made me look... Never had one from SnookBot, but then again, I block by IP most of AWS.

keyplyr

6:21 pm on Mar 17, 2010 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member Top Contributors Of The Month



Any relation to Snooky-bot and The Situation-bot?