Requested robots.txt and the root page on two sites.
74.62.161.zz - - [19/Feb/2009:13:55:29 +0000] "GET /robots.txt HTTP/1.1" 200 5023 "-" "flatlandbot/allspark (Flatland Industries Web Spider; [flatlandindustries...] dot com/flatlandbot; jason @ flatlandindustries dot com)"
Their web site says (in H tags)...
Providing New Revenue Streams for Web Publishers
... search solutions for business, education & government
... Your Own Vertical Search Engine
... Valuable Service for Your Users
... Monetize Those Search Results
In other words, make money from the sites we scrape.
great-plains-web-spider/flatlandbot (Flatland Industries Web Spider; [flatlandindustries.com...] jason@flatlandindustries.com)
Asked for robots.txt, then promptly ignored it.
Looks like the same HOST, too, because my info appears to match the OP's IP (if the last two digits are this year minus 1934):
rrcs-74-62-161-zz.west.biz.rr.com
[edited by: Pfui at 1:57 am (utc) on Feb. 20, 2009]
crawler.flatlandindustries.com
flatlandbot/baypup (Flatland Industries Web Spider; [flatlandindustries.com...] jason@flatlandindustries.com)
robots.txt? YES
Methinks six -- count 'em, SIX -- mentions of "flatland" in every single hit is egomaniacally over-the-top (not to mention added logspam).
Same as you, this is the most recent one I've seen:
flatlandbot/allspark (Flatland Industries Web Spider; [flatlandindustries.com...] jason@flatlandindustries.com)
Others from 2008 include:
flatlandbot/baypup (Flatland Industries Web Spider; [flatlandindustries.com...] jason@flatlandindustries.com)
flatlandbot/flatlandbot (Flatland Industries Web Spider; [flatlandindustries.com...] jason@flatlandindustries.com)
flatlandbot/flatlandbot (Flatland Industries Web Spider; [flatlandindustries.com...] jason@flatlandindustries.com)
great-plains-web-spider/flatlandbot (Flatland Industries Web Robot; [flatlandindustries.com...] jason@flatlandindustries.com)
great-plains-web-spider/flatlandbot (Flatland Industries Web Spider; [flatlandindustries.com...] jason@flatlandindustries.com)
great-plains-web-spider/gpws (Flatland Industries Web Spider; [flatlandindustries.com...] jason@flatlandindustries.com)
EDIT: Oops. The baypup one was last seen on Jan 31, 2009.
[edited by: GaryK at 11:58 pm (utc) on Mar. 10, 2009]
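For anyone wanting to pull these hits out of their own logs: every variant posted so far carries either "flatland" or "great-plains-web-spider" somewhere in the UA, so one case-insensitive pattern should catch the lot. A rough Python sketch, assuming Apache combined log format like the line in the first post. The names `parse_line` and `is_flatland` are made up for illustration, and the pattern only covers the strings seen in this thread.

```python
import re

# Combined log format:
# host ident user [time] "request" status bytes "referer" "user-agent"
LOG_RE = re.compile(
    r'^(?P<host>\S+) \S+ \S+ \[(?P<time>[^\]]+)\] '
    r'"(?P<request>[^"]*)" (?P<status>\d{3}) (?P<size>\S+) '
    r'"(?P<referer>[^"]*)" "(?P<agent>[^"]*)"$'
)

# Assumption: these two tokens cover the UA variants listed in this thread.
FLATLAND_RE = re.compile(r'flatland|great-plains-web-spider', re.IGNORECASE)


def parse_line(line):
    """Return the log fields as a dict, or None if the line does not match."""
    match = LOG_RE.match(line.strip())
    return match.groupdict() if match else None


def is_flatland(agent):
    """True if the user-agent string looks like one of the Flatland bots."""
    return bool(FLATLAND_RE.search(agent))


def flatland_requests(lines):
    """Yield (host, request) for every parsed hit from a Flatland UA."""
    for line in lines:
        fields = parse_line(line)
        if fields and is_flatland(fields["agent"]):
            yield fields["host"], fields["request"]
```

Feed it an open log file, e.g. `for host, req in flatland_requests(open("access.log")): print(host, req)`, where the file name is whatever your server writes.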
>Been blocking this for some time. My IP & UA compare to yours.
>Their web site says (in H tags)...
>In other words, make money from the sites we scrape.
We make money (laugh! not yet) the same way Google and Yahoo do. Our beta ad delivery network is at: ads [dot] baypup [dot] com.
Apologies if my bot was causing anyone problems.
All my bots obey robots.txt, and if they don't, I would like to know about it.
Call my NOC phone if I can help - 816-309-1463
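If anyone wants to test the "obeys robots.txt" claim against their own logs, here is a minimal sketch using Python's standard `urllib.robotparser`. It takes the text of your robots.txt, a UA string, and the paths that UA requested, and returns the ones your rules disallow. The function name `disallowed_hits` and the sample rules are hypothetical, not anything from Flatland's site or from anyone's real robots.txt.

```python
from urllib.robotparser import RobotFileParser


def disallowed_hits(robots_txt, agent, paths):
    """Return the requested paths that robots_txt disallows for this agent."""
    parser = RobotFileParser()
    # parse() takes the file as a list of lines, so no network fetch is needed.
    parser.parse(robots_txt.splitlines())
    return [path for path in paths if not parser.can_fetch(agent, path)]


# Hypothetical rules and hits, for illustration only.
sample_rules = "User-agent: flatlandbot\nDisallow: /private/\n"
sample_paths = ["/robots.txt", "/", "/private/page.html"]

if __name__ == "__main__":
    print(disallowed_hits(sample_rules, "flatlandbot/allspark", sample_paths))
```

One thing to watch: as far as I can tell, the stdlib parser matches on the part of the UA before the first slash, so the `great-plains-web-spider/...` variants would need their own `User-agent` line (or a `*` rule) to be covered. Whether the bot itself matches that way is a separate question.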