RE: older thread [webmasterworld.com...]
I've been watching this bot. Although it really hasn't been much of a pest, I was curious to see whether it would in fact follow robots.txt standards, especially after the bot owner chimed in and said he'd fix it.
I haven't been able to get their info page (www.flightdeckreports.com/pages/bot) to load, but after reading this on the home page I decided not to allow access:
Flight Deck Reports is a next generation data mining service. We provide a cloud-based group computing platform in order to enable decision makers, research analysts and writers to process vast amounts of data found in the source code of web documents. Our goal is to enable discovery and measurement of the trends, business relationships, and Internet technologies that impact your business
So, with the bot owner's announcement (in the older WW thread above) that robots.txt is now being respected, I added:
User-agent: FlightDeckReportsBot
Disallow: /
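For anyone wanting to double-check that a rule like this really does shut the bot out of the whole site, here is a minimal sketch using Python's standard-library urllib.robotparser. The helper name is_allowed and the second agent name SomeOtherBot are made up for illustration. This only shows how a compliant parser reads the rule; it says nothing about what the bot itself actually does.

```python
from urllib.robotparser import RobotFileParser

# The two lines added to robots.txt, as quoted in this post.
ROBOTS_TXT = """\
User-agent: FlightDeckReportsBot
Disallow: /
"""

def is_allowed(robots_txt, user_agent, path):
    """Hypothetical helper: does robots_txt permit user_agent to fetch path?"""
    parser = RobotFileParser()
    parser.parse(robots_txt.splitlines())
    return parser.can_fetch(user_agent, path)

# The named bot is shut out of everything, including index.html.
print(is_allowed(ROBOTS_TXT, "FlightDeckReportsBot", "/index.html"))  # False

# An agent with no matching record (name invented here) is unaffected.
print(is_allowed(ROBOTS_TXT, "SomeOtherBot", "/index.html"))  # True
```

Since there is no "User-agent: *" record, every other crawler stays allowed, which is the intent here.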
Today it came, requested robots.txt, and promptly disobeyed it by requesting index.html.
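If you'd rather not eyeball the raw logs for this, the check can be roughly scripted. This is only a sketch under assumptions: it presumes the Apache/NCSA "combined" log format, the function name requests_after_robots is hypothetical, and the sample lines (IPs, timestamps, sizes) are invented for illustration, not copied from my logs.

```python
import re

# Assumed layout: Apache/NCSA "combined" log format. Adjust if yours differs.
LOG_LINE = re.compile(
    r'^(?P<ip>\S+) \S+ \S+ \[(?P<time>[^\]]+)\] '
    r'"(?P<method>\S+) (?P<path>\S+)[^"]*" (?P<status>\d{3}) \S+ '
    r'"(?P<referer>[^"]*)" "(?P<agent>[^"]*)"'
)

def requests_after_robots(lines, agent_token):
    """Return the paths the agent requested after it fetched /robots.txt."""
    seen_robots = False
    offending = []
    for line in lines:
        m = LOG_LINE.match(line)
        # Skip unparseable lines and lines from other user agents.
        if not m or agent_token.lower() not in m.group("agent").lower():
            continue
        path = m.group("path")
        if path == "/robots.txt":
            seen_robots = True
        elif seen_robots:
            offending.append(path)
    return offending

# Invented sample lines, for illustration only.
sample = [
    '192.0.2.10 - - [01/Jan/2013:10:00:00 +0000] "GET /robots.txt HTTP/1.1" 200 60 "-" "FlightDeckReportsBot"',
    '192.0.2.10 - - [01/Jan/2013:10:00:02 +0000] "GET /index.html HTTP/1.1" 200 5120 "-" "FlightDeckReportsBot"',
    '198.51.100.7 - - [01/Jan/2013:10:00:05 +0000] "GET /about.html HTTP/1.1" 200 2048 "-" "Mozilla/5.0"',
]

print(requests_after_robots(sample, "FlightDeckReportsBot"))  # ['/index.html']
```

With "Disallow: /" in place, anything this returns for the bot is a request it should not have made.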
Now blocked.