Forum Moderators: open
After reading this over the weekend, I really don't want to have anything to do with them:
[radar.oreilly.com...]
"The ccBot crawler is a distributed crawling infrastructure that makes use of the Apache Hadoop and Nutch projects."
To put it politely, the guy who wrote that article is a pretty arrogant chap. Nevermind, he won't have the chance to ignore my robots.txt file since 38.nnn.nnn.nnn has been banned for years, nothing much good coming from there.
Nothing for nothing, but most government webmasters would not now what to do with Robots.txt file to start with. The job usualy held by an EX-Maiframe programmer that is stock in meetings and that had learned how to use fronpage 3.0 back in a day. And such...
On the other hand, the last project I did for that sector, the webmaster had Poster(home made, with the crown) that Said: "CONTENT IS KING", I swear...