Forum Moderators: open
The result was fast and furious spidering.
I added them to robots as they have specified below.
They've never specified what exactly their research is.
They do however specify that they honor robots.txt in the following lines:
If you only want to forbid only our crawler from going through your
site, then create a robots.txt file that contains the following lines:User-agent: http ://www.almaden.ibm.com/cs/crawler
Disallow: /
Please note, I've purposely left a blank space in the URL to keep the link non-active. If you use this line in robots, you'll need to remove that space.
At one time I "thought" this was the Compuserve bot. Compuserve is now a subsidy of the infamous AOL however some folks still have compuserver addresses and accounts. Perhaps somebody else can provide more insight.
This URL has some info on their newer projects:
[research.ibm.com...]
FWIW, they hit my geek site regularly.