Generally, the things I find tangled up in robot traps are either known scrapers I had not yet seen on that site or something that could be a residential ISP (though its activity says otherwise). This is the first time I've seen an "Enterprise Search Appliance" IP crawling pages and ignoring robots.txt:
Thunderstone Software EXP-THUND-24 (NET-206-183-1-0-1)
206.183.1.0 - 206.183.1.255
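
If you want to check whether hits in your own logs come from that range, here is a minimal sketch in Python. The function name and the sample addresses are made up for illustration; the only thing taken from the record above is the range itself, which works out to 206.183.1.0/24.

```python
import ipaddress

# Range copied from the WHOIS record above:
# 206.183.1.0 - 206.183.1.255 is the same as 206.183.1.0/24.
THUNDERSTONE_NET = ipaddress.ip_network("206.183.1.0/24")


def in_thunderstone_range(ip: str) -> bool:
    """Return True if `ip` falls inside the range listed above.

    Hypothetical helper, not part of any existing tool.
    Anything that does not parse as an IP address returns False.
    """
    try:
        return ipaddress.ip_address(ip) in THUNDERSTONE_NET
    except ValueError:
        return False


if __name__ == "__main__":
    # Sample addresses are invented, not taken from real logs.
    for candidate in ("206.183.1.17", "206.183.2.1", "not-an-ip"):
        print(candidate, in_thunderstone_range(candidate))
```

This only tells you that an address sits in the listed block. Whether to block it, rate-limit it, or just keep watching is a separate call.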