Hello,
This miserable bot has been attempting to index my site for about a month. I have it blocked in robots.txt and htaccess and have also tried to stop it using the osCommerce spiders.txt file, but nothing works. It is fed nothing but 403s and continues to hammer my site. Today I tried sending an e-mail to them. Let's hope that works. Every time they index a page they include a session ID number, which would cause havoc with my site if anyone were to follow their links. I notice in GWT there are thousands of links to my site that include a session ID. Here is a sample from my logs of a discobot visit (notice the osCsid number):
38.101.148.nnn - - [05/Dec/2010:04:14:19 -0500] "GET /osc/index.php?cPath=945&osCsid=380c578840de9e8f9084f20d36361363 HTTP/1.1" 403 304 "-" "Mozilla/5.0 (compatible; discobot/1.1; +h**p://discoveryengine.com/discobot.html"
Bing used to do the same thing, but an e-mail to them stopped it.
If the e-mail to discoveryengine works, I will let everyone here know it.