Forum Moderators: DixonJones
Getting about a dozen each visit, once or more per week. Does this really need to continue indefinitely?
66.196.65.51 - - [29/Jan/2006:12:37:11 -0800] "GET /SlurpConfirm404/healtale/page.htm HTTP/1.0" 404 813 "-" "Mozilla/5.0 (compatible; Yahoo! Slurp; http://help.yahoo.com/help/us/ysearch/slurp)"
I have no interest in wasting bandwidth feeding Yahoo! Slurp China, since I can't practically provide any services to China, and any unfortuate chinese person who visited my site would probably find himself in trouble with the authorities -- Too many highly-dangerous concepts on the site, like liberty, freedom, free speech, and democracy.
Just my little gripe to add to yours... :)
Jim
Basically, the hosted site's robots.txt is 'included' as a text-only file by their script, and checked. Apparently, some customers didn't disallow those e-commerce-package-related dynamic URLs, and the robots caused this host a lot of grief, overloading servers and consuming massive bandwidth, due to the fact that the URL-space on those URLs is essentially infinite. I don't want to hijack this thread, but thanks for asking... Despite the fact that the server has been down maybe five minutes in the last seven years, I suppose I'll end up moving this site.
I just think that since 'the rules' for China are different from most of the rest of the world, Slurp China ought to recognnize a different agent name in robots.txt, and save me the bother of sending them a 'special' Falun Gong page so they'll take me off their spider's URL list...
And fix SlurpConfirm for us, too. ;)
Jim