Had this spider come by and take a large portion of my site yesterday evening. It came by the day before and read the robots.txt file before it went to work.
kuloko.com still only says that contextual search is coming soon!
Anyone have any other news on it?
Craig
How can anyone determine what the bot and the site intend to do with the data they are spidering, perhaps even mining, when no page explains the site's goals?
It seems to me an unreasonable expectation on the part of the bot's owner!
Of course, most bot creators/owners/users fail to comprehend that they are invading, possibly without the permission or desire of the webmaster they are visiting, and at that webmaster's expense.
Perhaps someday there will be a protocol that bots actually honor?
Don
I will implement a better robots.txt file.
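For anyone wanting to do the same, here is a rough sketch of what a stricter robots.txt might look like, checked with Python's standard `urllib.robotparser` before uploading. The rules are only an assumption for illustration, not the poster's actual file: the `kuloko-bot` token is taken from the user-agent string in the logs, the `/cgi-bin/` rule is a placeholder, and `is_allowed` is a made-up helper name. It only helps if the bot really honors robots.txt.

```python
from urllib.robotparser import RobotFileParser

# Hypothetical robots.txt: shut out kuloko-bot entirely, and keep
# every other robot out of /cgi-bin/ (placeholder rule).
ROBOTS_TXT = """\
User-agent: kuloko-bot
Disallow: /

User-agent: *
Disallow: /cgi-bin/
"""


def is_allowed(robots_txt, user_agent, path):
    """Return True if robots_txt lets user_agent fetch path."""
    parser = RobotFileParser()
    parser.parse(robots_txt.splitlines())
    return parser.can_fetch(user_agent, path)


if __name__ == "__main__":
    for agent in ("kuloko-bot/0.2", "SomeOtherBot/1.0"):
        for path in ("/start.html", "/cgi-bin/test"):
            print(agent, path, is_allowed(ROBOTS_TXT, agent, path))
```

Checking the rules locally like this catches typos in the file before a spider gets a chance to misread them.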
66.227.104.196 - - [28/Sep/2003:10:53:44 +0200] "GET / HTTP/1.1" 200 2204 www.-.net "-" "kuloko-bot/0.2" "-"
66.227.104.196 - - [28/Sep/2003:23:59:10 +0200] "GET /robots.txt HTTP/1.1" 200 832 www.-.net "-" "kuloko-bot/0.2" "-"
66.90.81.41 - - [14/Oct/2003:06:51:48 +0200] "GET /robots.txt HTTP/1.1" 200 880 www.-.net "-" "kuloko-bot/0.2" "-"
66.90.81.41 - - [22/Oct/2003:11:13:41 +0200] "GET /start.html HTTP/1.1" 206 621 www.-.net "-" "kuloko-bot/0.2" "-"
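If you want to check your own logs for the same pattern, something along these lines might do. It is only a sketch: it pulls out the requests from one user agent and reports whether the earliest one was for robots.txt. The regular expression assumes the log layout seen in the lines above (combined-style fields plus a virtual-host column, with the user agent as the second-to-last quoted field), and `bot_hits` and `read_robots_first` are made-up helper names. The sample lines are the ones posted above, without any forum markup.

```python
import re
from datetime import datetime

# Assumed layout, inferred from the sample lines only.
LINE_RE = re.compile(
    r'^(?P<ip>\S+) \S+ \S+ \[(?P<time>[^\]]+)\] '
    r'"(?P<method>\S+) (?P<path>\S+)[^"]*" (?P<status>\d{3}) \S+'
    r'.*"(?P<agent>[^"]*)" "[^"]*"$'
)

SAMPLE_LOG = [
    '66.227.104.196 - - [28/Sep/2003:10:53:44 +0200] "GET / HTTP/1.1" 200 2204 www.-.net "-" "kuloko-bot/0.2" "-"',
    '66.227.104.196 - - [28/Sep/2003:23:59:10 +0200] "GET /robots.txt HTTP/1.1" 200 832 www.-.net "-" "kuloko-bot/0.2" "-"',
    '66.90.81.41 - - [14/Oct/2003:06:51:48 +0200] "GET /robots.txt HTTP/1.1" 200 880 www.-.net "-" "kuloko-bot/0.2" "-"',
    '66.90.81.41 - - [22/Oct/2003:11:13:41 +0200] "GET /start.html HTTP/1.1" 206 621 www.-.net "-" "kuloko-bot/0.2" "-"',
]


def bot_hits(lines, agent_token):
    """Return sorted (timestamp, path, status) tuples for matching user agents."""
    hits = []
    for line in lines:
        match = LINE_RE.match(line.strip())
        if match is None or agent_token.lower() not in match["agent"].lower():
            continue  # unparseable line or a different visitor
        when = datetime.strptime(match["time"], "%d/%b/%Y:%H:%M:%S %z")
        hits.append((when, match["path"], int(match["status"])))
    return sorted(hits)


def read_robots_first(hits):
    """True if the earliest request in hits was for /robots.txt."""
    return bool(hits) and hits[0][1] == "/robots.txt"


if __name__ == "__main__":
    hits = bot_hits(SAMPLE_LOG, "kuloko-bot")
    for when, path, status in hits:
        print(when.isoformat(), status, path)
    print("robots.txt requested first:", read_robots_first(hits))
```

Bear in mind it only sees the lines you feed it, so an excerpt can give a different answer than the full log would.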