Forum Moderators: open
I'm pretty new to this .htaccess and robot.txt stuff, and even though I've had my site for almost four years now, I've never had to use any of this stuff until now, when I started up a photoblog. I recently quelled some Java bots through .htaccess and that's been working pretty well for now, but then yesterday, this showed up in my log:
217-20-113-110.internetserviceteam.com - - [10/Feb/2006:21:51:30 -0500] "GET /example HTTP/1.1" 200 713 "-" "Mozilla/5.0 (Windows; U; Windows NT 5.0; de-DE; rv:1.4) Gecko"
217-20-113-110.internetserviceteam.com - - [10/Feb/2006:21:51:31 -0500] "GET /example HTTP/1.1" 200 51350 "-" "Mozilla/4.0 (compatible; MSIE 6.0; Windows NT 5.0; FunWeb"
217-20-113-110.internetserviceteam.com - - [10/Feb/2006:21:51:31 -0500] "GET /example HTTP/1.1" 200 15732 "-" "Mozilla/4.0 (compatible; MSIE 5.17; Mac_PowerPC)"
217-20-113-110.internetserviceteam.com - - [10/Feb/2006:21:51:31 -0500] "GET /example HTTP/1.1" 200 17422 "-" "Mozilla/4.0 (compatible; MSIE 6.0; Windows NT 5.1; SV1; Hotbat"
217-20-113-110.internetserviceteam.com - - [10/Feb/2006:21:51:31 -0500] "GET /example HTTP/1.1" 200 13564 "-" "Mozilla/4.78 (Windows NT 5.1; U) Opera 7.21 [de]"
As you can see, every request contains different information. I tried doing a search on internetserviceteam.com, but all I got were a couple pages saying that it was a pic-grabber and the rest were different IPs showing up in other people's logs.
Since I'm a total newbie at this, how on earth do you block the host in .htaccess? I assume you can do it because you can block IP ranges, but since this is probably spoofed, there's got to be a way to get it from any other side. Or is there a way to kill something for too many page requests in a time period? I feel like I've heard something like that before but since I'm self-taught and mostly winging it, I'm at a loss right now.
Thanks in advance for any consideration this gets!
[edited by: volatilegx at 4:37 pm (utc) on Feb. 13, 2006]
[edit reason] examplified URIs and shorted log excerpt [/edit]