Forum Moderators: open

Message Too Old, No Replies

"HTML Text Download Class" Agent

Does not respect robots.txt

         

carfac

3:29 pm on Sep 24, 2002 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member



Got hit by this one last night- from Instituto Tecnologico y de Estudios Superiores de Monterrey ...

Didn't take much, because it ignored robots.txt and got itself banned mucho pronto!

Started off like a 'bot- requests for all the links off my top page in a second, but then I see requests every 5-10 seconds, like a real person, so I guess it might be a web accelorator (which I bann just because...)

148.233.159.250 - - [24/Sep/2002:04:56:13 -0600] "GET /search.cgi?query= HTTP/1.1" 200 23818 "-" "HTML Text Download Class"

dave

Brett_Tabke

9:50 am on Nov 21, 2002 (gmt 0)

WebmasterWorld Administrator 10+ Year Member Top Contributors Of The Month



I've seen that agent myself and trying to connect it with a program. Anyone know?

carfac

4:35 pm on Nov 21, 2002 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member



Hi Brett!

Sorry- I have not been able to turn up anything...

dave