Welcome to WebmasterWorld Guest from 220.127.116.11 , register , free tools , login , search , pro membership , help , library , announcements , recent posts , open posts Become a Pro Member
new bot? analysis.he.net/18.104.22.168 skirril
Comes from 22.214.171.124 (shows up as: analysis.he.net) ua: Mozilla/4.0 (compatible;MSIE 5.5;Windows NT 5.0)
Does not seem to honor robots.txt; deep crawls;
up to 5 requests per second!!
also does NOT honor robots meta tag (deep-crawled a page I had set as "noindex,nofollow")
This one hit my site twice yesterday, and looking back in my logs had been around about the time you posted your message.
I do have pages that require authorization on this site. The funny thing is, it never got robots.txt, but it stays away from the directory that requires authorization. It "acts" like it has seen the robots.txt file, because it gets everything else on the site.
he.net is Hurricane Electric in Fremont, CA.
Anybody else seen this one?
I was getting ready to nuke 'em in .htaccess. I thought they were messin' with me. Sorta glad to see it's not just me.
I wonder if its one of the mods here at WmW checking up to see if we're all doing our part in applying the techniques learned here.;)
Well, h.e.'s back again. Nobody knows?
I just continue to let it rape my site without knowing whether to .htdisallow it or not.
Comes around about once every 2 months, just like google. Grabs everything.
I had a visit from cypress.he.net with the user agent Pizilla++ ver 2.45
Can't one block this IP?