|Ban bots with no user agent|
Is it safe to do so? If so, how?
| 3:47 pm on Jan 21, 2010 (gmt 0)|
Hi all, recently my site stats showed a bot that sucked up over a gig of bandwidth. My 'awstats' report just identifies it as:
Unknown robot (identified by empty user agent string)
Is it safe to assume that any 'legitimate' bot would have a user agent string?
If so, is there a way (via httpd.conf) to deny access to bots that are missing their user agent string?
| 4:59 pm on Jan 21, 2010 (gmt 0)|
There's no reason that any webmaster should allow visitors (bots or otherwise) when the UA is blank.
deny blank user agent (Webmaster World) [google.com]
deny blank user agent (google) [google.com]
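For reference, a minimal sketch of what such a rule might look like in httpd.conf, assuming Apache 2.2 with mod_setenvif and mod_authz_host loaded. The variable name `blank_ua` and the directory path are placeholders, not something from this thread:

```apache
# Flag requests whose User-Agent header is empty or missing
# (mod_setenvif tests a missing header as an empty string).
SetEnvIf User-Agent "^$" blank_ua

# Hypothetical document root; adjust to your own site.
<Directory "/var/www/html">
    Order Allow,Deny
    Allow from all
    Deny from env=blank_ua
</Directory>
```

On Apache 2.4 the Order/Allow/Deny lines would typically be replaced by `Require` directives (for example `Require all granted` plus `Require not env blank_ua` inside a `<RequireAll>` block). Either way, check it against your own logs before relying on it.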
| 6:36 pm on Jan 21, 2010 (gmt 0)|
Thanks, that definitely helped. Now that I've identified 400+ IPs with no user agent, I'm wondering if there's a way to bulk check them against a blacklist?
I have found a few sites that can check individually, but not in bulk.
One interesting thing: Yahoo Japan's bot has been visiting with no user agent. Almost banned them!
| 6:47 pm on Jan 21, 2010 (gmt 0)|
Certain caching proxies, such as those in AOL's network, will make HEAD requests with blank UAs. You'll also see a lot of favicon requests with no UA.
Yahoo's 'ycar' hosts make requests with no UA, but we haven't figured out what 'ycar' is yet... It's still an open question over in our spider and user-agent ID forum.
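If you'd rather not block those HEAD and favicon requests, one possible approach is to clear the flag for them before denying. A rough sketch, assuming Apache with mod_setenvif; `blank_ua` is a made-up variable name, and the snippet only has an effect when paired with a rule that denies on that variable (such as `Deny from env=blank_ua`):

```apache
# Flag empty or missing User-Agent headers.
SetEnvIf User-Agent "^$" blank_ua

# Un-flag HEAD requests (e.g. from caching proxies) and favicon fetches.
# SetEnvIf directives are evaluated in order, so these must come after the line above.
SetEnvIf Request_Method "^HEAD$" !blank_ua
SetEnvIf Request_URI "^/favicon\.ico$" !blank_ua
```

The trade-off is that a blank-UA bot can still get through with HEAD requests, though those return no body and so cost little bandwidth.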
| 10:50 pm on Jan 21, 2010 (gmt 0)|
|I'm wondering if there's a way to bulk check them against a blacklist? |
None that I'm aware of.