Hi Guys,
Thanks for your responses.
I am familiar (and comfortable) with toolbars, browsers etc fetching favicon - but they usually have a user-agent
e.g. "GET /favicon.ico HTTP/1.1" 200 2238 "-" "Mozilla/5.0 (compatible; Google Desktop/5.9.1005.12335; http:// desktop.google .com/)"
The requests that are bugging me are mostly coming from amazonaws IP ranges (as mentioned in my OP). Those are definitely not "normal" requests, from humans, browsers, toolbars, SEs etc.
e.g. ec2-67-202-61-111.compute-1.amazonaws.com - - [09/Aug/2010:08:34:41] "GET /favicon.ico HTTP/1.1" 403 - "-" "-"
The frequency of the many requests, by what are very obviously bots, is what got me wondering "why are they so interested in favicon".
blend27 wrote:
Testing if their IP Range blocked on .HTACCESS level?
That is a possibility that I had thought of. So I allowed access to favicon for "all" to see what they would do then, but nothing changed. The requests for only favicon continued. So I changed it back to deny when both referer and user agent are missing.
dstiles wrote:
There is (was?) at least one sort-of-SE that collects icons to display on a links page - never went to any pages, just assumed favicon was in the root (which is why I moved mine out of the root).
I remember that - iconsurf. com but I haven't seen it in any of my sites in a long time. Not sure why you have a problem with it, when I first spotted it I just denied it in robots.txt and it obeyed.
keyplyr wrote:
I allow blank referrer for favicon, robots.txt, and several other info type flat files.
I allow blank referer for robots.txt, and .shtml files (my error pages). I used to also allow .ico, but removed it when the current rush from amazonaws started.
So I am still flummoxed as to why they want it. You'd think that when denied access to favicon, they'd try to get a page or another file - but they don't. They just come back a bit later and ask for favicon again.
I guess I'll never know why - but thought it was worthwhile asking here as there are members who seem to know a lot about this sort of thing.