Forum Moderators: phranque
Thanks,
Soli
Tricky question this one - I don't quite see how you could use a "spider" to help you achieve this. From your post, as I understand it, you need to know how many times a particular file type has been requested from a set of servers. A spider won't do that - the only thing that can tell you is the server itself, usually via it's logfile.
I can see the issue with having multiple logfiles across multiple servers, but is there any reason you couldn't grab a copy of all of them, consolidate into one file and then run a logfile analyser over that?
What a spider could tell you, is the relative proportions of different filetypes residing on the set of servers (you could crawl everything and count the filetypes as you go). But that doesn't seem to me to be the raw stat you require which appears to be requests and not content?
TJ