How to determine the source of strange activity

Here we're using WebTrends, AWStats and Google Analytics. Our log-based analytic programs are reporting an unusually high number of requests (10,000+/month) for a story we published back in 2001, while Google is reporting the expected amount (less than 50/month).

The top referrers reports are not revealing anything useful, which leads me to believe some sort of spider or robot is the cause of the activity that is not registered with WebTrends and AWStats.

Any suggestions for getting to the bottom of this?

Here is just a sample of what is in our logs:

208.179.xx.xx - - [05/Jul/2006:00:01:19 -0700] "GET /news/maindish/2001/08/30/right/ HTTP/1.1" 200 27091 "-" "Mozilla/4.0 (compatible; MSIE 6.0; Windows NT 5.1; SV1; .NET CLR 1.1.4322)"
146.21.xx.xx - - [05/Jul/2006:00:02:30 -0700] "GET /news/maindish/2001/08/30/right/ HTTP/1.1" 200 27091 "-" "Mozilla/4.0 (compatible;)"
80.178.xx.xx - - [05/Jul/2006:00:05:50 -0700] "GET /news/maindish/2001/08/30/right/ HTTP/1.1" 200 27091 "-" "Mozilla/4.0 (compatible; MSIE 6.0; Windows NT 5.1; SV1; .NET CLR 1.1.4322)"
203.162.xx.xx - - [05/Jul/2006:00:06:09 -0700] "GET /news/maindish/2001/08/30/right/ HTTP/1.1" 200 27091 "-" "Mozilla/4.0 (compatible; MSIE 6.0; Windows NT 5.1; SV1; .NET CLR 1.1.4322)"
124.106.xx.xx - - [05/Jul/2006:00:12:30 -0700] "GET /news/maindish/2001/08/30/right/ HTTP/1.1" 200 27091 "-" "Mozilla/4.0 (compatible; MSIE 6.0; Windows NT 5.1; SV1; .NET CLR 1.1.4322)"
218.22.xx.xx - - [05/Jul/2006:00:12:48 -0700] "GET /news/maindish/2001/08/30/right/ HTTP/1.1" 200 27091 "-" "Mozilla/4.0 (compatible; MSIE 6.0; Windows NT 5.1; SV1; .NET CLR 1.1.4322)"

[edited by: jatar_k at 8:58 am (utc) on July 13, 2006]
[edit reason]
[1][edit reason] no specifics thanks [/edit] [/edit][/1]

How to determine the source of strange activity

cschults

oxbaker

TXGodzilla

Join The Conversation

Moderators and Top Contributors

Hot Threads This Week