Forum Moderators: DixonJones
So I've got a script that pulls out all the AOL hits -- and I'm wondering if there's a rule of thumb for translating that list into a rough estimate for the number of people it represents.
In other words, is it, on average, 8 AOL IP addresses per visitor? Or is it more like 6? And would it be rare or common for AOL to dynamically reassign the same IP to a different visitor within 30 minutes?
You got me thinking so I set up a page on a test site and hit it a few times throughout the day through AOL, both DSL and dialup.
DSL 64.12.96.169 10:56:51
DSL 64.12.96.169 11:31:23
DSL 205.188.209.40 12:16:40
DSL 152.163.252.34 12:29:08
DU 205.188.209.40 12:48:55
DU 205.188.209.40 14:06:49
DU 152.163.252.34 14:26:50
DU 152.163.252.34 15:23:40
DU 205.188.209.40 15:39:42
DSL 205.188.209.40 18:25:09
DSL 205.188.209.40 23:22:48
This doesn't help too much and can't really see any way to extrapolate the information even if there is a larger sample size. Wish there were a good rule of thumb as a couple of our sites get very heavy traffic from AOL users.
Jim
So if your site averages 5 page impressions per unique visitor and you get 1000 hits from AOL IP's, then 1000 page requests / 5 = an estimated 200 unique visitors.
Or use cookies to get a more accurate count.
This is a new client and I'm trying to get a good picture of their traffic for many reasons. One is that they are looking for the most fruitful locations to hold seminars for their interested public and they want to pick cities where they have a good user base.
We're doing some geo-location from IPs and it's fascinating to see how non-homogenous their accesses are. I've advised them to ytack with persistent cookies and session cookies and that's going to happen, but it doesn't help me now with their legacy data.
I like your averaging idea, Key_Master. For this purpose it's probably the easiest approach I've considered so far and it will give me something intelligent to say right away.
aspdaddy, I also like your idea. I'm going to generate a subset of the logs that just has the AOL hits and then see what an analysis on that subset looks like. Looking at the data from both of these approaches should give similar results. If there's too much divergence, then I won't trust either one -- then we'll just have to wait until we have some cookie data.
It's the "as many as" part that is particularly frustrating. If the same IP always asked for the pages, then that would help. But I don't think that's true. And more than one AOL user within the same 30 minute period makes it quite nutty.
Geo-location only reports AOL or AOLUK. No real help there.
Even if you understand the characteristics of AOL IPs well, they might change the format, and anonomizers and some networks give similar problems.
Depending on the type of traffic & fields available, counting unique UA's, certain octects of the IP, external referers or even homepage hits can give a rough idea of session numbers.
If you want to reconstruct sessions though, thats much harder.
Apart from that ...cookie tracking. It's always better than logs imho, but that's been mentioned also.
This AOL thing... just wondering... is this weird way of handling privacy their new image? ip-switching and encrypted seachstrings and all...
/claus
- X-Forwarded-For
- Client-IP
- Via
Based on some research, I doubt AOL sends these headers the same way other Proxies do. Most people referred to their proxies as exibiting non-standard behaviour.
Is there anyone with an AOL connection that can test this?
In perl and probably PHP, these variables will be accessible as environment variables called:
- HTTP_X_FORWARDED_FOR
- HTTP_CLIENT_IP
- HTTP_VIA
One other thing I thought of, are you factoring in AOL's caching behaviour? If they have all your content cacheable your logs could show lower numbers of AOL requests vs the true number of requests.
The use of multiple IP addresses assigns several uniques to the same user, and caching hides some users. Plus the same IP may be assigned to more than one user.
I'm just hoping I can find a "rule of thumb" that puts me somewhere in the ballpark, and in this case, I'd rather be under the true number than over so I may not try to account for caching.
I understand that all of this is done to give the AOL user a faster surfing experience, and it was very helpful back in the day of slower modems. But, sheesh!
Speaking of AOL IP addresses....
I'm attempting to do state by state advertising and I've noticed that when I do this, the AOL IP addresses look very similar when I get a snapshot of who is on my site. For example, say I just did a ad to people in CA, then I went to look at my logs and the AOL addresses were all(something like), 123.456.789.xxx, where only the 'x's varied. Then I went to another state and ran the ad, say, New York and when I checked my log for that time I found a different set of similar IP addresses, as in, 212.343.545.xxx, where only the 'x's varied.
So my question is: Can one tell from the AOL IP's what state someone is from? Or if not a state, perhaps a region?
Any help would be appreciated.
GuinnessGuy
I'm currently using a geolocation service that does pretty well on most non-proxy IPs -- better than 90% are narrowed down to a city or region. But for AOL they can only return country. If this service could do it, I'm pretty sure they would. They update their databases several times a day.
Thanks Ted. It's as I thought. It would seem that AOL hands out very similar IP addresses at any given time so my pipedream of having those similar IP addresses having something to do with geo-location were wrong.
If I were AOL, I'd offer a geo-location service for a fee. Seems that they could make some decent money doing this.
GuinnessGuy
I think a lot of AOL's multiple IP addresses has to do with their caching strategy. AOL's proxy servers cache a ton of static pages so you can never really tell how many page views your site truly has because sometimes they are being served from a cached version on an AOL proxy server.
This point is helpful to remember if you see a ton of first visit pages that are a couple of pages deep into your site and there are not a lot of direct links into that page. While there are probably some bookmarks, it is more likely that your home page has been cached, and the first server request is a page or two in.
Finally keep in mind that AOL has hundreds of proxy servers and all of them will show up significantly in your logs so always be looking for new IP ranges if you plan on monitoring IP blocks.