Forum Moderators: bakedjake

Message Too Old, No Replies

AOL referrer with encoded keyword

         

pixel_juice

4:54 pm on Jul 6, 2003 (gmt 0)

10+ Year Member



Just noticed a referrer from AOL search which caught my eye because the keyword used was encoded, and came out as something like encquery=88AC203B263C05. I tried removing characters from the encoded query, but it takes me to a 404 not found page if I change any characters.

Does anyone know what sort of encoding this is, or have any ideas why the keyword was encoded? I'm unable to get a search on aol to show up with an encoded search term. I'm guessing it might be something obvious that I have just failed to understand ;)

Also a google search for enquery returns only 7 pages - mostly stats pages, and one page about "Runtime / Global / Dynamic Parameters" that is a bit too technical for me this afternoon :)

mcavic

5:34 pm on Jul 6, 2003 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member



Hmm. That's a really obnoxious thing for AOL to do. I have a few of those in my log from this month, but none last month.

Anyway, try pasting the whole url into your browser, and it should tell you in the title bar what the search terms were.

pixel_juice

7:26 pm on Jul 6, 2003 (gmt 0)

10+ Year Member



Thanks mcavic.

I was interested in why they would bother to encode the search term at all. Also, if this is something aol are planning on using more often, it will make it pretty tricky to find out search terms from large numbers of referrers.

What's the benefit to AOL (or the searcher) in encoding the query?

mcavic

9:38 pm on Jul 6, 2003 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member



it will make it pretty tricky to find out search terms from large numbers of referrers.

That's the only reason I can think of - privacy. Still, it doesn't make sense to me. The way the AOL cache/proxy works, it also makes it near impossible to track a user's click path through your site.

jrobbio

10:30 pm on Jul 6, 2003 (gmt 0)

10+ Year Member



9 times out of 10 if I get a strange or disturbing search referring to my site, its from AOL Search or an AOL user on a different engine. This could be a privacy issue as mcavic said.

On another note, it could be a tracking technique though I do not understand it all very well.

Rob

jeremy goodrich

5:13 pm on Jul 7, 2003 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member



AOL started doing this with some referrers to my sites, as well -> very bothersome, when I browse my home grown keyword stats package, to see some random string instead of the keyword data. :(

Hopefully they'll 'see the light' and continue to give better referrers...

wart888

11:31 am on Jul 10, 2003 (gmt 0)

10+ Year Member



so the bottom line is we need to figure out how to decrypt it (i have seen it with other search engines.)

This involves two stages:

1. figure out the encryption algorhythm (of about 5 widely used)

2. Obtaining a key (the difficult part)

Does anybody feel up to it? The race is on, because i'm suspicious this is becoming a trend.

The way the AOL cache/proxy works, it also makes it near impossible to track a user's click path through your site.

this is possible, and it is available with at least one site I know, a visitor tracking service who can do this. Send me a sticky mail for info :)

Last thought: its probably not a privacy issue (AOL is not famous for protecting user privacy) Its probably just protecting their info - which has market value.

olias

1:37 pm on Jul 10, 2003 (gmt 0)

10+ Year Member



Its probably just protecting their info - which has market value.

That would be my guess too, but it seems like a fairly pointless excercise. If I want the referrer information I'll just end up writing a script to query all the encoded refferers and find out what they were.

pixel_juice

6:33 pm on Jul 10, 2003 (gmt 0)

10+ Year Member



>>it also makes it near impossible to track a user's click path

You can use a cookie to uniquely identify the visitor which works pretty well and is much more reliable anyway in the long run IMO.

>>a script to query all the encoded refferers and find out what they were

That sounds like a good plan - i've seen a lot more of these referrers. Hope you'll let us know if you come up with anything (likewise wart888) :)

mcavic

9:47 pm on Jul 10, 2003 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member



You can use a cookie to uniquely identify the visitor which works pretty well and is much more reliable anyway in the long run IMO.

Good point. I don't really like sites that set a cookie on every hit, so I only use them for shopping carts and such.

Does anybody feel up to it? The race is on, because i'm suspicious this is becoming a trend.

I'll give it a shot, but it'll be my first attempt at breaking a code. I did check to see if it's just the ascii characters encoded into hex. That's not it.

mcavic

10:24 pm on Jul 10, 2003 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member



Ok, I played around a little more... it's not a simple code.

Adding or removing characters from the encoded string causes a 404 error. But, changing the value of a single encoded character causes a whole word in the unencoded string to be corrupted.

The beginning of the encoded string seems to correspond to the beginning of the unencoded string, and so on with the middle and end of the string. But, characters in the unencoded string must be hashed together in some way.

bird

8:58 am on Jul 11, 2003 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member



I have only one example to play around with so far, does anyone know when in which situation they do this transformation? It can't be from a normal search, at least not without a redirect in between.

The encoding works in blocks of 8 characters of the original string at a time, turning that into 16 hex characters. This may be necessary so that they can cover the complete range of 16 bit Unicode characters.

wart888

9:19 am on Jul 11, 2003 (gmt 0)

10+ Year Member


http://aolsearch.aol.com/aol/ search?encquery= 56ED29EE927026BFCA1B834C32E4CB68&invocationType =keyword

http://aolsearch.aol.com/aol/ search?encquery=58310702BC93BBC6BC05B619A1E15263&invocationType =keyword

http://aolsearch.aol.com/aol/ search?encquery=3664A2C66E18E95E7AD5917A8D87D127&invocationType =keyword

they only use encoding for searches from people that use AOL as ISP

mcavic

4:30 pm on Jul 11, 2003 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member



they only use encoding for searches from people that use AOL as ISP

I don't think so. Most of the aolsearch hits in my log are from AOL users, and most of them aren't encoded.

The encoding works in blocks of 8 characters of the original string at a time, turning that into 16 hex characters.

Yes, that's true.

netcommr

8:49 pm on Jul 11, 2003 (gmt 0)

10+ Year Member




I just think they are deperate...

How many more ads have they shown from webmasters checking keywords?

pixel_juice

1:23 pm on Jul 12, 2003 (gmt 0)

10+ Year Member




>>they only use encoding for searches from people that use AOL as ISP

I don't think so. Most of the aolsearch hits in my log are from AOL users, and most of them aren't encoded.

Although they may not be using this for ALL aol user-searches, every example i've come across of an encoded keyword has been someone on AOL.

>>How many more ads have they shown from webmasters checking keywords?

Haha, I never thought of that, but I find it unlikely that this would be the main reason for encoding.

netcommr

1:45 pm on Jul 12, 2003 (gmt 0)

10+ Year Member




_juice> just a joke... ;) Warner Bros. is putting a ton of pressure trying to get out from under AOL. Especially before the next 'Harry Potter' & 'Lord of the Rings' movies are released(Dec. I think). This would put them (AOL) finacially in trouble. Movie success has been financing the web blunders for couple years now. Been suckin them(WB) dry and their po'd.

could be some are encoded and others not is due to the version of AOL being used. Maybe the newest version has this feature and others not?

are all the encoded terms 32 chars in length? this may be a tip you won't be able to decode, probably tied to an internal database of words/terms

olias

2:13 pm on Jul 12, 2003 (gmt 0)

10+ Year Member



could be some are encoded and others not is due to the version of AOL being used

Judging by user agent strings I am seeing AOL 6,7 and 8 users.

are all the encoded terms 32 chars in length?

No, I am seeing lots that are 48 chars in length as well, I have even seen one that was 80 long!

netcommr

2:19 pm on Jul 12, 2003 (gmt 0)

10+ Year Member



thanks olias, I haven't seen a log in ages.

pixel_juice

2:54 pm on Jul 12, 2003 (gmt 0)

10+ Year Member



Since my first post where Google returned 7 pages searching for encquery, there are now about 568 - so this is certainly on the increase.

claus

3:06 pm on Jul 12, 2003 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member



You don't really have to break the code except for the fun of it. Meaning: you have the querystrings (refrr), and you can read the query off the AOL Search page.

i feel somewhat bad about posting this, as i don't offer the full solution, but here goes anyway:

a) parse AOL-strings from log to data.txt
b) send strings off and recieve webpage
c) parse this string: <title>AOL Search: Results for "QUERY"</title>
d) dump QUERY to queries.txt

- you could do it with perl, php, asp, whatever... Anyway, be sure to put the script in the public domain and return an appropriate UA string to AOL, like "we want to see our users queries" ;) Personally, i don't have any visits from AOL.

/claus

mcavic

3:17 pm on Jul 12, 2003 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member



probably tied to an internal database of words/terms

Thought of that, but no, because if you change one of the encoded characters, you get gibberish for the search string - the gibberish wouldn't be in the database.

No, I am seeing lots that are 48 chars in length as well, I have even seen one that was 80 long!

All multiples of 16, though.

you have the querystrings (refrr), and you can read the query off the AOL Search page.

True true, but it would take time and bandwidth to run. Might be the only way, though.

pixel_juice

3:21 pm on Jul 12, 2003 (gmt 0)

10+ Year Member



>>All multiples of 16, though.

I saw a 16 character one too.

>>but it would take time and bandwidth to run

Exactly, and depending on what you are using to get stats might be pretty tricky to implement. Why do AOL always have to be different?

bcc1234

3:53 am on Jul 25, 2003 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member



I think they encode the query string to prevent adware from stealing their PPC clicks.

I saw a few more sites doing that.

bird

7:53 am on Aug 1, 2003 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member



Anybody have any thoughts on what algorithm would be used in this case?

Take the utf16 representation of the string, and juggle the bits any way you please... ;)