Forum Moderators: DixonJones
My tracking script automatically filters out a dozen or so known bots, but as you know lots of bots pose as IE by faking the user agent. To get around this I manually go through the click database once a week and remove clicks that look like bot clicks (e.g., something that clicked every single banner in every single position in the same minute, and which didn't click any of the few banners we have which are generated by Javascript).
This works okay but its tedious, and there are many times I'm not sure whether clicks are bots or not. So I was trying to think of another way to do it. I figured that I could set a cookie and then try to read it, and if I failed to read it then assume the agent is a bot and don't record the click. I wouldn't get to count real people who have turned off cookies, but I imagine they're less than 5% of users, probably less than 1%.
So my question is: Does this seem like a good idea? Seems like it to me, but I'm new at this and could be missing something. Is there a better way to exclude bot clicks when tracking ad campaigns?
Incidentally, the solution does have to be home-grown -- I need the data stored in my own database so I can easily customize the reports that advertisers see and do other things with that data with scripts. So I can't use any off-the-shelf software.
Thanks for your help.
I'd have no idea how to implement that.
Have 2 scripts:
1) Ad script will record your normal things (IP, time etc) and, if its first request from that IP/useragent in the last X hours then mark it as BOT, if its second or more requests and it is still marked as BOT then don't forward click
2) Another script will be included as 1x1 pixel image written into code of the page using JavaScript write function -- this script would execute after actual click occurred and it should unmark previous request that was marked as BOT.
<IMG src="php or CGI script" height= "1" width="1"> Now the question remaining - what to do with the script. If the script is called for that pageview the clicks are valid. Here is where it gets beyond me but if you go into the CGI or PHP forum (whatever your server supports) I'm pretty sure they could help you with it. Or you could match referers for the page with referers for the script, or match referer for the script with referer for the link, if it has both it's a click.
I just thought too - if your already tracking clicks on the banner then you just need to track requests for the banner image as well. If an IP requests the banner image then it's flagged as valid.
Here again, I have no idea how to test for this.
I think I'm just going to use my original cookie-writing idea.