Forum Moderators: open
As andreas mentioned, you can get all of that info from your server logs and analyzing them has no effect to any bots.
For over 12 months now I log every page accessed on any of my projects. They are all served out of my own small CMS which can react on the user agent information (e.g. replace highly dynamic parts of pages with static info for Googlebot etc.) and parses page accesses on the fly.
E.g. the server sends me a mail when Googlebot starts spidering a domain or a new page is first hit by a referer url containing "%google%" and extracts the search query string from SE referer URLs and logs them to a special table etc.
Anyway, I would be quite interested in whatever efforts you make in this direction. I still try to make my tools more flexible and intelligent.
Rincey
If you serve different content based on human/bot visits for the explicit reason of increaing rankings you are playing with something engines hate.
I am not saying don't do it, I am just saying make sure you do it right because an overly apparent mistake could cost you. The learning curve becomes very steep.
Try this search [google.com] and click the 4th spot (this forum). See the search term being highlighted?
It can be done ;)
<added>Gee, if I only read the whole thread - better keep out of this forum *blush*</added>
Could this be considered dangerous? I don't have any other way to get referrer info on internal pages.
Thanks.
1) Referring URL
2) Browser Type
3) First page hit
4) IP Address
I can run queries on the data however I like now. Believe me, it's valuable information to have in a database format.
Again, I don't have access to the server logs. I am using an embedded image tag <img src=../cgi/save?referrer=> (where referrer is captured from the JavaScript Document object) to forward this information from each internal page on my site. Could this appear to be a cloaking technique to a penalty filter or does anybody see any other penalty dangers with this technique?
And surprise surprise surprise, this guy, who I have seen him posting in another forum, who like to bash other people web sites and claimed himself to be the BEST have actually CloaKing ...
Why I say this? This guy really have a lot inbound links to his site, all from related area. But if you go diretly to this site, it looks like the main page. But if you get in through other sites, it parse the referral URL and reflect the page it should display, which is a status of that link site.
Since googlebot never send referral URL she will always sees the main page.
I am sure this is call Cloaking, just I am not sure if it is LEGAL cloaking or ILLEGAL cloaking. I didn't report this case. I would like to gather more information and comments before I do so.... would like to listen to you all...