Forum Moderators: open
We are using a content management system that serves up dynamic pages and our site also requires cookies. Our logfiles show that googlebot is visiting our site but we don't seem to ever appear in their search engine.
Thanks for your response. It is currently impossible to view our site without cookies turned on. However, over a year ago, our developers added code to check the HTTP_USER_AGENT and allow anything with "Googlebot" to enter the site.
I'm wondering if maybe this code isn't working correctly.
I think the code is working correctly, and that's your problem - serving different content to googlebot than to a visitor is cloaking, at least from the technical standpoint, and that means you will be dropped for spamming the index.
I did the same thing about a year ago, with no-spammy intentions, I just did not want googlebot to become entreched in my session-ids. But, sadly, googlebot cannot detect good intentions, so I got dropped from the index ;-(.
Having something like if(strstr("Googlebot", $HTTP_USER_AGENT)) anywhere in your website is a grave error, imho.
Try to make your site browsable without a cookie.
What do you need the cookie for? If its sessions, only assign one after the user requested a page/funtion that uses them (In my case, it was a company page with an online shop, the first version assigned a session on ever pageview, now I only assign it when somebody actually puts a product in a cart, so spiders can crawl the page without problems)
I think the code is working correctly, and that's your problem - serving different content to googlebot than to a visitor is cloaking, at least from the technical standpoint, and that means you will be dropped for spamming the index.
That is not true. Detecting Googlebot for the purposes of serving your content without a session ID or a cookie is not cloaking and it will not get you banned. If you were doing that and you dropped from the index it was a coincidence.
Cloaking in the eyes of Google is serving different content to spiders. The absence of a session id or a cookie does not create different content.
You can find GoogleGuy's comments on the subject here [webmasterworld.com]
Just my 2 cents.