Forum Moderators: Robert Charlton & goodroi
Checking search engine indexing in Google most pages other than the home page have been indexed with session id’s and moved to Supplemental results
I’ve pointed this out and suggested they stop the use of session id’s. However the site is developed by a large template based web developer and it doesn’t look like they’ll be able to change the way things work for a long time.
Short of moving the site, which is an option I’ve told them to consider, what else could they do? I’ve suggested a couple of things to think about –
Duplicate page content on static pages that do not use session id’s. If they are able to do this should we use a robot.txt file (or robot meta tags if the structure of the file can’t be changed easily) to keep the engines from trying to index the other page copies that use session ids?
Detect engine bots and serve a page version with no session id? If they can do this is this something that might make the search engines think they are spamming?
Wait for the situation to change. Are the search engines like Google working on fixing the issues with session ids?
Site owners do have issues with session IDs. It's good practice to detect spiders, serving them a clean URL without long and ugly ID variables. However, all page content should be identical, regardless whether the script is called with a session ID or not.
Geeks have no issues with session IDs. They love session IDs. Unfortunately, they aren't involved in marketing tasks. Otherwise they would apply a more elegant Web development.
This is NOT a session ID and Google index's them fine after the first page is delayed:
domain.com/page.asp?id=123
Its a natural way of providing SE with how your site can work rather than mod re-write. Mod re-write can be a pain for advanced sites and bug tracking and impossible for some ASP servers.
These however can be session id's and are bad
domain.com/page.asp?id=h4to489fji4t9io49fk3et3
a more typical session id looks like this:
domain.com/page.asp?sid=h4to489fji4t9io49fk3et3
domain.com/page.asp?SESSIONID=h4to489fji4t9io49fk3et3
never pars you session ID's via a URL! In ASP its easy... dunno about PHP.
Google guidlines suggest this too could be bad:
domain.com/page.asp?page=2&id=123
I agree it should be written like this:
domain.com/page.asp?id=123&page=2
Hope it helps.
Allow search bots to crawl your sites without session IDs or arguments that track their path through the site. These techniques are useful for tracking individual user behavior, but the access pattern of bots is entirely different. Using these techniques may result in incomplete indexing of your site, as bots may not be able to eliminate URLs that look different but actually point to the same page.
And GoogleGuy has also warned - going back years - that session IDs in URL strings are bad news.....