Checking the site's indexing in Google, I found that most pages other than the home page have been indexed with session IDs and moved to Supplemental Results.
I've pointed this out and suggested they stop using session IDs. However, the site is built by a large template-based web developer, and it doesn't look like they'll be able to change the way things work for a long time.
Short of moving the site, which is an option I've told them to consider, what else could they do? I've suggested a couple of things to think about:
Duplicate the page content on static pages that do not use session IDs. If they are able to do this, should we use a robots.txt file (or robots meta tags, if the structure of the file can't be changed easily) to keep the engines from trying to index the other page copies that use session IDs?
Detect engine bots and serve a page version with no session ID? If they can do this, is it something that might make the search engines think they are spamming?
Wait for the situation to change. Are search engines like Google working on fixing the issues with session IDs?
Site owners do have issues with session IDs. It's good practice to detect spiders and serve them a clean URL without long and ugly ID variables. However, all page content should be identical, regardless of whether the script is called with a session ID or not.
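For what it's worth, here is a rough Python sketch of the "detect spiders, serve a clean URL" idea. The parameter names in `SESSION_PARAMS` and the user-agent tokens in `BOT_TOKENS` are my assumptions, not something taken from the site in question, and the helper names are made up for illustration. Matching on the user agent is only a heuristic.

```python
from urllib.parse import urlsplit, urlunsplit, parse_qsl, urlencode

# Assumed session parameter names; replace with whatever the site really uses.
SESSION_PARAMS = {"sid", "sessionid", "phpsessid"}
# Assumed user-agent substrings for a few well-known crawlers.
BOT_TOKENS = ("googlebot", "slurp", "msnbot")


def is_probable_bot(user_agent: str) -> bool:
    """Heuristic check: does the user agent contain a known crawler token?"""
    ua = user_agent.lower()
    return any(token in ua for token in BOT_TOKENS)


def strip_session_params(url: str) -> str:
    """Return the URL with any session parameters removed from the query string."""
    parts = urlsplit(url)
    kept = [
        (key, value)
        for key, value in parse_qsl(parts.query, keep_blank_values=True)
        if key.lower() not in SESSION_PARAMS
    ]
    return urlunsplit(parts._replace(query=urlencode(kept)))


def url_for_client(url: str, user_agent: str) -> str:
    """Give crawlers the clean URL; leave the URL untouched for everyone else."""
    return strip_session_params(url) if is_probable_bot(user_agent) else url
```

Only the URL changes here. The page body served for the clean URL should be the same one a normal visitor gets, which is the point made above about identical content.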
Geeks have no issues with session IDs. They love session IDs. Unfortunately, they aren't involved in marketing tasks. Otherwise they would take a more elegant approach to Web development.
This is NOT a session ID, and Google indexes them fine after the first page is delayed:
It's a natural way of showing search engines how your site works, rather than using mod_rewrite. mod_rewrite can be a pain for advanced sites and for bug tracking, and it is impossible on some ASP servers.
These, however, can be session IDs and are bad:
A more typical session ID looks like this:
Never pass your session IDs via a URL! In ASP it's easy... dunno about PHP.
Google guidelines suggest this too could be bad:
I agree it should be written like this:
Hope it helps.
Allow search bots to crawl your sites without session IDs or arguments that track their path through the site. These techniques are useful for tracking individual user behavior, but the access pattern of bots is entirely different. Using these techniques may result in incomplete indexing of your site, as bots may not be able to eliminate URLs that look different but actually point to the same page.
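To illustrate the last sentence of that guideline, here is a small Python sketch that groups crawled URLs by the page they actually point to. The parameter name `sid` and the example.com URLs are assumptions for illustration only.

```python
from collections import defaultdict
from urllib.parse import urlsplit, urlunsplit, parse_qsl, urlencode

SESSION_PARAM = "sid"  # assumed name of the session parameter


def canonical(url: str) -> str:
    """Drop the session parameter so equivalent URLs compare equal."""
    parts = urlsplit(url)
    kept = [(k, v) for k, v in parse_qsl(parts.query) if k != SESSION_PARAM]
    return urlunsplit(parts._replace(query=urlencode(kept)))


def group_duplicates(urls):
    """Map each underlying page to every URL variant that was seen for it."""
    groups = defaultdict(list)
    for url in urls:
        groups[canonical(url)].append(url)
    return dict(groups)


# Hypothetical crawl: three URLs, but only two real pages.
crawled = [
    "http://example.com/p.asp?id=1&sid=aaa",
    "http://example.com/p.asp?id=1&sid=bbb",
    "http://example.com/p.asp?id=2&sid=aaa",
]
groups = group_duplicates(crawled)
```

A bot that does not know `sid` is meaningless has no such grouping step, so every new session looks like a fresh set of pages to it.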
And GoogleGuy has also warned - going back years - that session IDs in URL strings are bad news.....
Can you tell us more about these workarounds?
2. Make your site work even if the browser does not support cookies. Maybe assign a kind of default session ID to browsers that do not support cookies (like most SE bots).
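A minimal Python sketch of that default-session idea, assuming the session ID normally travels in a cookie. The cookie name `session_id` and the shared ID `anonymous` are hypothetical, as is the helper itself.

```python
from typing import Mapping

SESSION_COOKIE = "session_id"      # hypothetical cookie name
DEFAULT_SESSION_ID = "anonymous"   # hypothetical shared ID for cookie-less clients


def resolve_session_id(cookies: Mapping[str, str]) -> str:
    """Use the client's own session ID if a cookie came back, else the shared default."""
    return cookies.get(SESSION_COOKIE) or DEFAULT_SESSION_ID
```

Because every cookie-less client ends up on the same default session, that session should not hold anything user-specific such as a cart or login state. Nothing ever needs to go into the URL, so bots see one stable address per page.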