Forum Moderators: open
GoogleBot may crawl the site one day and index the page with one session ID, and then a few days later get a different session ID for the same page.
The session ID doesn't change any of the content on the page.
Does Google see these as two separate pages? Or does it understand they are the same page treat it as such when calculating backlinks and PR?
For example, if Google follows 50 external links pointing to a paricular page and gets the same URL with a different session ID appended each time, is it smart enough to figure out that the 50 links actually point to the same page?
And then does it apply those backlinks when judging the value of the page?
I forgot: the pages have inbound links but obviously not shown ...
So every inbound link to a page would have a different tracking variable appended, but they all end up at the same page with the same title, content, etc.
Can Google discern that they all point to the same page even though the URLs are slightly different? Can it apply these as backlinks and do the links transfer PR?
From my own experience, a couple years ago (when we had session ids in the url) I would sit and watch live log files and see Google (and other search engines) indexing the same pages over and over... Though I can't be 100% positive - it seemed like Google saw that I had A LOT of duplicate content on my site.
Regarding passing PR - well, Google doesn't pass PR the same to duplicate pages as it does the original page it found. See the problem? How do you know what page it gave the PR too? Most likely a URL with a session id in it....
Example: Lets say your main page has a PR5 - in a happy world you may like to see "www.domain.com/pageA.html" with a PR4 (or PR5) - but with session ids in place - Google may never see "www.domain.com/pageA.html" - it sees:
www.domain.com/pageA.html?sessionid=GKD998423KFDJ...
www.domain.com/pageA.html?sessionid=9234GKD923OP2...
...etc.
Which page got the PR? Who knows.
What I ended up doing is writing some PHP to detect bots and turn sessions off it was a bot - so there was no chance of a session id being appended to the URL. By doing so I noticed PR being shared throughout our pages A LOT better than before.
I hope that helps in some way. You never know with Google though, someone could have a completely different experience than mine. Google may handle session ids differently now. But, I still think it would be wise to make sure those Session ids from your URLs - There are other bots out there to consider that may not take so kindly to session ids.
What I ended up doing is writing some PHP to detect bots and turn sessions off it was a bot - so there was no chance of a session id being appended to the URL. By doing so I noticed PR being shared throughout our pages A LOT better than before.
AprilS, did this also result in better rankings or more referrals from Google?
With so many uncertainties about toolbar PR and how many backlinks Google displays vs. how many actually count, it seems rankings and referrals are the truest measure.
Thanks
One thing that is truly unknown is how Google responds to seeing the same page over and over on a site - it may think you have thousands of copies of the same page.
Session ids are very valuable - but either:
1) Don't start sessions until you HAVE to (ie. customer adds something to the shopping cart)
OR
2) Detect search engine bots and don't start sessions at all if it is a bot