Forum Moderators: open

Message Too Old, No Replies

Dynamic urls and session ids

why use them?

         

bluecorr

12:55 pm on Oct 12, 2003 (gmt 0)

10+ Year Member



Hi!

I've never optimised a dynamic site before but now I've been asked to analyse one that is dynamically generated. The pages are all jhtml and every link is associated with a session id. The number of scripts, flash and other yukky stuff is amazing. Now, because I am not a programmer I'd like to know why session ids are used for. I know I have to recommend them to drop them because Google can't index the site but I'd like to know why anyone would be doing it in the first place, what is it used for and if it's essential or not. If it is essential, then can a site be optimised and still keep session ids?

Thanks

GoogleGuy

11:13 pm on Oct 12, 2003 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member



Hey bluecorr! Often as a user surfs or shops on a site, the site owner wants to track what pages they've seen, or stuff like what products are in their shopping cart. A session-id lets a site owner give each user a unique identifier. That identifier can reference customer data like the shopping cart contents stored in a database, for example.

Some people would mention that you could use a cookie to do the same thing and keep the urls much cleaner. That's true, but not every user has cookies enabled. Using session-id's is one way to try to guarantee that you know that state of a user, even if they don't allow cookies, for example.

So what's the problem with a session id, and why doesn't Googlebot crawl them? Well, we don't just have one machine for crawling. Instead, there are lots of bot machines fetching pages in parallel. For a really large site, it's easily possible to have many different machines at Google fetch a page from that site. The problem is that the web server would serve up a different session-id to each machine! That means that you'd get the exact same page multiple times--only the url would be different. It's things like that which keep some search engines from crawling dynamic pages, and especially pages with session-ids.

Google can do some smart stuff looking for duplicates, and sometimes inferring about the url parameters, but in general it's best to play it safe and avoid session-ids whenever you can. Hope it makes sense!

best wishes,
GoogleGuy

plasma

12:36 am on Oct 13, 2003 (gmt 0)

10+ Year Member



What GoogleGuy forgot to say:

Just turn off the SIDs for UserAgent googlebot (and other identifiable robots) and leve them on for ordinary visitors.

bluecorr

5:42 pm on Oct 13, 2003 (gmt 0)

10+ Year Member



wow GG, great stuff! Mind if I refer the client to this post? ;)

Cheers.

steve_3231

11:47 am on Oct 14, 2003 (gmt 0)

10+ Year Member



We have a site that creates the links off of the domain name typed in by the user. If the user does not type in the main domain name, but another name that is parked to the main one, then the links are all created as [parkeddomain.com...] It sounds like we need to make all links static?

xszer

2:55 pm on Oct 14, 2003 (gmt 0)

10+ Year Member



steve_3231 said >We have a site that creates the links off of the domain name typed in by the user.

Does Google view all your different parked domains as separate websites?