
Forum Moderators: coopster & jatar k


indexing visitor session urls

Trying to get dynamic urls indexed & ranked by google

     

abbey

7:29 pm on Jan 14, 2003 (gmt 0)

5+ Year Member



What's the deal with trying to get URLs like these indexed and ranked well by Google?

I understand there are 2 issues.
1. The characters in the URLs.
2. Visitor sessions.

Will Googlebot have a tough time with these URLs?
Is there a solution, simple or complex, to issues 1 and 2?

URL1.
[domain-name.com...]

URL2.
[domain-name.com...]

[edited by: jatar_k at 7:52 pm (utc) on Jan. 14, 2003]
[edit reason] shortened url a little [/edit]

jatar_k

7:47 pm on Jan 14, 2003 (gmt 0)

WebmasterWorld Administrator jatar_k is a WebmasterWorld Top Contributor of All Time 10+ Year Member



Welcome to WebmasterWorld abbey,

It looks to me like your two questions cover the same issue: the characters in the URL are the session ID.

Have you looked at using a transparent session id?
[php.net...]
Look for session.use_trans_sid.

This one shows you how to use ini_set to change it; the user comments below it are good as well.
[php.net...]

"Will google bot have a tough time with these urls?"

Yes, any time you add ?var=value you are starting to flirt with possible problems, regardless of what the engines say. For every var you add, your possibilities for problems seem to multiply exponentially.

dingman

8:37 pm on Jan 14, 2003 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member



My experience suggests that you're pretty safe with query strings as long as they don't change. I have URLs that look like 'http://my-site.tld/~me/?sect=foo&subsect=bar' that get spidered without any problem. However, there are definitely a few things to be aware of.

(1) To a spider, '?sect=foo&subsect=bar' is not the same as '?subsect=bar&sect=foo', even though they are of course the same to your script. Pick one. [Adam - this problem doesn't quite grow exponentially, but it does grow as the factorial of the number of variables, which is close enough for horseshoes ;)]

(2) If each visitor gets a session ID, and URLs on your site are re-written to include it, you might easily fool a spider into thinking that you had removed all links to a page it saw last month, just because the session ID is different.

(3) I've probably overlooked at least one possible complication.
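
To make points (1) and (2) concrete, here is a minimal sketch of normalizing a URL before you link to it: it drops the session parameter and sorts what is left, so every page has exactly one spelling. It is written in Python purely for illustration. The function name canonical_url and the SESSION_PARAMS list are my own assumptions (PHPSESSID is PHP's default session name; the other two are made-up examples), so adapt them to your site.

```python
from urllib.parse import urlsplit, urlunsplit, parse_qsl, urlencode

# Assumed session parameter names; only PHPSESSID is a real PHP default,
# the others are hypothetical examples.
SESSION_PARAMS = {"PHPSESSID", "sid", "sessionid"}


def canonical_url(url):
    """Return url with session parameters removed and the rest sorted by name."""
    parts = urlsplit(url)
    pairs = parse_qsl(parts.query, keep_blank_values=True)
    # Drop session IDs, then sort so parameter order is always the same.
    kept = sorted((k, v) for k, v in pairs if k not in SESSION_PARAMS)
    return urlunsplit(
        (parts.scheme, parts.netloc, parts.path, urlencode(kept), parts.fragment)
    )


if __name__ == "__main__":
    print(canonical_url("http://my-site.tld/~me/?subsect=bar&sect=foo&PHPSESSID=abc123"))
```

If every internal link goes through something like this, a spider sees the same URL on each visit no matter which session it was handed.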

I'd pretty much say that session IDs in the URL are a bad plan if you want to let spiders in. I've got a site that uses them, but it's a place where spiders are most unwelcome for other reasons. (If you can't log in, go away.)

jatar_k

8:51 pm on Jan 14, 2003 (gmt 0)

WebmasterWorld Administrator jatar_k is a WebmasterWorld Top Contributor of All Time 10+ Year Member



Yes, "exponentially" is pushing it a bit, but at some point the spiders won't index the var when you get up to 4, 5, 6 or more. I don't know the exact number, but at some point the query string reaches critical mass; I've seen it too many times.

The word "exponential" was carefully chosen to imply that query strings should be avoided if humanly possible, but thanks for calling me on it, dingman. ;)

dingman

9:32 pm on Jan 14, 2003 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member



Any time. ;) Obviously, in this case there is no qualitative difference between what you said and the strict mathematical truth.
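
For anyone curious about the factorial remark, a rough sketch in Python that lists every ordering of a set of query variables. The helper name query_orderings is made up for this post, and the extra variable names are placeholders.

```python
from itertools import permutations
from math import factorial


def query_orderings(params):
    """Return every query string obtainable by reordering the given var=value pairs."""
    return ["?" + "&".join(p) for p in permutations(params)]


if __name__ == "__main__":
    two = ["sect=foo", "subsect=bar"]
    for qs in query_orderings(two):
        print(qs)

    # Hypothetical four-variable case: the count equals 4 factorial.
    four = ["a=1", "b=2", "c=3", "d=4"]
    print(len(query_orderings(four)), factorial(4))
```

With n distinct variables there are n! spellings of the same page, which is why picking one order and sticking to it matters.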
 
