Forum Moderators: coopster

Message Too Old, No Replies

languages and spiders seo

headers language spider seo

         

elkiwi

10:24 am on Apr 16, 2005 (gmt 0)

10+ Year Member



Hi I'm developing a dual language site where a users language is set by a session on the index page of my site.

All pages are dynamically parsed from a language array and database depending on the users language choice.

eg:

$HTTP_SESSION_VARS['lang'] = 'en';

If this session is not set they are redirected to the index page where the session is set.

My question is what effect will this have on spiders visiting my site? When I try to validate a page at w3.org for example I can't because the validator just gets sent to my index page becuase there is no language set.

Does anyone know of a solution to this problem or have I just done this language thing badly?

Thanks

jatar_k

4:18 pm on Apr 16, 2005 (gmt 0)

WebmasterWorld Administrator 10+ Year Member



well, you need you site navigable with out the necessity for the session value. The spiders won't hold the session and you could default them to a language but then they would only be able to spider one set of pages.

You need to allow them access to the whole site in every language. links on every page for the same page in the other languages and once they have selected a language they are essentially confined to that part of the tree.

It's funny coopster and I were just chatting about this yesterday, we both do it very differently but they both work very well.

elkiwi

1:39 am on Apr 17, 2005 (gmt 0)

10+ Year Member



Hi, what are your different ways of doing it?

I want to add more languages later on and the way I have done it I only have one set of pages to edit, which is nice for not having to update 20 pages in 4 languages. If you have some suggestions I would love to hear them.

Thanks

Peter

jatar_k

5:00 pm on Apr 17, 2005 (gmt 0)

WebmasterWorld Administrator 10+ Year Member



I think coopster uses apache content management, he could explain that better, there might even be a post around about it somewhere.

I actually replicate the directory structure and use common templates, so menus and everything are central but all content is stored on individual pages. I like doing it this way and have for years, even if others think it's more work. ;)

coopster

10:10 pm on Apr 17, 2005 (gmt 0)

WebmasterWorld Administrator 10+ Year Member



hehe, am i being categorized in the "others" group here? ;)

jk is correct, I prefer Apache's Content Negotiation. I keep foreign language documents right in the same directory as the base language documents for the site. You can use MultiViews if you would like, but variant files are faster. A bit more work on the front end though.

Then, if you want to make it look your have sub directories of the foreign languages you can do so. SE's will pick them all up because you can have the "Additional Languages" link on your page(s).

Have a look at the Apache docs and dig into the httpd.conf file. You'll learn a lot by just analyzing the way the developers made the Apache manual.

elkiwi

8:07 am on Apr 18, 2005 (gmt 0)

10+ Year Member



I skimmed Apache's Content Negotiation in the manual and got dizzy. Sorry

What if I create a subdomain for each language like es.domain.com only making the index of each determining the language. Then submit each domain separatly to SE. Those indexes would then just link to the rest of the "common" pages.

I've tried spidering the site pointing a scanner at the index of each language and it finds the rest no problem in the right language.

Would this solution work seeing as I don't think I'm going to pass Apache's Content Negotiation 101.

Thanks

jatar_k

4:24 pm on Apr 18, 2005 (gmt 0)

WebmasterWorld Administrator 10+ Year Member



well, you dont need to submit anything, get incoming links and they will come.

subdomains is fine if that is the way that works for you, though having them all on the same domain might be nice for linking.