I have been working on a website that is in twelve different languages.
Users enter the site and select a language from text links at the top of the screen (no drop-down menus etc.).
Each language has 8 pages. The URL pattern is www.domain/language/index
The site is all static HTML, with minimal graphics and lots and lots of content on each page.
When I search ATW for indexed URLs, all the English pages are there, but for the other languages only the first page (/language/index) is there.
Any ideas?
Probably not. A link is a link and should get the spider to crawl your site.
Is the linking structure valid? E.g. are the links in plain href format (no JavaScript etc.)?
Try checking it with the Sim Spider: [searchengineworld.com...]
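If you want to check the href question by hand, here is a rough sketch in Python of how a very simple spider might pull links out of a page. The class name LinkCollector, the function crawlable_links and the sample markup are all made up for illustration; this is not how the Sim Spider or any real search engine works internally, just the general idea that only plain href values get followed.

```python
from html.parser import HTMLParser


class LinkCollector(HTMLParser):
    """Collect href values from <a> tags (hypothetical helper for illustration)."""

    def __init__(self):
        super().__init__()
        self.hrefs = []

    def handle_starttag(self, tag, attrs):
        # HTMLParser lowercases tag and attribute names for us.
        if tag == "a":
            for name, value in attrs:
                if name == "href" and value:
                    self.hrefs.append(value)


def crawlable_links(html):
    """Return hrefs a basic spider could follow, skipping javascript: pseudo-links."""
    parser = LinkCollector()
    parser.feed(html)
    return [h for h in parser.hrefs if not h.lower().startswith("javascript:")]


# Made-up sample markup: one plain link, one JavaScript pseudo-link.
sample = '<a href="/deutsch/page2.html">Seite 2</a> <a href="javascript:go()">x</a>'
print(crawlable_links(sample))
```

If a page's navigation only shows up in the skipped group, a spider that does not run scripts has nothing to follow from that page.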
I tried the spider suggestion. When it goes to spider one of the lead language pages (www.domain/language/index), it returns error code 500, which I take to mean a timeout. Sure enough, it cannot spider beyond the index page of each language. What it does do is misread the URL: instead of www.domain/language/index, it produces www.language/index.
There is no Java or anything else in the site, just straight static HTML. It has got me stumped. Any advice much appreciated.
Are you using absolute URLs (http://www.domain.com/language/page.html) or just relative ones (/language/page.html) in your HTML?
You should be using absolute URLs, or at least insert a base href tag:
<base href="http://www.domain.com/"> to tell the spider the root domain. Try that and run the Sim Spider on it again.
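To see what the base href changes, here is a small sketch using Python's standard urljoin, which follows the usual rules for resolving a relative link against a base URL. The file names index.html and page2.html and the resolve helper are assumptions for illustration only; substitute your own paths.

```python
from urllib.parse import urljoin

# Assumed example values; the real site's file names may differ.
page_url = "http://www.domain.com/language/index.html"
base_href = "http://www.domain.com/"


def resolve(href, page_url, base_href=None):
    """Resolve a link the way a spider would: against <base href> if the page
    declares one, otherwise against the URL of the page itself."""
    return urljoin(base_href or page_url, href)


# Without a base tag, links resolve against the page's own URL.
print(resolve("page2.html", page_url))
print(resolve("/language/page2.html", page_url))

# With the base tag, document-relative links resolve against the site root.
print(resolve("page2.html", page_url, base_href))
print(resolve("language/page2.html", page_url, base_href))
```

One thing to watch: once the base tag points at the site root, a bare link like page2.html no longer resolves inside /language/, so relative links need to include the language folder (or be written as absolute URLs).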