Forum Moderators: phranque
Also, do SE's prefer the submission of index.htm to say mydomain.com/home.htm etc.
Organizing them? I try to keep the number of directories as short as possible. There is some fairly signifgant data that suggests more than 3 directories deep decreases your rankings on some engines.
Filenames and paths should also be kept as short as possible. My working guideline is less that 35 characters total including the domain name.
The filenames should of course be a keyword related to page content. Directory/path names should be the next logical grouping hire. Most often I use the path to build a keyword phrase. For example: "searchengineworld.com/spiders/google.htm".
I can vaguely remember something about yahoo penalising long URL's
either way, the shorter the better.
Ive been using descriptive names for files under numbered folders, though more larger projects in the future look like they will inevitably have to be numbered for continuity
I try to keep the number of directories as short as possible. There is some fairly signifgant data that suggests more than 3 directories deep decreases your rankings on some engines.
Here's something posted just this week by Robber in the thread about the current FAST update [webmasterworld.com]:
One thing I have been investigating, but don't have good evidence for is that pages more than a couple of levels downs (in terms of directory structure) have taken a hit in terms of both ranking (eg top 3 to third page) and possibly getting excluded.Like I say, no firm evidence since it is not something I've really looked at until now, but this does seem to be quite consistent with my situation.
FAST updated 3 times to googles 1, all 4 spiders getting the same directory structure, but FAST this time round reducing the amount of pages available from my site.
I guess its quite safe to say that directory structure is not a factor there (or a major factor in light of my experience)
In regards to naming the files for the purpose of SE's, I name most of my folders as letters or numbers and use keywords for the htm or asp pages themselves, sort of giving an "indication" of what the content is to SE's, hovering over link n looking at status bar etc
It almost gets to the boundaries of ridiculousness when SE's dictate what folder structure you use and what you call them