Forum Moderators: open
1) The underscore -- an old conversation really, and settled to my satisfaction months ago by inspecting DMOZ pages in the Google index. A filename such as kw1_kw2.html doesn't (or didn't) get seen as containing either keyword. A hyphen separates them, but an underscore doesn't. I wonder if anyone has seen any counter evidence recently.
2) The pipe -- this one really feels bad to me. The pages that use a pipe in their filename seem to be indexed (in Google, Inktomi and others) with the pipe character turned into "%7C". But I really wonder about what kind of back-end trouble it's causing and whether those keywords are seen as separate
3) The period or dot -- I'm talking about a page name like atlanta.georgia.html here. I don't thinks there's any problem with the keywords being seen as separate, but don't some operating systems have trouble with periods when they're not file extensions or domain name separators? These pages don't seem to be getting the rank I'd expect from Google and I'm wrestling with the period as one possible culprit.
1) GoogleGuy has confirmed here (about 4 or 5 months back) that underscore does not separate keywords, whereas hyphen does.
2) I've never even considered using a pipe in a filename. It used too have other functions in DOS, so I'd be scared of accidental side-effects. I think though, that any character that gets converted to %? can cause havoc with a browser's history - the stored url never matches the (unconverted) link url and a link will never be marked as visited. This is a problem with spaces too.
3) Again this is really a matter of cross-platform compatability. I imagine those extra periods would cause havoc on a Windows system, making life very difficult for your competitors trying to cache your pages to study your SEO techniques. ;)
Funny enough, I've done some more research and the pipe character does get seen as a "word seperator" by Google. The browser history argument makes a strong point however, even if Google has learned how to cope with the pipe character.
If anyone who reads this still needs convincing that Google doesn't see underscores as word separators (they are only seen as a character in the middle of a long word), play around with DMOZ's pages on Google. DMOZ uses underscores as a rule, and there's no shortage of fodder for research. For example, do searches like this:
allinurl: open site:dmoz.org
There is a page named http://www.dmoz.org/Computers/Open_Source/ but you don't see it in the results. Not even searching DMOZ on allinurl: "open source" works. The only way that page gets returned is on this search:
allinurl: open_source site:dmoz.org
The dash and the period don't show me any keyword separation problems on Google, however, and I've invested a decent bit of research time on the issue.
I've not once since 1996 used an underscore in any file names, neither have I used mixed case, I always use lower case.
I keep it simple too. I want to provide the shortest possible route to the resource. I only use hyphens to separate words in file names when applicable.
Brett has done an excellent job of keeping the URIs here at the forums very neat and trim. ;)
My experience as a trainer (almost twenty years) has taught me that the most valuable PC skill is good file management. SEO considerations aside, a well-structured site displays clear planning, easy navigation and easy maintenance and expandibility. Also, I get a buzz any time I'm looking for a file I made a year ago and I can guess exactly what I called it. When that happens, you know you're doing something right. (Doesn't always happen though :( .)
Fortunately (I hope!) we're talking about rules for re-writing dynamic URLs for asp pages. It should be an easy fix, now that I've convinced the client that it needs to be done. But as with any "should be easy" fixes, I'm ready for all the hidden dependencies that aren't immediately obvious. Who knows what dragons lie hidden in that database tangle?