Forum Moderators: Robert Charlton & goodroi
Do make sure that your rewrite rules pass every part of the URL to your script for verification, and that any non-valid value returns either a proper 404 error in the HTTP header, or else a 301 redirect to the correct URL in the case where for your page
/funnyvideos-259-ball.html someone links to /funnyvideos-259-[i]do-not-watch-this-junk[/i].html. In fact, at this point, why not go extensionless, dropping the .html part from the URL? Only "real" files on your server, like robots.txt, stylesheets, js files, and images will therefore have an extension. This makes your rewrite rules a LOT more simple and a heck of a lot more efficient. In particular you can do away with the very time-consuming "exists as a file" checking.
Another thing here, if you are going to have category and name in the URL, do make sure that for the requested name, that the category in the URL is valid for that name. That is, if it is incorrect, either send a 404 error or send a 301 redirect to the correct URL. Failure to do so, invites infinite Duplicate Content problems.
Personally, I would give each video a unique ID and not worry about carrying category information in the URL. That data would be pulled from the database, keyed by the unique video ID number and used to populate page content and navigation.
Are you sure it is related the "number of slashes in the URL" and not just the "number of clicks away from root"? :)
As I said, it's just been from my observations, and I'm sure there are just as many (if not more) counter-examples.
I remember some pages that were a few levels down, but linked from the site's index page.
I've explored pages in the same situation, but on those I've checked, the TBPR shows the same as other pages linked directly from the index page.
Additionally, I could argue, depending on how the nav structure is set up (and what kind of deep links the site has), the actual PR on these pages may well be higher than pages linked only directly from the index page... but generally only by a fractional amount that you wouldn't see on the Toolbar.
It is a question of click levels, as well as how PR is split up along the way. The directory levels should have nothing to do with it, but many sites are set up so click levels correspond to directory levels, and it therefore often appears that directory levels are what matters.
[edited by: Robert_Charlton at 9:44 pm (utc) on May 21, 2009]