Forum Moderators: goodroi
www.dogs.com/large-dogs/german-shepherds
or
www.dogs.com/friendly-dogs/german-shepherds
It's a database-driven site and the "large-dogs" or "friendly-dogs" is used as a link on the article page to link back to a section on the site where the visitor was just browsing, so I need to pass that info.
I'm guessing this may be seen by Google as different urls with the same content and get a penalty, anyone have advice for how I can get around this issue? Or is Google smart enough and I'm worrying about nothing?
However, using your example, not all large dogs are friendly dogs (and vice versa), they're probably not going to be exact duplicates- just some overlap between them. Depending on the extent of non-duplicated data, Google is probably smart enough to figure things out.
That is, by all means have index pages for:
www.example.com/large-dogs
or
www.example.com/friendly-dogs
which list various dogs, but make the final dogs page something like:
www.example.com/german-shepherds
or
www.example.com/breeds/german-shepherds
The dog page itself can then have links that point back to "find other large dogs" and "find other friendly dogs".
My fallback is to follow your advice and just grab the primary section for the link out of the database, which would give the user the right section MOST of the time, but it would be much nicer if the link were right every time.
Look at how some of the best e-comm sites do this:
- find more of this colour
- find more of this size
- find more of this type
- find more of this brand
etc.
It is the ultimate cross-sell cross-linking. You don't know where the person on that page wants to go next. Make it easy to go wherever they want. Don't make them have to go all the back to the home page and start a new category drill down process.
[edited by: g1smd at 7:09 pm (utc) on June 2, 2009]
Doesn't seem that there's much I can do about it though, I'm def more worried about SE penalties for dup content than the extra I'd get from a better url.
Thanks again for the help, I'm learning a ton from your posts here.
EDIT
Ok, that didn't come out right, what I meant was it's kind of a breadcrumb and also kicks link juice back to specific section pages.
[edited by: sdguy at 7:14 pm (utc) on June 2, 2009]
Ok, I'm confused. Normally I would direct a user to an anchor point on a page with something like www.dogs.com/breeds.htm#german-shepherd, but with mod_rewrite a pattern is used and matched (ie /breeds) and then mod_rewrite loads the preset page (breeds.htm). I tried to get it to load breeds.htm#german-shepherd but it ignores everything after the #, is there any way to escape the # in mod_rewrite?
Two notes:
1) Of all the browsers in existence, only Apples' Safari sends URL-fragment identifiers to the server when it makes a request. So rewriting from a URL with a specific fragment identifier only works with Safari. Rewriting to a specific on-page named anchor can work, but...
2) The 'urgent' need to use and to redirect to named anchors on a page is a strong indicator that that page needs to be broken up into two or more more-specific-subject-oriented pages. This benefits both site usability and SEO factors, as each page is then more tightly focused.
Jim
I've tried using \# and it didn't work, I'll look into the [NE] flag.