Forum Moderators: open
Also, how long can a URL be to still be SE friendly.
This website I am talking about has LONG URLs, plus SPACES, plus SYMBOLS, plus word ID.
It seems like completely SE unfriendly.
Check it out:
detail.htm?stylepkey=11049&style_id=030%20COMBB0&dept_id=3&deptName=Parts&sub_id=50&subName=Campagnolo%20Parts&lprice=42.98&hprice=42.98
First I just wanted to have a ASAPI rewrite to replaces?, &, = and spaces by -
Would that be enough?
We also have the problem of the product prices that have dots in it, and also the word ID in some of the paremeters...
Any ideas?
Thanks
I believe it's not the "styleID" that's the problem, it's the next bit ... "ID=" which screams Session ID to a bot.
I understand that parameter names should not be longer than 10 characters for effective spidering of dynamic pages in Google.
For the other SEs paid inclusion or trusted feed seems to be the only solution I can find for dynamic pages.
I can't help you with the ISAPI rewrite thing as I'm only just starting to play with this myself so haven't tested it fully, however it looks like it's the answer to my prayers.
Back in November we were really struggled with this problem, everyone I talked to said that our urls were fine, however google on had half a dozen of these pages in the index. looking through my logs, I would notice the bot calling from the page without the query string. Of course that would give them an error because the page could display anything (other than an error message) without having the query string data. For example our looked like this:
www.widegets.com/somefile.asp?idcat=32&idproduct=221
In December, we moved to a new host so we could use ISAPI Rewrite. Having had such bad experiences for several months and because it was just as easy, we used ISAPI Rewrite to rewrite the urls completely removing the "idcat" and "idproduct" from the url so they now look like this:
www.widgets.com/product32-221.html
With-in 3-4 we went from 40 indexed pages (only about 6 had the dynamic urls) to over 500. Currently Google has a just about all 950 of our pages indexed.
Now although many have different opinions on whether id is a problem, IMO the id= that you have is a problem, but all I can really say for sure is that in our experience, the query string and it's content, we believe, was giving us problems getting indexed.
If you are struggling getting indexed, I highly suggest rewriting the urls. You can always do a some minor rewriting at first to get rid of the "?" and the "&" adn then if that doesn't get the pages indexed, then rewrite them a little more.
However before starting rewriting you should really consider what pages do you want indexed, what pages not, and how do you want your new URL map to look. Otherwise you'll sink in ensuing confusion.
Our composite solution to URL rewriting troubles (as we have dynamic pages with multiple states and subcommands) was, despite of all, canonical URLs. We built a system that actually exposes an URL scheme that's totally classic; it looks entirely like static HTML URLs and in fact, we use a spider to download the entire site tree and then rsync it up to the production webserver - we have practically zero dynamic content on the website.
Thanks
[webmasterworld.com...]