Forum Moderators: Robert Charlton & goodroi

Message Too Old, No Replies

Sitemaps, crawl paths, 301s, and supplementals

I think I have unconvered some of the problems

         

Dan92SLC

4:02 pm on Mar 15, 2007 (gmt 0)

10+ Year Member



We have had a functional e comm site for 10 years. In march 2006, we migrated to an all new platform that has the same product merchandising and site search as some major, major brands/etailers. Our old site had a basic static rewrite that was highly effective, as well as 500 or so static htm pages that each targeted a specific kw.

Those pages no longer exist, and our IT guy set up 302 redirects (later 301) from those old hallway urls...he configured the redirects to search the site for the kws in the extension of the old site...in other words: if the old satic page was /widgetx.htm, the 302/301 redirect was to a sitesearch=widget%20x? etc url.

Fast forward a year or so.
site:www.domain.com command in Goog shows 7,000 pages indexed.
site:www.domain.com -www show 5,700 pages as supplemental. Are these in addition to the 7,000, or is it of the 7000?

A second issue:
Our site allows customers to sort products within subcategories via all sorts of refinements like brand, type, size etc. We have blocked robots from crawling the refinements (once a customer clicks any refinements, urls become rather long and it is questionable whether the boots would even follow them).

The bots are not restricted from the regular hierarchical category->subcategory->product path. However, the urls in the site map are different than what the site navigation path urls are. Both the navigation and site map are visible to bots/spiders.
examples:
regular site navigation url: www.domain.com/search_subcategory.do?categoryName=widgetC&categoryId=9000&refine=1&page=GRID

sitemap url for the same subcategory:
www.domain.com/siteMap.do?action=map2&catId=32

The site map version shows the same category description text as the navigation one..but has only text links to the products within this category whereas the navigation version contains images with alt attributes plus snippets of descriptive text for each product. Google seems to pick and choose whether it indexes the sitemap url, navigation path url, both, or neither.

Could this contribute to supplemental and or duplicate content woes?

Our competitors in our niche are far and away thin content sites and nobody has close to the amount of on page relevant and themed information for any given product. We are not newbies. We have many thousands of good IBLs gained over the last decade, and practice Mr Wall's and Brett Tabke's methods.

We are trying to figure out what it is that is still hampering our organic traffic and rankings post-new site implementation, and this is what I've come up with.

Am I on the right track? What next?

trinorthlighting

5:26 pm on Mar 15, 2007 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member



Shorter URL's will help. We recently had a site with very long urls that we shortened by 25%. We did 301 redirects, blocked the sort options and pulled right out of the supplemental index.

Honestly, I think a big factor in supplemental indexing is the length of the url.

Dan92SLC

5:52 pm on Mar 15, 2007 (gmt 0)

10+ Year Member



By shorter urls, are you suggesting we make the regular site navigation invisible to the bots and instead make them see the site only via the site map?

trinorthlighting

6:01 pm on Mar 15, 2007 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member



No, but if you spin off new products or add content, keep short urls in mind. If you hide navigation google might think your up to something sneaky.

trinorthlighting

6:04 pm on Mar 15, 2007 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member



The bots are not restricted from the regular hierarchical category->subcategory->product path. However, the urls in the site map are different

That right there is what is killing you, duplicate content, two urls leading to the same content. Choose one or the other, then do some 301 redirects. My suggestion, look at both of the ways to get there, keep the shorter url and redirect the longer one.

Make them the same in both the regular navigation and the site map. Then you will have two links pointing to one product.

[edited by: trinorthlighting at 6:08 pm (utc) on Mar. 15, 2007]

dazzlindonna

6:04 pm on Mar 15, 2007 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member



If you can prevent Google from seeing duplicate content via blocking them from all but one version of a page, you should.

Even better, if you can rewrite those urls to show something like www.domain.com/widgets/blue-widgets, you'll be even better off.

trinorthlighting

6:10 pm on Mar 15, 2007 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member



I would not chance hiding them, just fix them so the url is the same in the navigation as it is the same in the site map. Google has no issue with indexing dynamic websites if the php code is properly wrote so spiders will not index session id's

We use zen cart and zen has the code right.

lfgoal

6:20 pm on Mar 15, 2007 (gmt 0)

10+ Year Member Top Contributors Of The Month



Just curious. Do supplementals pass pagerank and reputation?

trinorthlighting

6:34 pm on Mar 15, 2007 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member



Yes, but if the content is changed on the supplemental page, then it might take 2-6 months to update.

lfgoal

7:05 pm on Mar 15, 2007 (gmt 0)

10+ Year Member Top Contributors Of The Month



trinorthlighting,

Did you mean to say "if the content is NOT changed on the supplemental page, then it might take 2-6 months to update."?

I hope that's what you meant because I don't see why changing the content would contribute to such a long lag. From everything I've read lately, the key to dealing with supplementals seems to be :

1. more pagerank (to justify not being a supplemental I guess)

2. having more unique content (so if you have a supplemental page, you'd probably want to go back and edit the page to add more distinctive content)

walkman

3:49 am on Mar 16, 2007 (gmt 0)



I thimk it's faster than the 3-6 months. Google is going NUTS on my server--which is OK since I only have less than 1000 pages--and I see the numbers going up daily. Pages that were supps a few days ago are now in the main index. I hope to be supplemental free in a week or so.

Then, I hope that rankings are restored once the pages are indexed. Is it too much to expect better rankings once the supplementals are gone? My site is really old and I have a lots of unsolicited, one way, links.