Forum Moderators: open
I've never had this problem before, but I am now creating pages from a database that really isn't constructed that well, and some products are showing up in more than one place on it. I can't figure out a way to keep from sometimes showing the same product in more than one of the category pages, and then I end up with some product detail pages being duplicated.
I will try to find a way to avoid it, but my programming skills aren't that great so I hope sites don't get hurt bad by a few pages being duplicated.
root/widget1.htm, widget2.htm, widget3.htm, widget4.htm, index.html etc
But as I developed more and more product pages I decided to split them into categories as there were now so many, as follows:
root/blue widgets/widget1.htm, widget2.htm
/red widgets/widget3.htm, widget4.htm
/index.html
So I uploaded my new 'blue widgets' folders with the relevant product pages but forgot to delete the exact same product pages that were in my root directory, so I now had two versions of every single product page on my site (Doh!):
root/blue widgets/widget1.htm, widget2.htm
/red widgets/widget3.htm, widget4.htm
/index.html
/widget1.htm
/widget2.htm
/widget3.htm
/widget4.htm etc
I didn't even notice until a month later when my PR bar went grey and my entire site dropped out of the index. There was no other explanation for such a penalty - the new pages had been indexed, found to be duplicates of the existing ones, so zap, PR goes grey and you're outta here.
I immediately deleted the rogue pages in the root directory (which were now orphans anyway) and emailed Google explaining my oversight, insisting I wasn't trying to get 2 versions of all my product pages indexed, but all they said was I was welcome to send in a reinclusion request if I thought my site now met their guidelines.
I didn't bother as I'd read once you have a penalty it's difficult to shake off, so I put the exact same site on a new domain (obviously without the product pages in the root) and it's been fine ever since.
So the moral of this looooooooooong story is be VERY careful when it comes to duplicates, intentional or otherwise. In my experience Google can take a very hard line. :(
I have two directories of products due to the large number of pages and upload them as a zip and decompress on the server. When decompressing one of the files, I misspelled the directory name it another directory was created. This new directory has no links to it from anywhere.
Problem is the Google spidered 10,000 pages that night. I hope Brett is correct. I believe I'll have a look at the log files to see what Google did that night.
black - I don't worry so much about near duplicate pages on different sites, especially, since I don't do that myself with my own sites.
I think I may have found a way to avoid it, at least some of the potential duplicates. I'll try it today or tomorrow and see if I can do it or not.
I can see how Google may think someone would be trying to get multiple listings for a search term by putting duplicate pages on one site. But since it can be by accident to me it would make more sense for them to just index one rather than penalize a site.
Like I said, I moved the exact same site to a different url, killed the old one completely and started again (a VERY painstaking process) and within 4 months I was back to where I was before. Since then no problems at all, which means it MUST have been the duplication as I was booted out within days of the duplicates being indexed.
Hopefully for you Trisha, if it's just a few pages rather than 97% of your pages that are duplicated you will be ok.
Are you sure you didn't have "orphaned pages" which just happened to be duplicates or very similar to other pages within your site?
Duplicates may not cause a problem ... I don't really know anymore as I (thankfully) haven't made that particular mistake recently. But orphans (pages with no links to them from anywhere within your site) certainly are a problem and can be considered doorway pages, for which you will most certainly be penalized.
In addition, this 2nd domain shows up for the same backlinks that my Original domain has. I have never optimized for the 2nd one. I purchased Inktomi submit for both last year. Recently I upgraded Domain #1 to Overture site match, but Domain #2 is still riding under Inktomi.
Now Domain #2 (in the last few weeks) has been showing up for all kinds of keywords on the 1st page of Yahoo. But Domain #1 is still not to be seen.
Has anyone ever heard of this? Also, will/can this hurt me in regards to 'duplicate sites', even though I only have 'one' site...and 2 domains? They seem to be treated as the same site. I am confused. Any ideas?
They were orphans, yes. When I changed my directory structure the pages in my root folder became orphans and I simply forgot to delete them.
Do you think this is what annoyed Google, that they thought my orphaned pages were doorways? Interesting, I hadn't even considered that yet it makes sense.
I thought Google overreacted and were a bit harsh kicking me out of the index for duplicates, this probably explains it! :)
Its easy to do if you are juggling more than one task at a time and I usually have a phone hanging off each ear while eating my lunch and trying to update a page at the same time.
Since then, I make darned sure I delete the page first and then the links!