Forum Library, Charter, Moderators: Robert Charlton & aakk9999 & brotherhood of lan & goodroi

Google SEO News and Discussion Forum

    
De-indexing duplicate content related http or https
helenp




msg:4563593
 7:14 pm on Apr 10, 2013 (gmt 0)

Hi,
Due to duplicate http and https content, I have put a redirect on every page that sends it to http or https, whichever that page should use,
and I am already seeing some changes in indexing.
However, in a folder I have pages similar to those in the root, i.e. the content is the same but the menu, footer etc. are different. My robots.txt states that these pages should not be spidered.

When I check my indexed pages I see too many: these pages are indexed as both http and https, but Google search says there is no content available because of robots.txt.
In order to get the https pages removed quicker, should I let Google spider them?
On the other hand, I suppose Google will sooner or later spider them and deindex the https pages, but I suppose it takes longer because robots.txt tells it not to index these pages.
Thanks,
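
For readers following along, the per-page redirect described above could look roughly like the sketch below. It is only an illustration: the path list, the example.com URLs and the function names are made up, and the actual site may decide the protocol in a completely different way.

```python
from urllib.parse import urlsplit, urlunsplit

# Hypothetical set of paths that should be served over https.
# Everything else is assumed to belong on http.
HTTPS_PATHS = {"/booking.html", "/contact.html"}


def canonical_url(url: str) -> str:
    """Return the URL with the protocol this page is supposed to use."""
    parts = urlsplit(url)
    scheme = "https" if parts.path in HTTPS_PATHS else "http"
    # The fragment is dropped because it is never sent to the server.
    return urlunsplit((scheme, parts.netloc, parts.path, parts.query, ""))


def redirect_for(url: str):
    """Return (301, target) if the request needs a redirect, else None."""
    target = canonical_url(url)
    return None if target == url else (301, target)
```

A permanent (301) redirect is used here because the goal is for the search engine to drop the wrong-protocol URL in favour of the right one.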

 

phranque




msg:4563638
 9:00 pm on Apr 10, 2013 (gmt 0)

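your robots.txt excludes googlebot from crawling but doesn't say anything about indexing, so the urls can still be indexed without their content.
you will need to allow crawling to solve the protocol canonicalization problem: googlebot can only see the redirect if it is allowed to request the url.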
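
A quick way to check what a robots.txt allows a crawler to fetch is Python's standard `urllib.robotparser`. The sketch below is only an illustration: the `/folder/` rule and the example.com URLs are assumptions, not the poster's real file, and the parser answers the crawling question only. It says nothing about whether a URL ends up indexed.

```python
from urllib.robotparser import RobotFileParser


def can_crawl(robots_txt: str, url: str, agent: str = "Googlebot") -> bool:
    """Return True if the given robots.txt text lets `agent` fetch `url`."""
    parser = RobotFileParser()
    parser.parse(robots_txt.splitlines())
    return parser.can_fetch(agent, url)


# Hypothetical "before" file: the folder is blocked, so a crawler never
# requests those pages and never sees the redirect on them.
BLOCKED = "User-agent: *\nDisallow: /folder/\n"

# Hypothetical "after" file: an empty Disallow permits everything.
ALLOWED = "User-agent: *\nDisallow:\n"

if __name__ == "__main__":
    page = "https://www.example.com/folder/page.html"
    print("blocked file lets Googlebot crawl:", can_crawl(BLOCKED, page))
    print("allowed file lets Googlebot crawl:", can_crawl(ALLOWED, page))
```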

helenp




msg:4563651
 9:31 pm on Apr 10, 2013 (gmt 0)

Thanks, I will do so.
I just hope Google doesn't see the pages as duplicate content;
they aren't 100% identical, but very similar.

TheOptimizationIdiot




msg:4563652
 9:34 pm on Apr 10, 2013 (gmt 0)

I just hope Google doesn't see the pages as duplicate content

Even if they do, all that should happen is that they will pick one version of the page to show in the results. No penalty. No other huge, devastating issues any more. Those have mostly been cleared up for quite some time now.

Obviously, it's almost always better to control what's considered the canonical version of a page (the one shown in the results), so there's no confusion or glitch on their end and they show the one you want. But it's normally not a "huge big deal" any more if you have two essentially identical pages on the same site. They just do their best to pick the best one to show people.
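
One common way to state the preferred version explicitly, besides a redirect, is a `rel="canonical"` link element in the page head. The thread doesn't say this site uses one, so treat the helper below as a hypothetical sketch with a made-up function name and URL.

```python
from html import escape


def canonical_link_tag(preferred_url: str) -> str:
    """Build a <link rel="canonical"> element pointing at the preferred URL."""
    # escape() guards against characters such as & or " inside the URL.
    return f'<link rel="canonical" href="{escape(preferred_url, quote=True)}">'


if __name__ == "__main__":
    # Both the http and https copies of a page would emit the same tag.
    print(canonical_link_tag("http://www.example.com/page.html"))
```

The tag is a hint, not a directive, and the crawler has to be allowed to fetch the page in order to read it.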

helenp




msg:4563655
 10:01 pm on Apr 10, 2013 (gmt 0)

Thanks, I have just given Google permission to crawl them.


All trademarks and copyrights held by respective owners. Member comments are owned by the poster.
WebmasterWorld is a Developer Shed Community owned by Jim Boykin.
© Webmaster World 1996-2014 all rights reserved