homepage Welcome to WebmasterWorld Guest from 54.234.228.64
register, free tools, login, search, pro membership, help, library, announcements, recent posts, open posts,
Become a Pro Member
Home / Forums Index / Google / Google SEO News and Discussion
Forum Library, Charter, Moderators: Robert Charlton & aakk9999 & brotherhood of lan & goodroi

Google SEO News and Discussion Forum

    
Deindex Website from Google first and Start with a Clean Slate?
McMohan




msg:4570536
 6:15 am on May 4, 2013 (gmt 0)

I was given a portal that was doing poorly in Google, give its decent link popularity. Alexa rank in the range of 10k. Tens of thousands of pages. (No instances of Alexa rank being influenced :-))

It was a Herculean task getting my head around the overall site structure and navigation. Digging deeper, I ascertained only about 10% of pages are useful. Remaining 90% of pages either duplicate pages, search result pages, various forms, print version pages, test folders, and so on. Now, next task was getting non-useful pages out of index through robots.txt, 404ing duplicate pages, setting URL parameters in WMT (there are about 40 different types of URL parameters!). Overall cleaning up the mess. I told it will take time for Google to notice and readjust its index.

The client asked. How about deindexing the site, ensure site is out of Google index completely and let it index, so that only the useful 10% pages are in the index? With URL removal tool it can be done in a matter of days. I searched for and answer, that it might be too drastic, not convinced with my own answer.

Anyone ever done that? Can it be thought of as an option to speed up the process?

 

tedster




msg:4570581
 12:12 pm on May 4, 2013 (gmt 0)

I agree with your initial assessment. The client may be impatient for results, but a complete deindexing in this situation is probably too drastic.

goodroi




msg:4570664
 5:22 pm on May 4, 2013 (gmt 0)

Deindexing from Google will not fix the problem of having poor quality pages on the site or the 404 errors you will have if you only delete those poor quality pages.

McMohan




msg:4571009
 6:33 am on May 6, 2013 (gmt 0)

Thanks for the confirmation Tedster

Deindexing from Google will not fix the problem of having poor quality pages on the site or the 404 errors you will have if you only delete those poor quality pages.


During the cleanup act, many entries are made to robots.txt - few folders, few files and many wildcards are disallowed. URL parameter handling was defined in WMT. A majority of these disallowed pages continue to be present in SERPs and it may be awhile before they are out. To cut the time, it was a suggestion to remove all the pages and let only useful pages in, since not-useful pages are blocked in robots.txt. Not so bad an idea to dismiss outright?

lucy24




msg:4571028
 8:18 am on May 6, 2013 (gmt 0)

Compromise? Leave index alone but remove everything from g###s cache, so at least you don't have all those garbage pages flopping around in the background.

phranque




msg:4571046
 9:39 am on May 6, 2013 (gmt 0)

A majority of these disallowed pages continue to be present in SERPs and it may be awhile before they are out.


these may never be out of the index.
a disallow excludes googlebot from crawling but it's not an indexing directive.

McMohan




msg:4571089
 12:33 pm on May 6, 2013 (gmt 0)

these may never be out of the index


Pages in a folder can be removed from google's index through URL removal tool on WMT. But there are many, thousands in fact, such as search result pages, that are not in a particular folder but generated through dynamic queries. They are now disallowed through a wildcard in robots.txt. Will they always remain in the index, unless you deindex the whole site?

P.S: Adding META noindex/nofollow to those pages doesn't seem to be possible, given the way the site is built.

Global Options:
 top home search open messages active posts  
 

Home / Forums Index / Google / Google SEO News and Discussion
rss feed

All trademarks and copyrights held by respective owners. Member comments are owned by the poster.
Home ¦ Free Tools ¦ Terms of Service ¦ Privacy Policy ¦ Report Problem ¦ About ¦ Library ¦ Newsletter
WebmasterWorld is a Developer Shed Community owned by Jim Boykin.
© Webmaster World 1996-2014 all rights reserved