Welcome to WebmasterWorld Guest from 54.162.239.134

Message Too Old, No Replies

Identifying low quality pages and removing them

     
5:05 pm on Dec 15, 2011 (gmt 0)

5+ Year Member



Hi guys I'm trying to reduce/remove/eliminate the amount of low quality pages on my site.

To identify them I'm using the following process:

- Using site:www.example.com in google
- make a note of all these pages
- navigating to the final page to get to the
In order to show you the most relevant results, we have omitted some entries very similar to the 100 already displayed.
If you like, you can repeat the search with the omitted results included.

- Go back to page 1 and see which pages have been added and compare the two.

My first query is is this the easiest way of identifying low quality pages?

Once these pages have been identified what is the best way to deal with them?

- Simply remove the files from the server?
- Add noindex no follow in the meta tags of the page?
- Remove the pages using webmaster tools?

Any guidance appreciated!

[edited by: tedster at 7:30 pm (utc) on Dec 15, 2011]
[edit reason] switch to example.com [/edit]

8:14 pm on Dec 15, 2011 (gmt 0)

WebmasterWorld Senior Member 5+ Year Member Top Contributors Of The Month



Another factor you should consider before removing a page is how much traffic it brings in.

Also, if a page has any backlinks from other websites, it might be best to re-direct it rather than delete it.
8:41 pm on Dec 15, 2011 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member



Speaking for myself, I don't see a lot of reasons to remove a page IF that page serves a purpose for the site (in other words, is not simply a bridge page). I would noindex it; remove it from sitemap.xml; and even block its indexing in robots.txt. Those 3 steps should make it clear to Google that they can leave it alone, and should alleviate any concerns on your part that the page(s) will hurt you with Panda.

........................
9:50 pm on Dec 15, 2011 (gmt 0)

5+ Year Member



I think i also need to go into analytics and see what pages are not gaining any traffic, look for high boune rates and evaluate from there.
10:01 pm on Dec 15, 2011 (gmt 0)

10+ Year Member



I would noindex it; remove it from sitemap.xml; and even block its indexing in robots.txt
We are doing the same , come the next Panda update we will see what happens.

Objective: is the page useful for our visitors = yes - as reported in web statistics. Is it thin or details found elsewhere i.e. waste of time being in Google index =yes. If yes to both remove from Google but keep page.

If we have thin pages not found elsewhere we keep them in.

We will see...
9:27 am on Dec 16, 2011 (gmt 0)

5+ Year Member



The process you've described is a good one, however if I were you I will try to create 2 groups of pages.
  1. Low quality pages in the eyes of google: you can identify those pages as you said before, divide them into:
    • the ones which give you traffic (maybe they're landing pages of a referral link), don't delete these ones, just be sure they're removed from SE with noindex and/or robots exclusion
    • the ones which doesn't add value nor receive traffic, you can delete these ones, redirecting the ones which has inbound links

  2. low quality pages in the eyes of your users: have a look at those pages which receive traffic but have an high bounce rate and decide what is the best strategy: url removal or improve their content or merge their content with another page.
2:55 pm on Dec 16, 2011 (gmt 0)

WebmasterWorld Senior Member tedster is a WebmasterWorld Top Contributor of All Time 10+ Year Member



If you have the historical records from whatever analytics program you use, look first for those pages that lost the most Google search traffic when Panda first had its impact on your site.

From what I've seen it is most likely these pages that got the low Panda scores. Then that low score spread as a ranking factor from these "seed pages" to some degree throughout the other URLs of your site. Ideally, you would improve these seed pages pages - take them beyond whatever "shallowness" you can clearly see. When that seems impractical, then remove them. Or if that also seems impractical, then noindex them.
4:53 pm on Dec 16, 2011 (gmt 0)

5+ Year Member



Can I just clarify something..I guess Panda has spelt out the need for quality pages...but what if you have a 5 year old site with 3000 thousand unique, original content pages (I don't, but hypothetically speaking)...not everyone of those pages will gain significant traffic or user views...and some will be buried deep in your category..perhaps relating to a specific event some years ago.

Does that mean you would have to noindex/amend robots.txt all those pages which do not achieve much traffic (even though they are quality articles with original analysis?

Just because google deems them 'low quality'?
5:00 pm on Dec 16, 2011 (gmt 0)

WebmasterWorld Senior Member tedster is a WebmasterWorld Top Contributor of All Time 10+ Year Member



No - from what I see, pages such as you described are not hurting the websites that publish them. You only need to address those pages that LOST traffic on a Panda update, to the degree that you can. This often includes a variety of things - pages created merely to address subtle variations in keywords, for instance.

Google has given us quite a bit of input as to the kind of "content" that they don't want to rank well. Even if your pages aren't currently being devalued bu Panda, it's still wise to future-proof your site by paying attention.
7:09 pm on Dec 16, 2011 (gmt 0)

WebmasterWorld Senior Member planet13 is a WebmasterWorld Top Contributor of All Time 5+ Year Member Top Contributors Of The Month



...but what if you have a 5 year old site with 3000 thousand unique, original content pages... not everyone of those pages will gain significant traffic or user views...


Maybe moving those particular pages - if they have a common theme - to a NEW site that is more closely aligned with that theme will help out in terms of generating traffic and improving metrics for those pages.

that's not so much a Panda consideration, but more of a traffic / monetization consideration.
 

Featured Threads

Hot Threads This Week

Hot Threads This Month