TheBigK - 12:46 pm on Nov 25, 2012 (gmt 0)
My apologies. I should have mentioned that. The site has over 50k discussions and I'd expect it to have about 90-110k individual pages (pagination considered). The blog has about 4k posts. Considering how 'good' I'm at estimates, I'd not think that there are more than 150k pages. I've blocked several 'thin content' pages (like member profiles, which would add about 150k pages).
But the "not selected" pages are well over 5 million - which is certainly out of place.
I've removed all the 'duplicate' content as reported in GWT and don't really think there is any 'significant' duplicate content on the site. At least not to the tune of what's being reported by GWT.
I've been told that the exorbitant "not selected" could be an indication to Google that the website has lot of duplicate content. But that isn't the case. I suspect some technical error in the site, which I'm not able to find.