Msg#: 4642370 posted 5:14 pm on Feb 4, 2014 (gmt 0)
If you're referring to the latest GWT glitch that saw many websites "double" their size overnight (happened mid January), there's no way of confirming as to what really happened.
There are no known commands to type in Google to "unlock" the supplemental index (mind you, I'm not saying there is such index!) or a different tier of the index, but with a bit of creativity, you can find pages that may contain thin content.
Here's an example. I was recently looking at a Pandalised site, however, the site:example.com query didn't yield any pages with thin content.
Then I browsed around randomly and stumbled upon a page that contained only an image and this string of text: "Click on the image to reveal a larger size"
Next, I used this query: "Click on the image to reveal a larger size" site:example.com and voila! it showed me it had 200+ such pages in its index. Funny, when I did just site:example.com it didn't show a single page of that type.
Now if I were to try to rescue the site, I'd simply remove those pages or "noindex" them.
Another possibility is that each of your page has got two different urls.
However, most likely, Google doesn't give you the precise number of pages it's got in its index.
Msg#: 4642370 posted 6:41 pm on Feb 6, 2014 (gmt 0)
Another alternative: Adjust your search settings to 100 results per page. Do your search, then go to the last page of the results, and it is likely you'll see the quote below:
In order to show you the most relevant results, we have omitted some entries very similar to the 510 already displayed. If you like, you can repeat the search with the omitted results included.
Click on the link You will see many more results. If you get your site really "tuned up" you should not see the message above. Remember to carefully check for the same page being indexed as "https://" and "http://". Do a text search of the results page for "https".
The 100 results per page setting can save you a lot of time getting to the 1000th result for a large site. The message above is only shown on the last page of results.
Msg#: 4642370 posted 1:27 pm on Feb 7, 2014 (gmt 0)
When you click that omitted results link the URL will contain the parameters I specified in the query string
Yes, but when you do it the old fashioned way, looking for Google's omitted results message, you receive positive feedback from Google that your site may have some content Google considers duplicate. Some may disagree with this assessment.