I'm struggling to get a few hundred thousand pages out of Googles index.
I recently discovered several hundred thousand indexed pages, that were coming from my members status updates. These were page with nothing more than "hello", "good morning" etc etc.
I quickly added noindex, follow to them and after a few weeks, the count went down to 1,000 but has since gone back up to 85,000 and has stayed there for about a week now.
All of the pages appear under /statuses/, would it be a better idea to remove that DIR via Webmaster Tools and then disallow via robots.txt?
After the introduction of statuses in December, the site was hit by Panda in January. Nothing is ever 100% certain, but I think there's a good chance these pages did the damage. So I want to get them removed asap, hopefully to catch the next Panda refresh.