I noindex,nofollowed and robots.txt-blocked all the thin content.
In other words, since people can still visit no-indexed pages, does the Google algorithm include them in its evaluation of the overall content and quality of the site?
...Matt Cutts and Singhal advocate for noindex only...
Blocking “Low Quality” Content
Matt reiterated that enough low quality content on a site could reduce rankings for that site as a whole. Improving the quality of the pages or removing the pages altogether are typically good ways to fix that problem, but a few scenarios need a different solution.
For instance, a business review site might want to include a listing for each business so that visitors can leave reviews, but those pages typically have only business description information that’s duplicated across the web until visitors have reviewed it. A question/answer site will have questions without answers… until visitors answer them.
In cases like this, Google’s Maile Ohye recommended using a <meta name=robots content=noindex> on the pages until they have unique and high-quality content on them. She recommends this over blocking via robots.txt so that search engines can know the pages exist and start building history for them so that once the pages are no longer blocked, they can more quickly be ranked appropriately.
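A minimal sketch of that idea in Python, assuming a page template that knows how many visitor reviews a listing has. The function name `robots_meta_tag` and the `min_reviews` threshold are hypothetical, made up for illustration; nothing in the quoted advice says where the cutoff should be.

```python
def robots_meta_tag(review_count: int, min_reviews: int = 1) -> str:
    """Return the robots meta tag to put in <head>, or "" for no tag.

    Hypothetical rule: a listing counts as thin until it has at least
    `min_reviews` visitor reviews. The threshold is an assumption.
    """
    if review_count < min_reviews:
        # Page stays crawlable (not blocked in robots.txt), just not indexed.
        return '<meta name="robots" content="noindex">'
    # Enough unique content: emit no tag, so the default (index) applies.
    return ""


print(robots_meta_tag(0))  # listing with no reviews yet
print(robots_meta_tag(3))  # listing with reviews
```

The point of doing it this way is that the tag drops off on its own once the page earns unique content, and the URL was crawlable the whole time.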
Matt Cutts (Google): Not prepared, but informal remarks. High order bits: what do people worry about? He often finds that honest webmasters worry about dupe content when they don’t need to. Google tries to always return the “best” version of a page.
Hopefully one or the other? Googlebot will not see the noindex, nofollow if those documents are Disallowed via robots.txt. You have to use one or the other, not both.
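To see why, here is a rough check using Python's standard `urllib.robotparser`. It models how a generic robots.txt-compliant crawler decides whether it may fetch a URL; it is not Googlebot's actual code, and the robots.txt rules and example.com URLs are made up for illustration. If the fetch is not allowed, the crawler never downloads the HTML, so it never reads any meta robots tag in it.

```python
from urllib.robotparser import RobotFileParser

# Hypothetical robots.txt that blocks a thin-content directory.
ROBOTS_TXT = """\
User-agent: *
Disallow: /thin/
"""


def can_see_meta_robots(robots_txt: str, url: str, agent: str = "Googlebot") -> bool:
    """True if a compliant crawler may fetch `url`, and so could read its meta tags."""
    parser = RobotFileParser()
    parser.parse(robots_txt.splitlines())
    return parser.can_fetch(agent, url)


# Disallowed: the page is never fetched, so a noindex on it goes unseen.
print(can_see_meta_robots(ROBOTS_TXT, "https://example.com/thin/page1.html"))
# Not disallowed: the page is fetched and a noindex on it can be honored.
print(can_see_meta_robots(ROBOTS_TXT, "https://example.com/reviews/acme.html"))
```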
Just an update: the redirected pages are now beginning to show in WMT as 404 (Not found), with Linked From shown as unavailable.