Forum Moderators: Robert Charlton & goodroi
Is this warning a real threat?
Are there any steps to be taken to fix this?
What is the best approach to exclude these faceted navigation URLs from being indexed and crawled?
The bots need to be able to crawl to understand that the URLs should not be indexed.
Now I have no clue of how to get them to refresh.
these pages will/may continue to appear in the SERPs with a description to the effect "there is no information about this page because it is blocked from crawling"
they might, but I wouldn't expect them to be anywhere near page 1 for any search term, and if they haven't been crawled it is difficult to predict what seacrh terms they might appear for... and that's why I count this particular warning among the many, many GSC warnings that are best dealt with by ignoring them.
One month after the new site was launched, we noticed an increase in the section "Indexed, not submitted in sitemap" in GSC
My concern is, how do they index the pages if they don't crawl them?
HEAD instead of GETGood idea: that lets you verify that a page exists, without seeing its content. But if you're roboted-out, you're not supposed to be making HEAD requests either.
"HEAD /amp_preconnect_polyfill_404_or_other_error_expected._Do_not_worry_about_it?1550188800000 HTTP/1.1"from last February that I'm sure must have been discussed hereabouts somewhere.
Nope, HEAD.+Googlebot turns up only fakers
[edited by: seo21 at 1:19 pm (utc) on Oct 18, 2019]