Forum Moderators: Robert Charlton & goodroi
Can you tell me how to stop google indexing this type of string?
...
In a list of 404s on the site in search console, this type of url structiure appears 25% of the time
and always gets a 404.
If you have no way to include a "no index" header with the URL, you could prevent crawling with a robots.txt disallow:Disallow: /?i=
A description for this result is not available because of this site's robots.txt
Would it be an idea to have a list of such urls deliver a 410? (rather than a 404?)
I don't know what the ?i means at the start of the string.
We've got redirect file of 2k redirects to try and compensate.
Getting rid of those 404s to 410s would help clean things up.
I would pass on the 410 and instead continue with a custom 404. Don't let that traffic go some where else. Give them a 404 page that has resources on it. Especially if you know the old product code, then redirect them to new related products and services instead. Let Google take care of Google and you take care of the visitor.
If it's a proper 404, I'd say it's not your problem. Unless you are linking or redirecting to those URLs somewhere or somehow, or other domains are.
I would pass on the 410 and instead continue with a custom 404. Don't let that traffic go some where else.Seems to me this is just the place for a 410 with custom 410 page. (Don’t know about IIS, but Apache's default 410 is scary.) If humans are looking for a discontinued product, that's the time to tell them “We used to carry that, but now you might like X, Y or Z instead”.
The robots.txt fix above is good advice, but that only applies to compliant robots.
Is there a way to stop google indexing the query string?
you should ask yourself why they would crawl these urls
One way to go about it is to give the site a crawl with Screaming Frog to see if SF picks up those same query-string URLs. If it does pick those URLs up then something is misconfigured.
Unless you are linking or redirecting to those URLs somewhere or somehow, ...
Any clue here?
https://www.example.com/product?qs=1&productenquiry=1
you should ask yourself why they would crawl these urls
We've got redirect file of 2k redirects to try and compensate.
Getting rid of those 404s to 410s would help clean things up.
are these redirects or 404s?