Forum Moderators: Robert Charlton & goodroi
Levo, did you ever use a header checker to confirm that these urls were actually gone? What's their header response now?
The issue is, at least for my website, Google is rendering crawled source code from 2 years ago and calls it a successful page fetch, with recent 'Last crawl' date on Search Console & 'snapshot' date on cache page. The rendered/cached pages are also full of broken links, missing images, and look awful due to mixing two-year-old source code with recent css/js files.
When you upload an image in WordPress, WordPress does not only store the image, it also creates a separate so-called attachment URL for every image. These attachment URLs are very "thin": they have little to no content outside of the image.
In our major Yoast SEO 7.0 update, there was a bug concerning attachment URLs. We quickly resolved the bug, but some people have suffered anyhow (because they updated before our patch). This post serves both as a warning and an apology....
...But if you have hundreds or thousands of pages, removing URLs one by one can be time-consuming. John Mueller, a Google Webmaster Trends Analysts said you can use temporary sitemaps. First you 404 or set the pages to noindex. Then you upload a temporary sitemap file with the URLs you want removed but make sure to list them with the last modification date as of the date you set them to 404. It can help speed things up by giving Google a hint to look at these pages because they have changed. When they figure out the pages have changed and are 404ed, Google may remove them faster.Because of my time limitations, all of the above is pretty sketchy, but I'm hoping it helps. Please let us know.
I never used wordpress, nor Yoast. I just used their suggestion of creating a temporary sitemap to expedite removal of indexed pages from Google
...a temporary sitemap file listing these URLs with the last modification date (eg, when you changed them to 404 or added a noindex), so that we know to recrawl & reprocess them....This is something you’d just want to do for a limited time (maybe a few months), and then remove...
HTTP/1.1 200 OK
Content-Type: text/html; charset=UTF-8
Server: Apache
Cache-Control: max-age=90
Last-Modified: Wed, 19 Apr 2017 00:14:37 GMT
Content-Language: en-US
Content-Encoding: gzip
Content-Length: 4058
Date: Wed, 19 Apr 2017 00:14:37 GMT
Connection: keep-alive
Vary: Accept-Encoding
X-Google-Crawl-Date: Thu, 11 Apr 2019 23:49:15 GMT