i've just been reading this thread - [webmasterworld.com
] - and realised that 10,000 of my http pages are still in google's index. This is seven months after I changed over to https. I'm guessing that I've messed up somehow for that to happen.
I've got both URLs listed in google's search console and the http one has dropped from 160,000 to 10,000, whilst the https one has gone up to 160,000, which is the correct number.
does anyone know of a way to find out what http pages are still in the index? The search console isn't much use for that. I've tried the old
search operator on google but it just returns all the https pages.
as far as I can tell all of my redirects are set up okay, so I don't think that's it. the only thing I can think of is that google no longer has a way to crawl those pages because maybe I blocked them in robots.txt or something (I can't remember making any changes to it).
I suppose I could do the nuclear option and just ask google to remove all the pages in the search console, but I'd rather not do that because it's a bit scary!