
Forum Moderators: Robert Charlton & goodroi

Conflicting Data in GSC - # of Pages Indexed

     
9:03 pm on Jan 9, 2018 (gmt 0)

Senior Member

WebmasterWorld Senior Member 10+ Year Member Top Contributors Of The Month

joined:May 29, 2003
posts:770
votes: 19


On my site, which I recently converted to SSL, Google Search Console shows:
--- Under "Google Index" - "Index Status": 424 indexed (new HTTPS version); 14 indexed (old HTTP version)

--- Under "Crawl" - "Sitemaps": 421 submitted, 407 indexed (both HTTPS and HTTP - notice the delta of 14)
That 14 just won't go away.

Been this way for a while. Can anyone clarify what is going on?

Is there a surefire way to determine JUST WHICH PAGES are gumming up the works (i.e. remain unindexed)?
And why am I JUST FINE by one "pages indexed" metric (over 100%), but 3% screwed in the other?

I have been running Screaming Frog, but can't find a thing. Thanks in advance for any insight offered.
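
For reference, the arithmetic behind the two "pages indexed" figures can be laid out as a short sketch. The numbers are the ones quoted in this post; the variable names are only illustrative, and the comparison of Index Status against the sitemap total is an assumption about how the "over 100%" figure was reached.

```python
# Figures quoted from the two GSC reports in the post above.
submitted_in_sitemap = 421      # "Crawl" - "Sitemaps": submitted
indexed_from_sitemap = 407      # "Crawl" - "Sitemaps": indexed
index_status_https = 424        # "Google Index" - "Index Status": HTTPS version

# The stubborn gap between submitted and indexed sitemap URLs.
sitemap_gap = submitted_in_sitemap - indexed_from_sitemap

# Share of sitemap URLs not reported as indexed (the "3%" in the post).
sitemap_shortfall_pct = 100 * sitemap_gap / submitted_in_sitemap

# Index Status count relative to the sitemap size (the "over 100%" in the post).
index_status_pct = 100 * index_status_https / submitted_in_sitemap

print(sitemap_gap, round(sitemap_shortfall_pct, 1), round(index_status_pct, 1))
```

The two reports are not necessarily counting the same set of URLs, so the percentages need not agree.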
10:56 pm on Jan 9, 2018 (gmt 0)

Senior Member from GB 

WebmasterWorld Senior Member 10+ Year Member Top Contributors Of The Month

joined:Sept 7, 2006
posts: 989
votes: 80


When you say "recently", how long ago? It was a couple of months before GSC sorted the whole of my (smaller) site, and unless there are any errors I wouldn't worry too much.

If you can access the raw server logs, that is the place to check: look for erroneous 200s in the http log (there should be none), and for anything else (but particularly 4xx codes) in the https log. If everything there checks out OK, just give it time.
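
As a rough illustration of that check, here is a minimal Python sketch. It assumes the logs use the common Apache/nginx "combined" layout, where the status code follows the quoted request; the function name and the sample lines in the usage note are hypothetical, and a host's actual format may differ.

```python
import re
from collections import Counter

# Assumption: each line contains a quoted request such as
# "GET /page.html HTTP/1.1" followed by a three-digit status code.
REQUEST_STATUS = re.compile(r'"(?:[A-Z]+) (\S+) [^"]*" (\d{3}) ')


def suspicious_requests(lines, is_https):
    """Count (status, path) pairs that deserve a closer look.

    HTTP log:  flag 200s (everything should be redirected).
    HTTPS log: flag anything that is not a 200.
    """
    hits = Counter()
    for line in lines:
        match = REQUEST_STATUS.search(line)
        if not match:
            continue  # skip lines that do not fit the assumed layout
        path, status = match.group(1), int(match.group(2))
        flagged = (status != 200) if is_https else (status == 200)
        if flagged:
            hits[(status, path)] += 1
    return hits
```

Called on the lines of the http log with `is_https=False`, an empty result would mean no stray 200s; called on the https log with `is_https=True`, the 4xx entries in the result are the ones to look at first.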
1:10 am on Jan 10, 2018 (gmt 0)

Senior Member

WebmasterWorld Senior Member 10+ Year Member Top Contributors Of The Month

joined:May 29, 2003
posts:770
votes: 19


I converted everything (I THOUGHT) in July.
My host did not properly implement the 301 redirects until early November (Plesk system).
By December, I was almost fully reindexed as HTTPS.
I lost most of the links to my top page (was over 30,000, now 4,100).
Still trying to crawl my way back to relevance.

I will bring up viewing "raw server logs", which I have never seen before. Are they normally provided by the host? (I am not a programmer - just a little self-taught HTML.)
3:46 am on Jan 10, 2018 (gmt 0)

Administrator from US 

WebmasterWorld Administrator not2easy is a WebmasterWorld Top Contributor of All Time 10+ Year Member Top Contributors Of The Month

joined:Dec 27, 2006
posts:3516
votes: 194


You mention under Crawl > Sitemaps that you still have sitemaps for both http and https. Is that correct? When you create a new domain in GSC for the SSL version, keeping the old non-https version listed in GSC as well can help you check on indexing changes (from old to new). However, you should not keep a non-https sitemap submitted at GSC after switching to https, unless there are still pages that are accessible only via http.

@Sally Stitts - re: raw server logs - You can download your logs via ftp; usually they are available in a zip or gzip format. They can be viewed in a text editor or as an office spreadsheet. Best to use a tool that has decent search capabilities. Various hosts have minor differences in their log policies. Generally each line shows you the IP, the date/time, the file requested, the protocol, the browser (User Agent) used and occasionally a referrer.
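
To make those fields concrete, here is a minimal Python sketch that splits one log line into its parts. It assumes the widely used "combined" log layout; the pattern, the function name and the sample line are illustrative only, and your host's format may differ in small ways.

```python
import re

# Assumed layout ("combined" log format):
# IP - - [date/time] "METHOD /file PROTOCOL" status size "referrer" "user agent"
LOG_LINE = re.compile(
    r'(?P<ip>\S+) \S+ \S+ \[(?P<time>[^\]]+)\] '
    r'"(?P<method>\S+) (?P<path>\S+) (?P<protocol>[^"]+)" '
    r'(?P<status>\d{3}) (?P<size>\S+)'
    r'(?: "(?P<referrer>[^"]*)" "(?P<user_agent>[^"]*)")?'
)


def parse_log_line(line):
    """Return the fields of one log line as a dict, or None if it does not match."""
    match = LOG_LINE.match(line)
    return match.groupdict() if match else None


# Hypothetical sample line, not taken from any real log.
sample = ('203.0.113.7 - - [10/Jan/2018:03:46:00 +0000] '
          '"GET /page.html HTTP/1.1" 404 0 "-" "ExampleBot/1.0"')
fields = parse_log_line(sample)
print(fields["ip"], fields["time"], fields["path"], fields["status"])
```

For a gzip download, the standard library's `gzip.open(filename, "rt")` yields the lines as text, so each one can be passed to `parse_log_line` without unpacking the file first.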

[edited by: not2easy at 4:24 am (utc) on Jan 10, 2018]

3:58 am on Jan 10, 2018 (gmt 0)

Moderator from US 

WebmasterWorld Administrator keyplyr is a WebmasterWorld Top Contributor of All Time 10+ Year Member Top Contributors Of The Month

joined:Sept 26, 2001
posts:10425
votes: 600


Can anyone clarify what is going on?
In addition to what has been said, know that GSC is notorious for displaying conflicting data... especially indexed & crawl stats. It's intermittent, but chronic.
 
