Forum Moderators: Robert Charlton & goodroi

Message Too Old, No Replies

Drop of indexed pages, boost of not indexed Crawled+Discovered

         

guarriman3

3:11 pm on May 21, 2023 (gmt 0)

10+ Year Member Top Contributors Of The Month



Hi,

I manage a website with 676k URLs in the sitemaps. On Feb 21, there were 671k URLs indexed (according to Google Search Console > Page indexing), and 937k URLs not indexed. These 937k URLs were old webpages that I've been removing from index, after consolidating the content (681k URLs with redirect) or after removing it (127k with 'noindex'). The rest were 'Crawled - currently not indexed' (43k) or 'Discovered - currently not indexed' (78k).

The number of indexed pages has been decreasing until May 2 (646k). Then, on May 3 the number of indexed pages dropped to 612k, and now it is 583k.
- The number of redirected page has increased slightly until now (from 681k to 688k)
- The number of 'noindex' pages has decreased slightly until now (from 127k to 122k)
- The number of 'Crawled - currently not indexed' boosted (On May 2, passed from 43k to 80k, and now it is 99k)
- The number of 'Discovered - currently not indexed' has increased significantly (from 78k to 91k)

As far as I understand, on May 2 Google definitely penalized my website because Google understands that I have no quality content, is it right?

I've got a website created from a large database (600k records) of commercial products, a site that may be considered as 'thin content'. For the last years, I've been removing (noindexing) the content with low quality (products with short alphanumeric data, that generated short pages with duplicate content), and consolidating (by redirections) different webpages of the same products. I have been taking care not to have duplicate content, and to generate quality content for users from the database records. I've got hundreds of links from top domains included 'nytimes.com' or 'huffpost.com', and thousands of links from Wikipedia.

I would appreciate some tips to cope with this issue. Thank you very much.

JDietz12

6:54 pm on Jun 27, 2023 (gmt 0)



Any resolve on this? I've had similar issues on one of my sites

Meatboat

1:12 pm on Jun 28, 2023 (gmt 0)



These 937k URLs were old webpages that I've been removing from index, after consolidating the content


What kind of consolidation did you do? The "crawled - currently not indexed" category implies that Google discovered it and sees little value in indexing your page for relevant queries. Perhaps your consolidation efforts made your content unattractive to Google as opposed to your old set up?

As far as I understand, on May 2 Google definitely penalized my website because Google understands that I have no quality content, is it right?


I don't know about penalizing; I think it may just not like your consolidated pages...are you noticing that those pages are the ones showing up under "crawled - currently not indexed"?

Nutterum

9:40 am on Jul 4, 2023 (gmt 0)

10+ Year Member Top Contributors Of The Month



Google does remove and add pages based on search seasonality. Are your topics follow under these? If yes, then the change is not as dramatic. If they are "evergreen" or not part of any seasonal shifts, it may be that some tangent topics are and thus these took precedent, due to content freshness, again niche seasonality , etc. In addition, products or info pages that have not received any traffic for a while usually get deprecated after some time. The shift is not so dramatic as to warrent any concern. But if it continues to drop with same pase, you need to check your technical and on page SEO. There might be some island pages, slow server times due to ads blocking content display, image problems, etc.

tangor

5:16 am on Jul 5, 2023 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member Top Contributors Of The Month



How many other websites are using the same

a website created from a large database (600k records) of commercial products, a site that may be considered as 'thin content'.


It is quite possible g sees this as duplicate or redundant info and goes to the site with the best user experience...

digitalgangsta302

12:08 pm on Jul 6, 2023 (gmt 0)



I'm also experiencing the same Google keeps on piling up product pages under 'crawled not indexed' banner. I know product pages need quality content which I've already started doing. What else should I do to overcome the indexing crisis?