Forum Moderators: Robert Charlton & goodroi

Huge Crawled - currently not indexed

Why I am seeing Feed URLs?

         

rht999999

3:52 am on Nov 19, 2022 (gmt 0)

Top Contributors Of The Month



I have a personal blog with 1200+ pages published and indexed. In the Page Indexing report I see a lot of URLs under "Crawled - currently not indexed" that are not actual pages but feed URLs. Is it fine to have 700 URLs in that section?



[edited by: not2easy at 12:48 pm (utc) on Nov 19, 2022]
[edit reason] specifics [/edit]

not2easy

1:05 pm on Nov 19, 2022 (gmt 0)

WebmasterWorld Administrator 10+ Year Member Top Contributors Of The Month



Not every URL that Google crawls gets indexed, and they seem to be getting more selective about indexing. They crawl everything that is not blocked, then choose selectively what to show in the SERPs. You probably wouldn't want them to index your /feeds/ URLs anyway, so I wouldn't worry about it. You could send an X-Robots-Tag header for the /feeds/ directory to noindex those URLs without blocking the crawler.
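For anyone who sets that header and wants to confirm it says what they intended, here is a minimal sketch in Python. It only parses a header value you have already copied out of a response (for example `noindex` or `googlebot: noindex, nofollow`). The function name `has_noindex` is made up for illustration, and the parsing is a simplification, not a full implementation of how any crawler reads the header.

```python
def has_noindex(x_robots_tag: str) -> bool:
    """Return True if an X-Robots-Tag header value contains a noindex directive.

    Hypothetical helper: a simplified parser for illustration only.
    """
    for part in x_robots_tag.split(","):
        directive = part.strip().lower()
        # A directive may be scoped to one crawler, e.g. "googlebot: noindex".
        if ":" in directive:
            directive = directive.split(":", 1)[1].strip()
        # "none" is shorthand for "noindex, nofollow".
        if directive in ("noindex", "none"):
            return True
    return False
```

A value such as `nofollow` alone would return False here, which would mean the feed URLs are still eligible for indexing.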

Here are a few similar discussions to see how others handle it:
[webmasterworld.com...]
[webmasterworld.com...]
[webmasterworld.com...]

There are more; it seems to be common.

lucy24

5:30 pm on Nov 19, 2022 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member Top Contributors Of The Month



I see a lot of Crawled - currently not indexed in Page Indexing which are not actual URLs but kind of feed URLs. So is it fine to have 700 urls in that section?
It sounds as if G### is acting intelligently. “We are aware that this URL exists, and we are also aware that its content is identical to some other URL which we have chosen to index.”

Sgt_Kickaxe

2:54 pm on Nov 28, 2022 (gmt 0)



Out of curiosity I added /feed to the end of URLs on sites I know are WordPress, just to see what would happen. Instead of showing an error page or displaying the feed, my browser began downloading the feed for each page as I tried to visit it.

Opening the downloaded file in a code editor shows a properly formatted but empty feed and no other page contents. I'd say something has changed with WordPress, or perhaps some (or all?) of the old methods of disabling feeds are no longer working. Maybe it's a browser change? I couldn't replicate it consistently in Chrome, but Firefox was download-happy.

- example.com/feed
- example.com/article/feed
- example.com/article/comments/feed
- etc. All behaved the same, and all show the WordPress version in the downloaded file.
- Occasionally there was no download prompt.

I don't know if this is causing an increase in reported issues, but I'd recommend adding /feed to the end of your page URLs to see how the site responds. Adjust accordingly.
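If you'd rather not click through by hand, the check above can be roughed out in Python with only the standard library. This is a sketch under assumptions: `feed_url` and `probe` are names invented here, the `feed-probe/0.1` User-Agent is a placeholder, and the URLs are assumed to be plain paths with no query string. It reports the status code plus the Content-Type and Content-Disposition headers, which are what a browser normally looks at when deciding whether to display or download a response, and the X-Robots-Tag header if one is sent.

```python
from urllib.request import Request, urlopen
from urllib.error import HTTPError


def feed_url(page_url: str) -> str:
    """Append /feed to a page URL, avoiding a double slash.

    Assumes a plain path; URLs with a query string are not handled.
    """
    return page_url.rstrip("/") + "/feed"


def probe(url: str, timeout: float = 10.0) -> dict:
    """Fetch a URL and return the status and the headers relevant to feeds."""
    # Placeholder User-Agent; some servers reject requests without one.
    request = Request(url, headers={"User-Agent": "feed-probe/0.1"})
    try:
        with urlopen(request, timeout=timeout) as response:
            status, headers = response.status, response.headers
    except HTTPError as err:
        # 4xx/5xx responses still carry a status code and headers.
        status, headers = err.code, err.headers
    except OSError as err:
        # DNS failures, refused connections, timeouts, and similar.
        return {"url": url, "error": str(err)}
    return {
        "url": url,
        "status": status,
        "content_type": headers.get("Content-Type"),
        "content_disposition": headers.get("Content-Disposition"),
        "x_robots_tag": headers.get("X-Robots-Tag"),
    }


# Example (performs a real network request):
# print(probe(feed_url("https://example.com/article/")))
```

Note that `urlopen` follows redirects, so the status shown is for the final URL, not necessarily the /feed URL you asked for.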

I don't have the time, or a WordPress test site, to dig deeper this morning. Is anyone else seeing this?