Forum Moderators: Robert Charlton & goodroi
I don't know if it has anything to do with the fact that cloudfront sends a lot of 304 responses and the 'naked server' doesn'tIt might. Do your pages include dynamic content that would normally prevent the server from sending out a 304 response? And then if they're on a CDN it sends out “flat” pages, so a 304 becomes possible? “Dynamic” doesn’t necessarily mean that anything has actually changed. It just means anything built on the fly, like a php navigation header. Google ought to be able to tell the difference, and see whether anything has really changed, but maybe it can’t.
as many as 35 pages at onceYikes. Since they refuse to honor “Crawl-Delay” have you tried setting a rate in GSC? 35 at once really seems overkill, unless you've got the kind of site that gets thousands of human visitors every minute so the odd 35 wouldn't even be noticeable. (I'm guessing you don't, since you did notice ;))
Also note that I started noticing it only after I went behind Cloudfront for a couple of weeks and came back.
I do notice a lot more PDF links in search results compared to earlierSame here, now that you mention it. Are search engines getting better at reading pdf's, or are the pdf's themselves getting better? If the file is just a flat page image--which an older pdf may well be--do major search engines now run their own OCR, so there's text content to find in searches even if you, the human user, can't get your browser to find that same text?