Forum Moderators: Robert Charlton & goodroi
1. Type "site:cnn.com?" (with a question mark at the end) and you will get around 800,000+ hits.
2. Now type "site:cnn.com。". You will get about 2,000+ links, but every CACHE LINK is either outdated or inaccessible due to google adding a question mark at the end of the .com link.
Some hits even show up with this method that normally aren't available in the database. List may also may contain duplicates and/or other outdated links.
Strange..?
I've noticed different cached copies available for some of the resulting pages that are displayed. Try this on various sites to see different results. I guess this really does prove that there can be multiple/different cached copies of pages that exist out there (even if they are old and outdated!)
Many have noted that Google saves historical records for a long time. Sometimes one of them creeps out into a regular search result - but even without that evidence, the patents Google has filed make it clear that tjhey are watching the historical records quite closely.
Nice find!
I'm basically seeing duplicate crawls of the exact same page, and, as you said, historical pages are being saved. If this is the case, I would assume this is wrong given Google's policy for storing cached pages.
I hope someone from Google is seeing this or reading this forum. Maybe it will help me in my quest for removing what really shouldn't be there.