Forum Moderators: Robert Charlton & goodroi

Message Too Old, No Replies

Potential Google Bug?

Unicode Error

         

juggyd

10:09 pm on Aug 29, 2007 (gmt 0)

10+ Year Member



This is really strange.

1. Type "site:cnn.com?" (with a question mark at the end) and you will get around 800,000+ hits.

2. Now type "site:cnn.com。". You will get about 2,000+ links, but every CACHE LINK is either outdated or inaccessible due to google adding a question mark at the end of the .com link.

Some hits even show up with this method that normally aren't available in the database. List may also may contain duplicates and/or other outdated links.

Strange..?

juggyd

1:19 am on Aug 30, 2007 (gmt 0)

10+ Year Member



Just to add to the previous post. When examining the 2,000+ hits, you can still access a different version of the google cache by removing the "%E3%80%82" after the dot com in your URL toolbar and pressing enter to retrieve that specific pages cache.

I've noticed different cached copies available for some of the resulting pages that are displayed. Try this on various sites to see different results. I guess this really does prove that there can be multiple/different cached copies of pages that exist out there (even if they are old and outdated!)

tedster

2:29 am on Aug 30, 2007 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member



Hello juggyd, and welcome to the forums.

Many have noted that Google saves historical records for a long time. Sometimes one of them creeps out into a regular search result - but even without that evidence, the patents Google has filed make it clear that tjhey are watching the historical records quite closely.

Nice find!

juggyd

3:11 am on Aug 30, 2007 (gmt 0)

10+ Year Member



That is a shame. I tried using the web removal tool to remove one of these "exclusive" pages, and it doesn't work because of the error with the? question mark after the dot com in the URL.

I'm basically seeing duplicate crawls of the exact same page, and, as you said, historical pages are being saved. If this is the case, I would assume this is wrong given Google's policy for storing cached pages.

I hope someone from Google is seeing this or reading this forum. Maybe it will help me in my quest for removing what really shouldn't be there.

tedster

5:05 am on Aug 30, 2007 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member



As long as these peculiar urls aren't showing up in regular search results, there's no real problem I'd say.

juggyd

6:02 am on Aug 30, 2007 (gmt 0)

10+ Year Member



The thing is, they *ARE* showing up in regular results (Just recently this last month in fact).

juggyd

10:10 pm on Sep 5, 2007 (gmt 0)

10+ Year Member



Has anyone else encountered this issue with their sites? Google has yet to fix this problem.