Welcome to WebmasterWorld Guest from

Forum Moderators: Robert Charlton & aakk9999 & andy langton & goodroi

Message Too Old, No Replies

Google changes specified charset in cached pages

Foreign characters are gibberish in the Google cache

1:19 am on Jan 3, 2007 (gmt 0)

New User

10+ Year Member

joined:Jan 28, 2004
votes: 0

I write a website that includes many foreign characters. On each page in the <head>, I specify the windows-1252 character set in a <meta> tag as follows:

<meta http-equiv="Content-Type" content="text/html; charset=windows-1252">

I recently noticed that my home page is not displaying properly in the Google cache. Foreign characters are transposed into gibberish. For example displays as d'. I checked the the source for the Google cache and found that Google had inserted a conflicting charset definition in a <meta> tag at the top of the page:

<meta http-equiv="Content-Type" content="text/html; charset=windows-1250">

Windows-1250 is a different foreign charset specifically for Eastern & Central European characters and differs significantly from windows-1252. Since Google's charset <meta> tag comes first, it is overriding my charset <meta> tag in the cache and causing misinterpretation of my foreign characters. The page is not turning up in some Google searches because words containing foreign characters are not cached correctly.

All other pages on the website that I have checked are cached correctly, and Google's <meta> tag properly calls for a windows-1252 charset.

Is there any way I can force my charset definition to override the conflicting and erroneous definition inserted by Google in some cached pages?

3:11 am on Jan 5, 2007 (gmt 0)

Senior Member from CA 

WebmasterWorld Senior Member encyclo is a WebmasterWorld Top Contributor of All Time 10+ Year Member

joined:Aug 31, 2003
votes: 4

Continued in duplicate thread: