Google changes specified charset in cached pages

Foreign characters are gibberish in the Google cache



1:19 am on Jan 3, 2007 (gmt 0)

10+ Year Member

I write a website that includes many foreign characters. On each page in the <head>, I specify the windows-1252 character set in a <meta> tag as follows:

<meta http-equiv="Content-Type" content="text/html; charset=windows-1252">

I recently noticed that my home page is not displaying properly in the Google cache. Foreign characters are turned into gibberish. For example displays as d'. I checked the source of the Google cache and found that Google had inserted a conflicting charset definition in a <meta> tag at the top of the page:

<meta http-equiv="Content-Type" content="text/html; charset=windows-1250">
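To confirm which charset declarations a cached copy actually contains, and in what order, the page source can be scanned for <meta> tags. The following is only a minimal sketch: the class name CharsetMetaFinder, the function find_charsets, and the sample markup are hypothetical, and the sample merely imitates the situation described in this post (an inserted tag followed by the author's own tag).

```python
from html.parser import HTMLParser
import re


class CharsetMetaFinder(HTMLParser):
    """Collect charset declarations from <meta> tags, in document order."""

    def __init__(self):
        super().__init__()
        self.charsets = []

    def handle_starttag(self, tag, attrs):
        if tag != "meta":
            return
        attrs = dict(attrs)
        # Short form: <meta charset="...">
        if attrs.get("charset"):
            self.charsets.append(attrs["charset"].lower())
            return
        # Long form: <meta http-equiv="Content-Type" content="...; charset=...">
        if (attrs.get("http-equiv") or "").lower() == "content-type":
            match = re.search(r"charset=([\w-]+)", attrs.get("content") or "", re.I)
            if match:
                self.charsets.append(match.group(1).lower())


def find_charsets(html):
    """Return every declared charset found in the markup, first one first."""
    finder = CharsetMetaFinder()
    finder.feed(html)
    return finder.charsets


# Hypothetical sample imitating a cached copy with two conflicting tags.
sample = """
<meta http-equiv="Content-Type" content="text/html; charset=windows-1250">
<html><head>
<meta http-equiv="Content-Type" content="text/html; charset=windows-1252">
</head><body></body></html>
"""

declared = find_charsets(sample)
print(declared)
if len(set(declared)) > 1:
    print("Conflicting charset declarations; the first is:", declared[0])
```

If the list holds more than one distinct value, the copy contains conflicting declarations, and the first entry is the one that appears earliest in the source.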

Windows-1250 is a different charset, intended for Central and Eastern European languages, and it differs significantly from windows-1252. Because Google's charset <meta> tag comes first, it overrides my own charset <meta> tag in the cache, so my foreign characters are misinterpreted. The page is also missing from some Google searches because words containing foreign characters are not cached correctly.
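The effect of reading windows-1252 bytes as windows-1250 can be reproduced with Python's built-in codecs, where the two code pages are named cp1252 and cp1250. This is just an illustrative sketch: the helper name misread and the sample characters are made up for the demonstration and are not taken from the affected page.

```python
def misread(text, declared="cp1252", applied="cp1250"):
    """Encode text in the declared charset, then decode it with the wrong one."""
    return text.encode(declared).decode(applied)


# Some accented letters share a byte value in both code pages and survive;
# others map to a different letter when the wrong charset is applied.
for ch in "éèñ":
    print(ch, "->", misread(ch))

print(misread("señor"))  # the ñ comes back as ń
```

Because only some byte values differ between the two code pages, a page can look mostly correct while individual words are altered, which fits the symptom of certain words not matching in search.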

All other pages on the website that I have checked are cached correctly, and Google's <meta> tag properly calls for a windows-1252 charset.

Is there any way I can force my charset definition to override the conflicting and erroneous definition inserted by Google in some cached pages?


3:11 am on Jan 5, 2007 (gmt 0)

WebmasterWorld Senior Member encyclo is a WebmasterWorld Top Contributor of All Time 10+ Year Member

Continued in duplicate thread:
