lucy24 - 7:08 am on Jul 22, 2012 (gmt 0)
These always show incorrectly on our sites
You have to keep careful track of the file encoding. Either explicitly set it to be the same at every stage, or explicitly do a conversion. Word is a special case because I have to assume it uses Windows-Latin-1 encoding, which is bound to cause trouble when it runs into UTF-8 if you don't detour via a conversion. Or if the user's OS-plus-browser doesn't take care of it on its own. Can I assume that all your pages carry an explicit "charset" (misleading term!) declaration?
Something tells me your sites are all in English ;) I shudder to think what my html would look like if I were forced to use entities. Matter of fact, I see it in action in php/bb forums, at least in v.2. Everything displays correctly, but the edit window turns into illegible entities-- useless when the original question was about transliterating Greek and you have to find the typo. But even French and German come out pretty horrendous. "Trink, brüderlein, trink" just doesn't have the same oomph.
... and then you threw me by saying "your test site" after I'd heroically avoided linking to myself ::snrk:: (/fonts/font_input.html and /font_name.html are where I generally send people.)
My most ancient browser, MSIE 5.2.2, will only display characters if they have named HTML 4 entities. You don't have to use the entity-- decimal, hexadecimal and the real thing* will all work-- but there has to be an entity. That's an eleven-year-old browser, but even then it lagged far behind the norm. I think they just discovered Font Substitution the week before last.
I don't think these Forums convert entities into characters. They turn things like α-ω into entities, but they don't turn back.
* Except that you have to set the file encoding manually, because the browser is too stupid to read the "charset" declaration.