Forum Moderators: open
I have read that the problem derives from the fact that html originally was invented by an English speaking person.
That fact does not explain why something better was not created when other languages began to use html. What are the reasons? Technical limitations? Political decisions?
And is there any hope of a better, richer and more usefull html?
[tbray.org...]
Sorry if I sounded rude and arrogant and indeed came close to accusing somebody of something. And sorry for making a rash statement based on ignorance.
Funny. Usually it's the other way around: Other people complaining about something missing. Me telling them that indeed it's there and the only problem is their own unability to see it.
I have some reading to do. I'll be back with more questions if there is still something that I do not understand.
If I'm not mistaken, ISO-8859-1 is a subset of UTF-8, so is there a reason why not to use UTF-8 instead?
For Latin-1, no... For other languages? Maybe...
Use UTF if you are positive you are actually using the true unicode characters. Otherwise, use the specific subset.
For example, a lot of Russian Web sites would look something like this when viewed as UTF-8:
Íàøè óñëóãè
But try viewing that using the windows-1251 charset :)
For example, a lot of Russian Web sites would look something like this when viewed as UTF-8:Íàøè óñëóãè
DrDoc, the only reason why you see characters like these (Íàøè óñëóãè) is that - apparently - you have not installed the Cyrilic character set (to display Russian language characters) on your system.
No matter which Russian character set these Russian Webmasters use (KOI8-R, ISO-8859-5, UTF-8), if you have installed the corresponding character set of that language, you will see the page correctly.
I recommend to use UTF-8 to everybody who has pages in many different languages, just to make things easy, otherwise you have to find out which character set is the most appropriate for every language...
We have pages in more than sixty (60) languages, and all pages display correctly thanks to UTF-8 encoding :-)