homepage Welcome to WebmasterWorld Guest from 54.227.56.174
register, free tools, login, search, pro membership, help, library, announcements, recent posts, open posts,
Pubcon Platinum Sponsor 2014
Home / Forums Index / Code, Content, and Presentation / XML Development
Forum Library, Charter, Moderators: httpwebwitch

XML Development Forum

    
Non-unicode characters in XML syntax
httpwebwitch




msg:3773404
 2:33 pm on Oct 25, 2008 (gmt 0)

A well-intentioned attempt to make XML less exclusive to certain ethic groups actually risks causing breakage for those it's intended to help.

XML co-inventor Tim Bray and others have raised a last-minute objection to the planned XML Fifth Edition working its way through the World Wide Web Consortium (W3C). They say it could make it harder to program with or parse some legacy XML documents.

"programmers writing in scripts such as Amharic or Cherokee, which have been added since then [1998, when XML 1.0 was created], can't use their characters in tag or attribute names."

source [theregister.co.uk]

also see Tim Bray's reaction [tbray.org]

 

httpwebwitch




msg:3773574
 1:33 am on Oct 26, 2008 (gmt 0)

the point here is that Unicode is constantly growing (like, the more recent addition of characters used to write Cherokee and Amharic), but the character set allowed in tag, entity and attribute names in XML does not. XML5 plans to remedy that by bringing the XML spec in gear with Unicode. However, as Tim points out:
the change introduces an inconsistency between XML 1.0 and XML Namespaces 1.0, which is intolerable. They have to be either revised together or not at all.
source [tbray.org]
coopster




msg:3774152
 12:29 pm on Oct 27, 2008 (gmt 0)

Some interesting responses in there in regards to the whitespace characters, especially from mainframe/midrange programmers. The comments seem a tad off topic to me though, unless I am missing the connection?

httpwebwitch




msg:3774188
 1:33 pm on Oct 27, 2008 (gmt 0)

yeah I agree, the comments do go a little off the rails.

That whitespace IBM episode seems like a sore topic among XML RFCgazers. Keep in mind the type of personality who pays attention to all the granular details of XML specs - that's the same personality to whom those things would matter, a LOT.

Global Options:
 top home search open messages active posts  
 

Home / Forums Index / Code, Content, and Presentation / XML Development
rss feed

All trademarks and copyrights held by respective owners. Member comments are owned by the poster.
Home ¦ Free Tools ¦ Terms of Service ¦ Privacy Policy ¦ Report Problem ¦ About ¦ Library ¦ Newsletter
WebmasterWorld is a Developer Shed Community owned by Jim Boykin.
© Webmaster World 1996-2014 all rights reserved