Forum Moderators: open
1) I copy and paste text from Microsoft Word documents I receive.
2) People in the communications department enter text in a Content Managment System.
Either way, I have problems with character entities (html special characters).
The problem with the first way (copy from Word and paste to Dreamweaver MX 2004) is that Dreamweaver uses named entities instead of the correct decimal entities. For exemple, Dreamweaver uses œ instead of œ for a small oe ligature. Named entities do not display correctly in NN4x but decimal entities do. Any suggestions on how to avoid that?
What is the best way to ensure that text content will always use decimal entities wether it is entered by hand from Word or in a CMS?
If you search for 'html ascii converter' (sans quotes) in any search engine, you should find some software that you can use to pre-process the Word/CMS text.
If you know PHP, I would also look into the functions:
htmlspecialchars()
htmlentities()
get_html_translation_table()
and
strtr()