Home / Forums Index / WebmasterWorld / New To Web Development

New To Web Development Forum

    
Question on accents in French words
cinnamongirl




msg:961017
 4:51 pm on Jul 26, 2002 (gmt 0)

Hello, I'm the web developer for the French Department at a university. My question concerns the accents in many French words. For example, I can write the code either as "français" (using my keyboard shortcut) or as "fran&ccedil;ais". Is there any reason I should use one form over the other? Can either cause display problems in older browsers? Thanks!

:)

 

dcheney




msg:961018
 4:58 pm on Jul 26, 2002 (gmt 0)

Personally, I use the "français" form for a few reasons: search engines can't mess up searching for it, there are fewer characters in the HTML (every little bit helps :), and as long as Content-Type is set correctly and Unicode encoding is used, it seems to work in more browsers than the long form - especially when you get into less common letters.

P.S. Welcome to WMW

mavherick




msg:961019
 5:06 pm on Jul 26, 2002 (gmt 0)

As a good practice, always use fran&ccedil;ais.

It can become a size issue with large documents, I'll give you that, but it's safer for display purposes that way.

I've never had any display problems using the HTML entities in Netscape 4.x, IE 5 and above, Opera, or other browsers. It's a recommendation of the Government of Canada, and they do testing in text-to-speech browsers, braille, etc., so I'm pretty positive that it's the safest way to go. Anybody disagree?

mavherick

dcheney




msg:961020
 5:13 pm on Jul 26, 2002 (gmt 0)

mavherick,
From my experience a lot of the official codes just aren't recognized by browsers (especially 4.x). I don't think it will be an issue for French. But some of the less common ones just aren't supported. (I'll try to get a group of examples together - but it won't be until after lunch!)

mavherick




msg:961021
 5:13 pm on Jul 26, 2002 (gmt 0)

Could you elaborate a little bit on the search engine problem you mentioned, dcheney? You're getting me worried here :( although I never noticed any problem with my French sites.

<added>I'm too slow!! Thanks dcheney</added>

mavherick

Sinner_G




msg:961022
 5:23 pm on Jul 26, 2002 (gmt 0)

What about the Unicode numeric reference? Like &#231; for ç? Does it work better than &ccedil;? And maybe the spiders could read it (wishful thinking I guess)?
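For what it's worth, the named and numeric forms are just different spellings of the same character, so a parser that knows both should end up with the same text. A minimal sketch, assuming Python and its standard-library html.unescape (my choice for illustration, not something used in this thread):

```python
from html import unescape

# Three spellings of the same character, U+00E7 (c with cedilla):
# a named entity, a decimal reference, and a hexadecimal reference.
spellings = ["fran&ccedil;ais", "fran&#231;ais", "fran&#xE7;ais"]

decoded = [unescape(s) for s in spellings]
for source, text in zip(spellings, decoded):
    print(f"{source!r} -> {text!r}")

# All three decode to the same string.
all_same = len(set(decoded)) == 1
print("identical after decoding:", all_same)
```

This only shows that the forms are equivalent once decoded; whether a given 2002-era browser or spider handles one form better than another is a separate, empirical question.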

dcheney




msg:961023
 5:54 pm on Jul 26, 2002 (gmt 0)

OK, all of the following are valid character entity references according to HTML 4.01 - see how many your browser can display (you should not see anything that looks like &name;). (This is a subset of the whole group; I didn't bother with the various accents on each vowel, uppercase variants, or the Greek/math stuff.)

&ccedil;&atilde;&acirc;&agrave;&aacute;&auml;&aring;&iexcl;&cent;&pound;
&curren;&yen;&brvbar;&uml;&copy;&ordf;&laquo;&not;&shy;&reg;&macr;&deg;
&plusmn;&sup2;&sup3;&acute;&micro;&para;&cedil;&sup1;&ordm;&raquo;&middot;
&frac14;&frac12;&frac34;&iquest;&aelig;&eth;&ntilde;&thorn;&szlig;&divide;
&oslash;&yuml;&oelig;&scaron;&circ;&lsquo;&rsquo;&ldquo;&rdquo;&bdquo;
&dagger;&Dagger;&permil;

*** hmm, looks like WMW won't let that form be interpreted :(

mavherick




msg:961024
 6:02 pm on Jul 26, 2002 (gmt 0)

I see your point. Thanks for the info. Everything works fine in IE 5.0, but these entities don't work in Netscape 4.78 (Windows 2000):

&oelig; &scaron; &circ; &lsquo; &rsquo; &ldquo; &rdquo; &bdquo; &dagger; &Dagger; &permil;

mavherick
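One pattern in that list may be worth noting: every entity reported as failing maps to a code point above 255, that is, outside ISO-8859-1. The sketch below checks this with Python's standard html.entities table. It is an observation about code points under that assumption, not a confirmed explanation of Netscape 4.78's behaviour, and the "worked" list is only a sample I picked from dcheney's post.

```python
from html.entities import name2codepoint

# Entities reported above as failing in Netscape 4.78.
failed = ["oelig", "scaron", "circ", "lsquo", "rsquo", "ldquo",
          "rdquo", "bdquo", "dagger", "Dagger", "permil"]

# A sample from dcheney's list that was not reported as failing.
worked = ["ccedil", "agrave", "aelig", "szlig", "yuml", "laquo"]

def in_latin1(name: str) -> bool:
    """True if the entity's code point fits in ISO-8859-1 (0-255)."""
    return name2codepoint[name] <= 0xFF

failed_in_latin1 = [n for n in failed if in_latin1(n)]
worked_in_latin1 = [n for n in worked if in_latin1(n)]

print("failing entities inside Latin-1:", failed_in_latin1)
print("working entities inside Latin-1:", worked_in_latin1)
```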

cinnamongirl




msg:961025
 6:20 pm on Jul 26, 2002 (gmt 0)

Yes, all seems fine in IE, and those that Netscape has trouble with aren't any that I'll need to worry about for my site for the most part. Thanks for all the input and the welcome! :) Oh, and can anyone point me to some good threads on SEO for someone who knows absolutely nothing about the subject and needs to start from scratch??

bird




msg:961026
 6:52 pm on Jul 26, 2002 (gmt 0)

The most important thing is to make sure that your pages have a character set declared. The French accents are part of ISO-8859-1 (Latin-1), so that's an obvious choice. The following line in the head section of an HTML document does the trick:

<meta http-equiv="Content-Type" content="text/html; charset=ISO-8859-1" />

Once you have this (or an equivalent HTTP header sent by the server), you can use your accents just by writing them normally. However, make sure that your editing software doesn't save those characters in some Windows-specific character set; they really need to be in Latin-1.

If your document is encoded with a different character set that doesn't include the cedilla, then you need to use the &entity; spelling.
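The fallback described above can be sketched in a few lines of Python. The helper name to_html_text is hypothetical, and it emits numeric references (&#NNN;) rather than named entities, because that is what Python's built-in "xmlcharrefreplace" error handler produces. It also does not escape &, < or >, so treat it as an illustration only.

```python
def to_html_text(text: str, charset: str) -> bytes:
    """Encode text for a page saved in the given charset.

    Characters the charset cannot represent are written as numeric
    character references instead of raising an error.
    """
    return text.encode(charset, errors="xmlcharrefreplace")

# The cedilla exists in ISO-8859-1, so it is written as a single byte.
latin1_page = to_html_text("français", "iso-8859-1")

# Plain ASCII has no cedilla, so a reference is substituted.
ascii_page = to_html_text("français", "ascii")

# The oe ligature is not in ISO-8859-1, so it also becomes a reference.
oe_page = to_html_text("cœur", "iso-8859-1")

print(latin1_page, ascii_page, oe_page)
```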

cinnamongirl




msg:961027
 8:53 pm on Jul 26, 2002 (gmt 0)

Bird, thanks for the great tip. My editing software sets that charset by default (I believe it's the same charset for English and French?), so it is good to know that it will cover me.

bird




msg:961028
 9:43 pm on Jul 26, 2002 (gmt 0)

ISO-8859-1 covers the special characters of most Western European languages (it lacks a few, such as the French œ ligature and the euro sign).

ergophobe




msg:961029
 1:55 am on Jul 27, 2002 (gmt 0)


"The most important thing is to make sure that your pages have a character set declared"

That's the key.

If you use the named entity form (ampersand, name, semicolon), you can be sure that the user agent will get it right if it understands the HTML version in which the named entity first appeared. This will work with or without a charset declaration.

If you use plain text with accents but don't have a charset declaration, it may or may not render correctly depending on whether or not the user is using the same character set as the one your file is in.

If you declare the charset and the browser is capable of reading that declaration, you'll always get it right. Don't mess up though - you might have one program that saves as UTF-8 and another (probably older program) saving as ISO-8859-1 and you'll get unpredictable results (that's generally true, but pretty much every charset you might use will render standard ASCII characters the same way).
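To make the mismatch concrete, here is a small Python sketch (Python is assumed purely for illustration) of what happens when a file's bytes are in one charset and the reader assumes the other:

```python
text = "français"

utf8_bytes = text.encode("utf-8")         # the cedilla becomes two bytes
latin1_bytes = text.encode("iso-8859-1")  # the cedilla becomes one byte

# UTF-8 bytes read as ISO-8859-1: each byte turns into its own character.
mojibake = utf8_bytes.decode("iso-8859-1")

# ISO-8859-1 bytes read as UTF-8: the lone byte is invalid, so with
# errors="replace" it becomes U+FFFD (the replacement character).
broken = latin1_bytes.decode("utf-8", errors="replace")

# Plain ASCII text is byte-for-byte identical in both charsets.
ascii_same = "francais".encode("utf-8") == "francais".encode("iso-8859-1")

print(mojibake)
print(broken)
print(ascii_same)
```

The first case is the familiar "Ã§" garbage; the second usually shows up as a question mark or box, depending on the software.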

There's a useful page at Blooberry that gives you pretty much the entire character set in both charsets, along with some useful notes. It may not be the definitive reference, but it's the only one I could actually understand.

[blooberry.com...]

Tom
