homepage Welcome to WebmasterWorld Guest from 54.211.219.178
register, free tools, login, search, pro membership, help, library, announcements, recent posts, open posts,
Become a Pro Member
Home / Forums Index / Hardware and OS Related Technologies / Website Technology Issues
Forum Library, Charter, Moderators: phranque

Website Technology Issues Forum

    
Can charset choice affect download time
An observation? A quirk?
Marshall




msg:658528
 9:13 pm on Feb 25, 2002 (gmt 0)

On most of my sites I use HTML 4.01 transitional with a charset of iso-8859-1. There was one site that was using charset of windows-1252 (by editor program default) with the 4.01 transitional.

Last night I changed the charset on that site to 8859-1 and it seems the pages download faster. Is this possible or just a quirk.

 

brotherhood of LAN




msg:658529
 10:20 pm on Feb 25, 2002 (gmt 0)

Just a side note, does displying the character set in the head even matter?

I dont include it in my head of pages and they all view fine.

bird




msg:658530
 11:09 pm on Feb 25, 2002 (gmt 0)

does displying the character set in the head even matter?

Historically, the default for HTML was iso-8859-1, so that's what you're supposed to use when you don't specify anything. I'm not sure about HTML 4.01, but I think a well formed and conforming XHTML page must contain a content-type/charset header (unless the information is provided in the http headers by the server).

Why does it matter? If you don't specify anything, but write your texts on Windows, then you're likely to end up with special characters that are encoded in the wrong character set. The most common example are double quotes encoded in Windows-1252, which will look strangely in a standards conforming browser that tries to display the page as iso-8859-1 (question marks on Linux, empty boxes on a Mac).

As you might guess by now, IE will display pages very nicely that are incorrectly encoded in this way. You'd be surprised how many high profile sites don't get this right. Of course, once you start to use languages that include non-ascii characters, things get even funnier...;)

bill




msg:658531
 2:48 am on Feb 26, 2002 (gmt 0)

does displying the character set in the head even matter?
bird summed it up very nicely...I just wanted to add an amen. ;)

There are quite a few people out there on the net that don't have English set as their default language. Without a charset in the <head>, you'd be surprised what shows up. I surf a lot between Japanese, Chinese and English, and can't tell you what a pain it is to have to manually set the encoding for each page I visit on some sites. All this hassle could have been avoided if the person who made the page had simply included that charset info...

bird




msg:658532
 3:46 am on Feb 26, 2002 (gmt 0)

Coming back to the original question:
The download itself will ceratinly take the same time, as you didn't change the size of the document. If there is any apparent speed difference, then that's possibly due to different amounts of processing required.

I don't know if this is really the case, I'm just making an educated guess here. But since the original default is iso-8859-1, it could well be that the rendering engine in your browser is optimized for that. This would mean that a page encoded in a different character set would have to get munged internally into something more easily digestable for the rendering engine. This shouldn't take more than split seconds (if it really happens), but could still be noticeable under certain circumstances.

But then, small speed differences can be caused by any other thing that does or doesn't happen on your machine, especially under Windows. And human perception is very easy to fool as well. Things may look faster to you eg. when you're tired, because you aren't prepared to follow them in the same way as you would be after the third cup of coffee in the morning...;)

brotherhood of LAN




msg:658533
 4:11 am on Feb 26, 2002 (gmt 0)

I second you on that Bird

The modem or other means of net access will have to retrieve a set amount of data on a page, i.e. so many 1's and 0's

Its the browsers job to interpret it so I assume a difference in speed may come down to how many windows you have open etc, or if your browser has a certain "char set" more readily accessible, thus the faster speed. I guess these boring comp architecture classes I sit through give me an inkling of what REALLY goes on with computers :)

Son_House




msg:658534
 5:49 am on Feb 26, 2002 (gmt 0)

I noticed when using Netscape 4.73 and the page used charset=UTF-8 all I would get was little square box's and no readable text until the whole page was downloaded. After the whole page was downloaded, Netscape would quickly reload the page and it would be readable. IE 5 and 6 did not have that problem for me.

Marshall




msg:658535
 7:07 am on Feb 26, 2002 (gmt 0)

bird,

I agree about the perception bit, but this was notably faster. The site is an entertainment industry directory I operate and check frequently. If fact, when I noticed it after I changed the charset, the modem transfer speed displayed on my computer was about 3000 bytes slower than usually. That's another reason I thought the charset made a difference.

Global Options:
 top home search open messages active posts  
 

Home / Forums Index / Hardware and OS Related Technologies / Website Technology Issues
rss feed

All trademarks and copyrights held by respective owners. Member comments are owned by the poster.
Home ¦ Free Tools ¦ Terms of Service ¦ Privacy Policy ¦ Report Problem ¦ About ¦ Library ¦ Newsletter
WebmasterWorld is a Developer Shed Community owned by Jim Boykin.
© Webmaster World 1996-2014 all rights reserved