Welcome to WebmasterWorld Guest from 126.96.36.199
Forum Moderators: mack
[edited by: Brett_Tabke at 11:14 am (utc) on Oct. 12, 2004]
[edit reason] split off from older thread [/edit]
I looked at one of my pages, and there is some minor munging of content. For example, the © symbol shows as a question mark. This is on a page which declares itself to be charset=iso-8859-1", using charcter code 169 for the copyright. I.e., I am not using one of those "ampersand umptisquat semi-colon" special HTML sequences. (I probably should, but I don't. ISO-8859-1 should be a sufficient declaration of context.)
... just checked the encoding of the cache'd copy. It's UTF-8, which I understand. ("... and one encoding to rule them all!") Their ISO-8859-1 to UTF-8 converter has a minor glitch. Checked another page using the footnote symbol ¹ (code 185). It displayed as a question mark also. At a guess any char greater than 127 will be mis-displayed for ISO-8859-1 source pages.
Wow, this is getting weirder. The ¹ (super-script 1) displays correctly on the SERP, which is also encoded as UTF-8. MS Search must be using two different encoding converters.
Having the cache date is nice, also having top and bottom next >> previous links, that's a good touch.
Somehow I doubt that MSN will be unveiling their version of the sandbox any time soon, Google will now have to stop doing that unless they really want to lose marketshare, it's getting exciting.
One BIG plus about the new MSN is that it reflects changes to sites very fast while the results of the current MSN are totally stale... they don't seem to change for months on end.
Looks like we'll have to wait another month for the real excitment...