Forum Moderators: open
I thought that I had originally used JIS encoding (charset=iso-2022-jp) and thus used this encoding in the updated site. I have just changed it to Shift JIS (charset=Shift_JIS). Is this the best? Will it solve my problems?
<snip>
<Policy Note - I'll leave the link for a few days to help solve this problem, after which it will be removed as per the TOS. - Woz>
<Snipped Google search as per note above. - Woz>
[edited by: Woz at 12:13 am (utc) on Aug. 14, 2002]
charset=iso-2022-jp is causing the ??? problems with indexing in Google. I checked your site in google.co.jp using Japanese Win98 and Japanese IE 5.5 and the ????? are listed in the title.
> I have just changed it to Shift JIS (charset=Shift_JIS). Is this the best? Will it solve my problems?
charset=Shift_Jis or charset=x-sjis should solve the problem. It will still appear as ???? until the next update of Google's database.
I have always used charset=x-sjis with great success.
The ????? appear when viewed using my local Linux box and the same would probably apply with Google because they use Linux servers.
The following discussion has more info about Japanese text and compatability problems
[webmasterworld.com...]
I was posting around the same time and used the business name in Japanese as keywords and the ?????? appeared in the title.
Clicking on the link from your latest post the results were really weird. I checked the source code for your homepage and noticed that charset=iso-2022-jp is there. That is definately creating the problems becuse there is more than one character set in iso-2022-jp
Shift_JIS is best for overall results using Windows
JIS is very limited and not recommended
EUC is used on Unix/Linux boxes for indexing Japanese text in databases.
Brownie I checked your listings on a Japanese system and came up with pretty much the same conclusions as Gorufu. That iso-2022-jp character set you're using is not one of my favorites. (You should see what it does to Opera 6.x on an English OS...yeech) Like Gorufu I'm a big fan of x-sjis encoding for Japanese pages. It rarely gets mucked up in any of the major browsers.
A few other encoding suggestions for your HTML that aren't absolutely necessary, but won't hurt, are the following:
<html lang="ja"> <meta http-equiv="content-language" content="ja">
<meta name="language" content="ja"> [edited by: bill at 8:02 am (utc) on Nov. 25, 2004]