Mail Archives: cygwin/2010/01/27/01:28:41

Date: Wed, 27 Jan 2010 06:28:23 +0000
Message-ID: <416096c61001262228m2e9f8604xed00e5c91d8006f2@mail.gmail.com>
Subject: Re: Please support CP932. (I have problem using subversion with SJIS)
From: Andy Koppe <andy DOT koppe AT gmail DOT com>
To: cygwin AT cygwin DOT com

2010/1/27 Kazuhiro Fujieda:
> Andy said `Seems SJIS really isn't suited for Unix command line
> use.' I said there is no problem, no difference with EUC-JP, and
> no need to change the default.

That comment primarily referred to standard SJIS, which maps the ASCII
backslash and tilde codepoints to yen and overline. This is addressed
by using MS's CP932 variant instead.

Another issue is SJIS's use of bytes in the ASCII range as trailing
bytes, which is bound to confuse programs that were written without
taking that into account. Yes, any such programs should be considered
buggy, but they won't fix themselves.
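
To make the trailing-byte issue concrete, here is a minimal sketch in
Python (not part of the original discussion; the helper names and the
sample path are made up for illustration). It assumes Python's "cp932"
and "euc_jp" codecs and uses the katakana letter SO (U+30BD), whose
CP932 trailing byte is 0x5C, the ASCII backslash.

```python
# Illustrative only: helper names and the sample path are hypothetical.
SO = "\u30bd"  # KATAKANA LETTER SO

sjis = SO.encode("cp932")   # two bytes; the second one is 0x5C
euc = SO.encode("euc_jp")   # two bytes; both have the high bit set


def has_ascii_range_byte(data):
    """Return True if any byte of the encoded text is below 0x80."""
    return any(b < 0x80 for b in data)


def naive_split(path_bytes):
    """Split on the backslash byte, ignoring multibyte characters."""
    return path_bytes.split(b"\\")


# A Windows-style path with one directory and one Japanese file name.
path = ("dir\\" + SO + ".txt").encode("cp932")

# Byte-oriented splitting cuts the character in half ...
naive_parts = naive_split(path)

# ... while decoding first keeps the file name intact.
correct_parts = path.decode("cp932").split("\\")

print(sjis.hex(), euc.hex())
print(naive_parts)
print(correct_parts)
```

The byte-oriented split sees three components instead of two because
it treats the trailing byte of the character as a path separator. The
EUC-JP encoding of the same character contains no byte below 0x80, so
this particular confusion cannot happen there.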

But that's beside the point. The change is not motivated by SJIS's
shortcomings, but by Unix/Linux compatibility. There's no particular
requirement on the Windows side that "ja_JP" should mean SJIS, but
there are Unix programs that assume that "ja_JP" means eucJP.

Also, please remember that Cygwin's default locale is "C.UTF-8", i.e.
Japanese users (like everyone else) will be using UTF-8 by default,
not eucJP. The change only concerns users who explicitly set the
locale to "ja_JP". They will need to set it to "ja_JP.SJIS" instead to
stick with SJIS.


> The default character encoding in the Japanese locale has become
> UTF-8 in other POSIX systems. In almost all Linux distributions,
> the default is UTF-8. In OpenSolaris, the default is also UTF-8.

Are you saying that plain "ja_JP" without an explicit charset implies
UTF-8 there? Or are they setting the locale to "ja_JP.UTF-8"?


> I believe there is no need to change the default in Cygwin to EUC-JP.

Just to stress this again: Cygwin 1.7's default is and remains UTF-8.

Andy
