X-Recipient: archive-cygwin AT delorie DOT com X-SWARE-Spam-Status: No, hits=-2.0 required=5.0 tests=AWL,BAYES_00,SARE_MSGID_LONG40,SPF_PASS X-Spam-Check-By: sourceware.org MIME-Version: 1.0 In-Reply-To: References: <416096c61001230751m308ac854x4f026b1f83b966d0 AT mail DOT gmail DOT com> <20100124093750 DOT GA2402 AT calimero DOT vinschen DOT de> <416096c61001240217l130c3e05ob5df918fd822be2d AT mail DOT gmail DOT com> <20100124120508 DOT GD2402 AT calimero DOT vinschen DOT de> <20100124123212 DOT GE2402 AT calimero DOT vinschen DOT de> <20100124162358 DOT GG2402 AT calimero DOT vinschen DOT de> <20100126170615 DOT GV2402 AT calimero DOT vinschen DOT de> Date: Wed, 27 Jan 2010 06:28:23 +0000 Message-ID: <416096c61001262228m2e9f8604xed00e5c91d8006f2@mail.gmail.com> Subject: Re: Please support CP932. (I have problem using subversion with SJIS) From: Andy Koppe To: cygwin AT cygwin DOT com Content-Type: text/plain; charset=UTF-8 X-IsSubscribed: yes Mailing-List: contact cygwin-help AT cygwin DOT com; run by ezmlm Precedence: bulk List-Id: List-Unsubscribe: List-Subscribe: List-Archive: List-Post: List-Help: , Sender: cygwin-owner AT cygwin DOT com Mail-Followup-To: cygwin AT cygwin DOT com Delivered-To: mailing list cygwin AT cygwin DOT com 2010/1/27 Kazuhiro Fujieda: > Andy said `Seems SJIS really isn't suited for Unix command line > use.' I said there is no problem, no difference with EUC-JP, and > no need to change the default. That comment primarily referred to standard SJIS, with its mappings of the ASCII backslash and tilde codepoints to yen and overscore. This is addressed by using MS's CP932 version instead. Another issue is SJIS's use of bytes in the ASCII range as trailing byte, which is bound to confuse programs that have been written without considering that. Yes, any such programs should be considered buggy, but they won't fix themselves. But that's beside the point. The change is not due to SJIS's shortcomings, but for the purpose of Unix/Linux compatibility. There's no particular requirement on the Windows side that "ja_JP" should mean SJIS, but there are Unix programs that assume that "ja_JP" means eucJP. Also, please remember that Cygwin's default locale is "C.UTF-8", i.e. Japanese users (like everyone else) will be using UTF-8 by default, not eucJP. The change only concerns users who explicitly set the locale to "ja_JP". They will need to set it to "ja_JP.SJIS" instead to stick with SJIS. > The default character encoding in the Japanese locale has become > UTF-8 in other POSIX systems. In most of all Linux distributions, > the default is UTF-8. In OpenSolaris, the default is also UTF-8. Are you saying that plain "ja_JP" without an explicit charset implies UTF-8 there? Or are they setting the locale to "ja_JP.UTF-8"? > I believe there is no need to change the default in Cygwin to EUC-JP. Just to stress this again: Cygwin 1.7's default is and remains UTF-8. Andy -- Problem reports: http://cygwin.com/problems.html FAQ: http://cygwin.com/faq/ Documentation: http://cygwin.com/docs.html Unsubscribe info: http://cygwin.com/ml/#unsubscribe-simple