X-Recipient: archive-cygwin AT delorie DOT com X-Spam-Check-By: sourceware.org Date: Tue, 12 May 2009 22:08:53 +0200 From: Corinna Vinschen To: cygwin AT cygwin DOT com Subject: Re: [1.7] Proposal: the filename encoding in C locale uses UTF-8 instead of SO/UTF-8 Message-ID: <20090512200853.GA20162@calimero.vinschen.de> Reply-To: cygwin AT cygwin DOT com Mail-Followup-To: cygwin AT cygwin DOT com References: <3f0ad08d0905121029j119c8a7ep41d3a261d8bea338 AT mail DOT gmail DOT com> <20090512173741 DOT GZ21324 AT calimero DOT vinschen DOT de> <20090512192253 DOT GB21324 AT calimero DOT vinschen DOT de> MIME-Version: 1.0 Content-Type: text/plain; charset=us-ascii Content-Disposition: inline In-Reply-To: User-Agent: Mutt/1.5.19 (2009-02-20) Mailing-List: contact cygwin-help AT cygwin DOT com; run by ezmlm Precedence: bulk List-Id: List-Unsubscribe: List-Subscribe: List-Archive: List-Post: List-Help: , Sender: cygwin-owner AT cygwin DOT com Mail-Followup-To: cygwin AT cygwin DOT com Delivered-To: mailing list cygwin AT cygwin DOT com On May 12 15:53, Mark J. Reed wrote: > On Tue, May 12, 2009 at 3:22 PM, Corinna Vinschen > > > > http://cygwin.com/1.7/cygwin-ug-net/using-specialnames.html#pathnames-unusual > > OK, got it. So Mr. Iwamuro's proposal is that Cygwin ignore the > locale setting, and just automatically convert the Windows UTF-16 > filenames to UTF-8 (and back) no matter what. No. Only if LANG=C. > That seems rife with possible confusion, though. If I have my codepage > set to ISO-2022 and paste in a filename, I expect it to be interpreted Cygwin 1.7 doesn't use the codepage. It uses what $LANG says. See http://cygwin.com/1.7/cygwin-ug-net/setup-locale.html > as ISO-2022, not as UTF-8 (which will probably fail with an invalid > encoding sequence). > > OTOH, the SO/UTF-8 hack would seem to bode ill for the portability of, > say, tar archives created under Cygwin. The filenames potentially look weird, but they are valid filenames. If anybody has a better idea how to workaround the problem of UTF-16 chars which don't translate into the current singlebyte or multibyte charset, feel free to suggest. Corinna -- Corinna Vinschen Please, send mails regarding Cygwin to Cygwin Project Co-Leader cygwin AT cygwin DOT com Red Hat -- Unsubscribe info: http://cygwin.com/ml/#unsubscribe-simple Problem reports: http://cygwin.com/problems.html Documentation: http://cygwin.com/docs.html FAQ: http://cygwin.com/faq/