X-Recipient: archive-cygwin AT delorie DOT com X-SWARE-Spam-Status: No, hits=-2.2 required=5.0 tests=AWL,BAYES_00,SPF_HELO_PASS,SPF_PASS X-Spam-Check-By: sourceware.org To: cygwin AT cygwin DOT com From: Lapo Luchini Subject: Re: The C locale Date: Tue, 22 Sep 2009 18:26:32 +0200 Lines: 37 Message-ID: References: <416096c60908300959i1e0084b1xc8f6e65e792b035d AT mail DOT gmail DOT com> <3f0ad08d0909020656v7d9fce6ft4afea63ed363b9a9 AT mail DOT gmail DOT com> <416096c60909071308qc5ff057sbe9cb1dbc270554f AT mail DOT gmail DOT com> <20090908193456 DOT GC17515 AT calimero DOT vinschen DOT de> <416096c60909081449r1fe024dbm7b82a3719be05e9e AT mail DOT gmail DOT com> <20090921103758 DOT GE20981 AT calimero DOT vinschen DOT de> <416096c60909211420g4ac8ea93l80fc1f00dcd5c0f3 AT mail DOT gmail DOT com> <416096c60909212347r7e03a4f3q7d518ff7e8bce55d AT mail DOT gmail DOT com> <416096c60909220549jaa601d9l26621e9910136a3 AT mail DOT gmail DOT com> Mime-Version: 1.0 Content-Type: text/plain; charset=UTF-8 Content-Transfer-Encoding: 8bit User-Agent: Thunderbird 2.0.0.23 (X11/20090831) In-Reply-To: <416096c60909220549jaa601d9l26621e9910136a3@mail.gmail.com> OpenPGP: id=C8F252FB X-IsSubscribed: yes Mailing-List: contact cygwin-help AT cygwin DOT com; run by ezmlm Precedence: bulk List-Id: List-Unsubscribe: List-Subscribe: List-Archive: List-Post: List-Help: , Sender: cygwin-owner AT cygwin DOT com Mail-Followup-To: cygwin AT cygwin DOT com Delivered-To: mailing list cygwin AT cygwin DOT com Note-from-DJ: This may be spam Andy Koppe wrote: > You've presumably got mintty set to UTF-8, hence mintty's output > conversion turned ls's ISO-8859-1 "Ť" (i.e. "\xC3\xA4") into "ä". There never was any ISO-8859-1 "Ť" in the first place, only one a-umlaut entered in WindowsExplorer (in the expected way) and correctly interpreted by a UTF8-capable terminal which is doing his job. Nobody ever intended to write a Latin1 string with the meaning of "A-ring + currency symbol" which has been translated by chance in a a-umlaut... >> you mean that a script sees it as 62C3A468 as opposed as 62E468? >> Or that actual "bŤh" is shown somewhere? > > Both. For the latter, try it in the default Cygwin console, without > any locale variables set. OK, if you consider "what is shown in cmd.exe" as "the real stuff" then I agree with you. But cmd.exe isn't even capable of printing the Euro sign (no cygwin involved, I mean the plain Windows Prompt), I guess there's no hope to ever seeing in there anything but a very limited output... (which surprises me a bit: Euro sign is present in CP1252) I agree with you that the "default console" installed by the default installation SHOULD be able to show the more common accents at the very least (àèéìòù in Italy, umaluts and ß in Germany and so on,), but wouldn't it be possible to offer the user *something better* than plain limited cmd.exe, in the default installation? -- Lapo Luchini - http://lapo.it/ “There is no reason anyone would want a computer in their home.” (Ken Olson, founder of DEC, 1977) -- Problem reports: http://cygwin.com/problems.html FAQ: http://cygwin.com/faq/ Documentation: http://cygwin.com/docs.html Unsubscribe info: http://cygwin.com/ml/#unsubscribe-simple