delorie.com/archives/browse.cgi   search  
Mail Archives: cygwin/2006/03/28/20:44:00

X-Spam-Check-By: sourceware.org
Message-ID: <4429E652.60501@byu.net>
Date: Tue, 28 Mar 2006 18:43:46 -0700
From: Eric Blake <ebb9 AT byu DOT net>
User-Agent: Thunderbird 1.5 (Windows/20051201)
MIME-Version: 1.0
To: cygwin AT cygwin DOT com
Subject: Re: Locales with wrong umlauts
References: <loom DOT 20060326T135539-102 AT post DOT gmane DOT org> <Pine DOT GSO DOT 4 DOT 63 DOT 0603272344260 DOT 18642 AT access1 DOT cims DOT nyu DOT edu> <44294107 DOT 8050107 AT byu DOT net>
In-Reply-To: <44294107.8050107@byu.net>
X-IsSubscribed: yes
Mailing-List: contact cygwin-help AT cygwin DOT com; run by ezmlm
List-Subscribe: <mailto:cygwin-subscribe AT cygwin DOT com>
List-Archive: <http://sourceware.org/ml/cygwin/>
List-Post: <mailto:cygwin AT cygwin DOT com>
List-Help: <mailto:cygwin-help AT cygwin DOT com>, <http://sourceware.org/ml/#faqs>
Sender: cygwin-owner AT cygwin DOT com
Mail-Followup-To: cygwin AT cygwin DOT com
Delivered-To: mailing list cygwin AT cygwin DOT com

According to Eric Blake on 3/28/2006 6:58 AM:
> Interesting question; I don't recall ever seeing this topic asked before
> here or upstream, so I don't know where to search either.  I'll do some
> upstream asking, to see what might be improved.

As for whether upstream will agree to make the default quoting style look
at the environment variable QUOTING_STYLE instead of blindly going with
'locale', the jury is still out.

But I am currently playing with a hack in my copy of the implementation of
the 'locale' quoting style, where if the high bit is set (c >= 0x80), then
I treat the character as a literal printing char (under the assumption
that the current code page and font can distinctly render the 8-bit
character) rather than the current scenario of turning the character into
an octal escape.  It worked for my default Latin character set, at least.
I plan on releasing another test version of coreutils soon with this hack
in place, for comments; and if it works out, the hack will also be applied
to the next findutils and tar releases (since all three packages use the
quote() module of gnulib).  This hack should only be needed on newlib
platforms, where isprint() does not know about locales other than "C".

-- 
Life is short - so eat dessert first!

Eric Blake             ebb9 AT byu DOT net










--
Unsubscribe info:      http://cygwin.com/ml/#unsubscribe-simple
Problem reports:       http://cygwin.com/problems.html
Documentation:         http://cygwin.com/docs.html
FAQ:                   http://cygwin.com/faq/

- Raw text -


  webmaster     delorie software   privacy  
  Copyright © 2019   by DJ Delorie     Updated Jul 2019