Mail Archives: cygwin/2009/06/06/03:24:43

Date: Sat, 6 Jun 2009 08:24:20 +0100
Message-ID: <416096c60906060024k7e10d469i7ac670be0b8e989b@mail.gmail.com>
Subject: Re: [Fwd: [1.7] wcwidth failing configure tests]
From: Andy Koppe <andy DOT koppe AT gmail DOT com>
To: cygwin AT cygwin DOT com
Cc: newlib AT sourceware DOT org

2009/6/5 Thomas Wolff:
> the locale syntax allows for an optional "modifier" which can be used to
> specify deviations, e.g.
>        de_DE           has charmap ISO-8859-1
>        de_DE@euro      has charmap ISO-8859-15
>        uz_UZ           has charmap ISO-8859-1
>        uz_UZ@cyrillic  has charmap UTF-8
>        aa_ER and aa_ER@saaho both have charmap UTF-8 (with some other difference).
> Thus you could define e.g.
>        ja_JP.UTF-8@cjk
> or
>        ja_JP.UTF-8@cjkwidth
> to indicate CJK width properties. I guess this is the most compliant way to go.

This looks the right approach to me.

However, to make the locale setting more convenient for CJK users,
there could be modifiers for both widths. Without modifier, the CJK
locales would default to "Ambiguous Wide", while everything else would
default to "Ambiguous Narrow".

In the time-honoured tradition of keeping Unix identifiers brief and
obscure, I propose the modifiers should be "@aw" and "@an". Otherwise,
how about "@ambigwide" and "@ambignarrow"?
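
To make the proposed defaulting rule concrete, here is a rough sketch in Python. It is only an illustration, not an existing implementation: the modifier names are the ones suggested in this message, the set of languages treated as CJK is my assumption, and the function name ambiguous_width is made up for the example.

```python
import re

# Hypothetical modifier names from the proposal in this message; no libc
# is known to implement them.
WIDE_MODIFIERS = {"aw", "ambigwide"}
NARROW_MODIFIERS = {"an", "ambignarrow"}

# Assumption: which language codes count as "CJK locales".
CJK_LANGUAGES = {"ja", "ko", "zh"}

# language[_territory][.charset][@modifier]
LOCALE_RE = re.compile(
    r"^(?P<lang>[A-Za-z]+)"
    r"(?:_(?P<territory>[A-Za-z0-9]+))?"
    r"(?:\.(?P<charset>[^@]+))?"
    r"(?:@(?P<modifier>.+))?$"
)


def ambiguous_width(locale_name):
    """Return the column width (1 or 2) for East Asian Ambiguous characters."""
    m = LOCALE_RE.match(locale_name)
    if m is None:
        return 1  # unparseable names fall back to narrow
    modifier = m.group("modifier")
    if modifier in WIDE_MODIFIERS:
        return 2  # explicit "Ambiguous Wide"
    if modifier in NARROW_MODIFIERS:
        return 1  # explicit "Ambiguous Narrow"
    # No width modifier: CJK locales default to wide, everything else to narrow.
    return 2 if m.group("lang").lower() in CJK_LANGUAGES else 1


if __name__ == "__main__":
    for name in ("ja_JP.UTF-8", "ja_JP.UTF-8@an", "de_DE.UTF-8", "de_DE.UTF-8@aw"):
        print(name, ambiguous_width(name))
```

An unrelated modifier such as "@euro" leaves the language default in place, so existing locale names keep their current behaviour.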

Calling it something like "cjkwide" has the problem that it gives the
impression that the actual CJK ideographs are affected by this,
whereas this really concerns things like line drawing characters and
non-Latin non-CJK letters. That confused me to start with anyway.
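
The distinction can be checked against the Unicode East Asian Width property. The short Python sketch below uses the standard unicodedata module. The sample characters are my own choice for illustration: a box-drawing character and Cyrillic and Greek letters are class "A" (Ambiguous), while a CJK ideograph is class "W" (Wide) regardless of any ambiguous-width setting.

```python
import unicodedata

# Sample characters chosen for illustration only.
samples = {
    "box drawing U+2500": "\u2500",
    "Cyrillic ya U+044F": "\u044f",
    "Greek alpha U+03B1": "\u03b1",
    "CJK ideograph U+6F22": "\u6f22",
    "Latin a U+0061": "a",
}


def width_class(ch):
    """Return the East Asian Width class of a single character."""
    return unicodedata.east_asian_width(ch)


if __name__ == "__main__":
    for label, ch in samples.items():
        print(f"{label}: {width_class(ch)}")
```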

Puzzled that this hasn't been solved in glibc years ago ...

Andy
