X-Recipient: archive-cygwin AT delorie DOT com X-SWARE-Spam-Status: No, hits=-1.9 required=5.0 tests=AWL,BAYES_00,SARE_MSGID_LONG40,SPF_PASS X-Spam-Check-By: sourceware.org MIME-Version: 1.0 In-Reply-To: <200906051625.n55GP6t3028411@mail.bln1.bf.nsn-intra.net> References: <20090512165404 DOT GW21324 AT calimero DOT vinschen DOT de> <416096c60905120956n5521929bm69586f5e6325a994 AT mail DOT gmail DOT com> <20090512173153 DOT GY21324 AT calimero DOT vinschen DOT de> <3f0ad08d0905140858j17c7b374paa649f18ef18178d AT mail DOT gmail DOT com> <200905201652 DOT n4KGqYGm000509 AT mail DOT bln1 DOT bf DOT nsn-intra DOT net> <200906051625 DOT n55GP6t3028411 AT mail DOT bln1 DOT bf DOT nsn-intra DOT net> Date: Sat, 6 Jun 2009 08:24:20 +0100 Message-ID: <416096c60906060024k7e10d469i7ac670be0b8e989b@mail.gmail.com> Subject: Re: [Fwd: [1.7] wcwidth failing configure tests] From: Andy Koppe To: cygwin AT cygwin DOT com Cc: newlib AT sourceware DOT org Content-Type: text/plain; charset=UTF-8 Content-Transfer-Encoding: quoted-printable X-IsSubscribed: yes Mailing-List: contact cygwin-help AT cygwin DOT com; run by ezmlm Precedence: bulk List-Id: List-Unsubscribe: List-Subscribe: List-Archive: List-Post: List-Help: , Sender: cygwin-owner AT cygwin DOT com Mail-Followup-To: cygwin AT cygwin DOT com Delivered-To: mailing list cygwin AT cygwin DOT com 2009/6/5 Thomas Wolff: > the locale syntax allows for an optional "modifier" which can be used to > specify deviations, e.g. > =C2=A0 =C2=A0 =C2=A0 =C2=A0de_DE =C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0 has c= harmap ISO-8859-1 > =C2=A0 =C2=A0 =C2=A0 =C2=A0de_DE AT euro =C2=A0 =C2=A0 =C2=A0has charmap ISO= -8859-15 > =C2=A0 =C2=A0 =C2=A0 =C2=A0uz_UZ =C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0 has c= harmap ISO-8859-1 > =C2=A0 =C2=A0 =C2=A0 =C2=A0uz_UZ AT cyrillic =C2=A0has charmap UTF-8 > =C2=A0 =C2=A0 =C2=A0 =C2=A0aa_ER and aa_ER AT saaho both have charmap UTF-8 = (with some other difference). > Thus you could define e.g. > =C2=A0 =C2=A0 =C2=A0 =C2=A0ja_JP DOT UTF-8 AT cjk > or > =C2=A0 =C2=A0 =C2=A0 =C2=A0ja_JP DOT UTF-8 AT cjkwidth > to indicate CJK width properties. I guess this is the most compliant way = to go. This looks the right approach to me. However, to make the locale setting more convenient for CJK users, there could be modifiers for both widths. Without modifier, the CJK locales would default to "Ambiguous Wide", while everything else would default to "Ambiguous Narrow". In the time-honoured tradition of keeping Unix identifiers brief and obscure, I propose the modifiers should be "@aw" and "@an". Otherwise, how about "@ambigwide" and "@ambignarrow"? Calling it something like "cjkwide" has the problem that it gives the impression that the actual CJK ideographs are affected by this, whereas this really concerns things like line drawing characters and non-latin non-CJK letters. That confused me to start with anyway. Puzzled that this hasn't been solved in glibc years ago ... Andy -- Unsubscribe info: http://cygwin.com/ml/#unsubscribe-simple Problem reports: http://cygwin.com/problems.html Documentation: http://cygwin.com/docs.html FAQ: http://cygwin.com/faq/