X-Recipient: archive-cygwin AT delorie DOT com X-SWARE-Spam-Status: No, hits=-1.9 required=5.0 tests=AWL,BAYES_00,SARE_MSGID_LONG40,SPF_PASS X-Spam-Check-By: sourceware.org MIME-Version: 1.0 In-Reply-To: <20090512165404.GW21324@calimero.vinschen.de> References: <20090512165404 DOT GW21324 AT calimero DOT vinschen DOT de> Date: Tue, 12 May 2009 17:56:47 +0100 Message-ID: <416096c60905120956n5521929bm69586f5e6325a994@mail.gmail.com> Subject: Re: [Fwd: [1.7] wcwidth failing configure tests] From: Andy Koppe To: newlib AT sourceware DOT org, cygwin AT cygwin DOT com Content-Type: text/plain; charset=UTF-8 Content-Transfer-Encoding: quoted-printable X-IsSubscribed: yes Mailing-List: contact cygwin-help AT cygwin DOT com; run by ezmlm Precedence: bulk List-Id: List-Unsubscribe: List-Subscribe: List-Archive: List-Post: List-Help: , Sender: cygwin-owner AT cygwin DOT com Mail-Followup-To: cygwin AT cygwin DOT com Delivered-To: mailing list cygwin AT cygwin DOT com > And here's another question. =C2=A0The utf8*.h files claim they have been > generated from the unicode.txt file of the Unicode 3.2 standard. =C2=A0Do= we > have the script which generated the utf8*.h files? =C2=A0Can we regenerate > the files to match the current Unicode 5.1 standard? There's Markus Kuhn's wcwidth implementation, which says it's based on Unicode 5.0: http://www.cl.cam.ac.uk/~mgk25/ucs/wcwidth.c Trouble is, there's the thorny issue of the "CJK Ambiguous Width" category of characters, which consists of things like Greek and Cyrillic letters as well as line drawing symbols. Those have a width of 1 in Western use, yet with CJK fonts they have a width of 2. That's why Markus Kuhn's code includes the mk_wcswidth_cjk() variant. Andy -- Unsubscribe info: http://cygwin.com/ml/#unsubscribe-simple Problem reports: http://cygwin.com/problems.html Documentation: http://cygwin.com/docs.html FAQ: http://cygwin.com/faq/