delorie.com/archives/browse.cgi   search  
Mail Archives: cygwin/2010/01/29/16:11:44

X-Recipient: archive-cygwin AT delorie DOT com
X-Spam-Check-By: sourceware.org
Date: Fri, 29 Jan 2010 22:11:25 +0100
From: Corinna Vinschen <corinna-cygwin AT cygwin DOT com>
To: cygwin AT cygwin DOT com
Subject: Re: Japanese/Chinese language question
Message-ID: <20100129211125.GC28659@calimero.vinschen.de>
Reply-To: cygwin AT cygwin DOT com
Mail-Followup-To: cygwin AT cygwin DOT com
References: <20100121134055 DOT GE2402 AT calimero DOT vinschen DOT de> <uiqak1y1z DOT fsf AT acm DOT org>
MIME-Version: 1.0
In-Reply-To: <uiqak1y1z.fsf@acm.org>
User-Agent: Mutt/1.5.20 (2009-06-14)
Mailing-List: contact cygwin-help AT cygwin DOT com; run by ezmlm
List-Id: <cygwin.cygwin.com>
List-Unsubscribe: <mailto:cygwin-unsubscribe-archive-cygwin=delorie DOT com AT cygwin DOT com>
List-Subscribe: <mailto:cygwin-subscribe AT cygwin DOT com>
List-Archive: <http://sourceware.org/ml/cygwin/>
List-Post: <mailto:cygwin AT cygwin DOT com>
List-Help: <mailto:cygwin-help AT cygwin DOT com>, <http://sourceware.org/ml/#faqs>
Sender: cygwin-owner AT cygwin DOT com
Mail-Followup-To: cygwin AT cygwin DOT com
Delivered-To: mailing list cygwin AT cygwin DOT com

On Jan 30 05:00, Kazuhiro Fujieda wrote:
> >>> On Thu, 21 Jan 2010 14:40:55 +0100
> >>> Corinna Vinschen said:
> 
> > When comparing strings linguistically (strcoll/wcscoll),
> >
> > - are Hiragana and Katakana forms of the same character to be
> >   treated as equal or as different?
> 
> They should be treated as different.
> 
> > - are half-width and full-width forms of the same CJK character
> >   treated as equal or as different?
> 
> Different, too.
> 
> It is difficult to implement the collation algorithm from
> scratch. I recommend to use LCMapString to generate sort keys.

Yes, that's how I implemented it.  strcoll/wcscoll are using
CompareStringW, strxfrm/wcsxfrm are using LCMapStringW.

I was just asking because I wasn't sure if I had to use the
NORM_IGNOREKANATYPE and NORM_IGNOREWIDTH flags or not.  But another look
into the definition of strcoll/strxfrm answered the question eventually.  


Thanks,
Corinna

-- 
Corinna Vinschen                  Please, send mails regarding Cygwin to
Cygwin Project Co-Leader          cygwin AT cygwin DOT com
Red Hat

--
Problem reports:       http://cygwin.com/problems.html
FAQ:                   http://cygwin.com/faq/
Documentation:         http://cygwin.com/docs.html
Unsubscribe info:      http://cygwin.com/ml/#unsubscribe-simple

- Raw text -


  webmaster     delorie software   privacy  
  Copyright © 2019   by DJ Delorie     Updated Jul 2019