Mail Archives: cygwin/2015/10/28/23:59:18

X-Recipient: archive-cygwin AT delorie DOT com
Subject: Re: Bug in collation functions?
To: cygwin AT cygwin DOT com
References: <563148AF DOT 1000502 AT cornell DOT edu>
From: Eric Blake <eblake AT redhat DOT com>
Openpgp: url=http://people.redhat.com/eblake/eblake.gpg
Message-ID: <5631996D.7040908@redhat.com>
Date: Wed, 28 Oct 2015 21:58:37 -0600
User-Agent: Mozilla/5.0 (X11; Linux x86_64; rv:38.0) Gecko/20100101 Thunderbird/38.3.0
MIME-Version: 1.0
In-Reply-To: <563148AF.1000502@cornell.edu>
X-IsSubscribed: yes

On 10/28/2015 04:14 PM, Ken Brown wrote:
> It's my understanding that collation is supposed to take whitespace and
> punctuation into account in the POSIX locale but not in other locales.

Not quite right. It is up to the locale definition whether whitespace
affects collation.  But you are correct that in the POSIX locale,
whitespace must not be ignored in collation.

> This doesn't seem to be the case on Cygwin.  Here's a test case using
> wcscoll, but the same problem occurs with strcoll.

That's because the locale definitions in Cygwin differ from those in
glibc.  But it is not a bug in Cygwin; POSIX allows different systems
to ship different locale definitions under the same locale name, such
as en_US.UTF-8.
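
A minimal sketch of how one might check this on a given system, assuming
nothing beyond standard setlocale() and strcoll(). The helper name
collate_sign and its return convention are hypothetical, chosen only for
illustration; they do not come from the test case discussed in this thread.

```c
#include <assert.h>
#include <locale.h>
#include <string.h>

/*
 * Hypothetical helper: compare a and b with strcoll() under the named
 * LC_COLLATE locale. On success, stores -1, 0, or 1 in *sign and
 * returns 0. Returns -1 if the locale is unavailable on this system.
 */
int collate_sign(const char *locale_name, const char *a, const char *b,
                 int *sign)
{
    char saved[256];
    const char *current;
    int r;

    assert(locale_name != NULL && a != NULL && b != NULL && sign != NULL);

    /* Copy the current setting; a later setlocale() call may overwrite it. */
    current = setlocale(LC_COLLATE, NULL);
    if (current == NULL || strlen(current) >= sizeof saved)
        return -1;
    strcpy(saved, current);

    if (setlocale(LC_COLLATE, locale_name) == NULL)
        return -1;              /* locale not installed or not supported */

    r = strcoll(a, b);
    *sign = (r > 0) - (r < 0);  /* only the sign of strcoll() is meaningful */

    setlocale(LC_COLLATE, saved);   /* restore the caller's collation locale */
    return 0;
}
```

Calling collate_sign("C", "a b", "ab", &s) should yield a negative sign,
because in the C/POSIX locale the space is significant and sorts before
'b'. The same call with "en_US.UTF-8" may give a different sign on Cygwin
than on glibc, or fail if that locale is not installed; either outcome
depends on that system's locale definition.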

-- 
Eric Blake   eblake redhat com    +1-919-301-3266
Libvirt virtualization library http://libvirt.org


