delorie.com/archives/browse.cgi   search  
Mail Archives: cygwin/2013/07/22/16:41:10

X-Recipient: archive-cygwin AT delorie DOT com
DomainKey-Signature: a=rsa-sha1; c=nofws; d=sourceware.org; h=list-id
:list-unsubscribe:list-subscribe:list-archive:list-post
:list-help:sender:date:from:to:subject:message-id:reply-to
:references:mime-version:content-type:in-reply-to; q=dns; s=
default; b=ja+kTpgfcs5oryL+sFiaQsdplGFzYT64BidWCHp0lWXB1Owd1D4+m
JJuM9111fnpLI+drpMyC63XigQ5IiPT+p+hNC+D80w84yK+laq72v+BRUUFhq440
eFkwvc7nkm4FwoJS2KzAbUlxKPBnS+L+HVPS8/fgcST7rzMm3S5fpA=
DKIM-Signature: v=1; a=rsa-sha1; c=relaxed; d=sourceware.org; h=list-id
:list-unsubscribe:list-subscribe:list-archive:list-post
:list-help:sender:date:from:to:subject:message-id:reply-to
:references:mime-version:content-type:in-reply-to; s=default;
bh=+2+QgsrsKHLDFsT8u8LQBYCggrE=; b=uTDBpiy+HKoF1D8og0jeBOyMctBq
rfJXzIly0JMichBVIVyyAQwnEG0KLeYIi9CIKJHqXlR7GTPvrPP4zLsazVRT0TVu
G5pu1eq0cLpoZnXSH6N8N59bv/Eb+VYyumGkonevpLvFKKeBda3lSjbhy8OVD4/U
luGxgpnaEzDRHB0=
Mailing-List: contact cygwin-help AT cygwin DOT com; run by ezmlm
List-Id: <cygwin.cygwin.com>
List-Subscribe: <mailto:cygwin-subscribe AT cygwin DOT com>
List-Archive: <http://sourceware.org/ml/cygwin/>
List-Post: <mailto:cygwin AT cygwin DOT com>
List-Help: <mailto:cygwin-help AT cygwin DOT com>, <http://sourceware.org/ml/#faqs>
Sender: cygwin-owner AT cygwin DOT com
Mail-Followup-To: cygwin AT cygwin DOT com
Delivered-To: mailing list cygwin AT cygwin DOT com
X-Spam-SWARE-Status: No, score=0.1 required=5.0 tests=AWL,BAYES_50,RDNS_NONE,TW_EG autolearn=no version=3.3.1
Date: Mon, 22 Jul 2013 22:40:41 +0200
From: Corinna Vinschen <corinna-cygwin AT cygwin DOT com>
To: cygwin AT cygwin DOT com
Subject: Re: regex library fails git tests
Message-ID: <20130722204041.GA9473@calimero.vinschen.de>
Reply-To: cygwin AT cygwin DOT com
Mail-Followup-To: cygwin AT cygwin DOT com
References: <ksepor$cag$1 AT ger DOT gmane DOT org> <20130721193953 DOT GC2661 AT calimero DOT vinschen DOT de> <51ECA00D DOT 6030105 AT gmail DOT com> <20130722081200 DOT GE2661 AT calimero DOT vinschen DOT de> <51ED6933 DOT 4050400 AT redhat DOT com>
MIME-Version: 1.0
In-Reply-To: <51ED6933.4050400@redhat.com>
User-Agent: Mutt/1.5.21 (2010-09-15)

On Jul 22 11:17, Eric Blake wrote:
> On 07/22/2013 02:12 AM, Corinna Vinschen wrote:
> 
> >>> However, please note that this behaviour, while being provided by glibc
> >>> and now by Cygwin, is *not* standards-compliant.  In the narrow sense
> >>> the characters beyond 0x7f are still invalid ASCII chars, and other
> >>> functions working with wchar_t strings won't be as forgiving when using
> >>> invalid input.
> >>>
> 
> > After some sleep, I think I now understand why the glibc devs made
> > regcomp to work this way.  This behaviour is backward compatible to non
> > locale-aware applications.  In the "C" locale, a char is just some
> > arbitrary byte between 0 and 255.  So this pattern always worked before
> > in the "C locale, therefore it makes sense that it continues to work,
> > even if it won't when using other locales/codesets.
> 
> By the way, there is currently a big debate going on in the Austin Group
> (the people responsible for POSIX) on whether the "C" locale must be
> 8-bit clean (the way glibc behaves) or whether it was intended to allow
> UTF-8 encoding by default (the way musl libc wants to behave); and
> resolution of the debate will require input from the C standards
> committee.  There may be some interesting fallout, no matter which
> solution is finally reached.  http://austingroupbugs.net/view.php?id=663

Thanks for letting us know.  This really may get interesting...


Corinna

-- 
Corinna Vinschen                  Please, send mails regarding Cygwin to
Cygwin Maintainer                 cygwin AT cygwin DOT com
Red Hat

--
Problem reports:       http://cygwin.com/problems.html
FAQ:                   http://cygwin.com/faq/
Documentation:         http://cygwin.com/docs.html
Unsubscribe info:      http://cygwin.com/ml/#unsubscribe-simple

- Raw text -


  webmaster     delorie software   privacy  
  Copyright © 2019   by DJ Delorie     Updated Jul 2019