Mail Archives: cygwin/2023/07/31/17:13:39

Message-ID: <223e3d56-1a63-57ef-5236-bc1df37716a0@Shaw.ca>
Date: Mon, 31 Jul 2023 15:13:22 -0600
Subject: Re: character class "alpha"
To: cygwin AT cygwin DOT com
References: <3884636 DOT 3uDm00564X AT nimes> <ZMfzbOOJth8Mk+rJ AT calimero DOT vinschen DOT de>
<ZMf7aqxU8awRQM4v AT calimero DOT vinschen DOT de> <4474610 DOT kIfH5X4irW AT nimes>
Organization: Inglis
In-Reply-To: <4474610.kIfH5X4irW@nimes>
From: Brian Inglis via Cygwin <cygwin AT cygwin DOT com>
Reply-To: cygwin AT cygwin DOT com
Cc: Brian Inglis <Brian DOT Inglis AT Shaw DOT ca>, Bruno Haible <bruno AT clisp DOT org>

On 2023-07-31 12:43, Bruno Haible via Cygwin wrote:
> Corinna Vinschen wrote:
>> there are more of those expressions which are disabled on glibc and
>> fail on Cygwin, for instance in test-c32iscntrl.c.  Maybe it's actually
>> the better idea to disable them on Cygwin, too, rather than to change
>> a working system...
> 
> Sure. There is no standard how to map the Unicode properties to POSIX
> character classes. Other than the mentioned ISO C constraints for
> 'digit' and 'xdigit' and a few POSIX constraints, you are free to
> map them as you like. For glibc and gnulib, I mapped them in a way
> that seemed to make most sense for applications. But different
> people might come to different meanings of "make sense".

It seems to me that most application developers needing to support 
non-Western-European languages might want a non-POSIX interpretation of digits.

Are the Unicode character property classes supported for applications that
need more than the POSIX limitations allow?

I know that I sometimes want to see some alternative numeric digit forms and 
expect to be able to find those with an appropriate grep expression.
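
To make the difference concrete, here is a minimal sketch in Python (not grep,
and not the glibc/gnulib/Cygwin mapping discussed above). It contrasts the
ASCII-only view of "digit" with the Unicode general category Nd (decimal
number). The sample string and the helper names are made up for illustration.

```python
import re
import unicodedata

# Illustrative sample: ASCII digits, Arabic-Indic digits (U+0664 U+0662),
# and Devanagari digits (U+096A U+0968).
SAMPLE = "room 42, \u0664\u0662, \u096a\u0968"


def is_ascii_digit(ch):
    """ISO C / POSIX style 'digit': only the ten characters 0-9."""
    return "0" <= ch <= "9"


def is_unicode_decimal(ch):
    """Unicode view: any character whose general category is Nd."""
    return unicodedata.category(ch) == "Nd"


def non_ascii_digits(text):
    """Return the decimal digits in text that lie outside ASCII 0-9."""
    return [ch for ch in text if is_unicode_decimal(ch) and not is_ascii_digit(ch)]


# With re.ASCII, \d matches only [0-9]; without it, \d on a str pattern
# matches any Unicode decimal digit.
ascii_runs = re.findall(r"\d+", SAMPLE, flags=re.ASCII)
unicode_runs = re.findall(r"\d+", SAMPLE)

if __name__ == "__main__":
    print(ascii_runs)
    print(unicode_runs)
    print([unicodedata.name(ch) for ch in non_ascii_digits(SAMPLE)])
```

The point of the sketch is only that the two interpretations are separate
questions: an application can keep the narrow 'digit' class and still ask for
the broader Unicode property when it needs the alternative digit forms.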

-- 
Take care. Thanks, Brian Inglis              Calgary, Alberta, Canada

La perfection est atteinte                   Perfection is achieved
non pas lorsqu'il n'y a plus rien à ajouter  not when there is no more to add
mais lorsqu'il n'y a plus rien à retirer     but when there is no more to cut
                                 -- Antoine de Saint-Exupéry

