delorie.com/archives/browse.cgi   search  
Mail Archives: cygwin/2023/04/14/16:21:39

X-Recipient: archive-cygwin AT delorie DOT com
DKIM-Filter: OpenDKIM Filter v2.11.0 sourceware.org D0FE43858C2C
DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=cygwin.com;
s=default; t=1681503651;
bh=Yr4tGanzhWOwvIh/S8QScfN3GXfqSviO/Z928UEQ9vs=;
h=Date:To:Subject:References:In-Reply-To:List-Id:List-Unsubscribe:
List-Archive:List-Post:List-Help:List-Subscribe:From:Reply-To:Cc:
From;
b=T57z3SZJR5i8s2j8vfy2RynoeuxX4I9mZVbva8Z30ggss9binj3BuHa/pQEolY4Io
wZGzZ5YjQP1Y0pH43TSVVQspHrGYEJEHtH3dxkcyDmAxLZ3dpRvwhnJfoONEz/kXo5
dPEaWI9ZLez5xZtdmuQzWoeZtj2sMa6jXIUtKQkI=
X-Original-To: cygwin AT cygwin DOT com
Delivered-To: cygwin AT cygwin DOT com
DKIM-Filter: OpenDKIM Filter v2.11.0 sourceware.org 3BB983858C54
Date: Fri, 14 Apr 2023 22:20:14 +0200
To: Brian Inglis via Cygwin <cygwin AT cygwin DOT com>
Subject: Re: Can not stat file with utf char U+F020
Message-ID: <ZDm1fmKcJF1PM7FB@calimero.vinschen.de>
Mail-Followup-To: Brian Inglis via Cygwin <cygwin AT cygwin DOT com>,
Brian Inglis <Brian DOT Inglis AT shaw DOT ca>,
Gionatan Danti <g DOT danti AT assyoma DOT it>
References: <992b3c28d7f1cfc17f7c9bb47b53f770 AT assyoma DOT it>
<ZDmiyYS+m0x4QZmh AT calimero DOT vinschen DOT de>
<f4d0bd30-731a-fb5e-43d2-a86d1af761b6 AT Shaw DOT ca>
MIME-Version: 1.0
In-Reply-To: <f4d0bd30-731a-fb5e-43d2-a86d1af761b6@Shaw.ca>
X-BeenThere: cygwin AT cygwin DOT com
X-Mailman-Version: 2.1.29
List-Id: General Cygwin discussions and problem reports <cygwin.cygwin.com>
List-Unsubscribe: <https://cygwin.com/mailman/options/cygwin>,
<mailto:cygwin-request AT cygwin DOT com?subject=unsubscribe>
List-Archive: <https://cygwin.com/pipermail/cygwin/>
List-Post: <mailto:cygwin AT cygwin DOT com>
List-Help: <mailto:cygwin-request AT cygwin DOT com?subject=help>
List-Subscribe: <https://cygwin.com/mailman/listinfo/cygwin>,
<mailto:cygwin-request AT cygwin DOT com?subject=subscribe>
From: Corinna Vinschen via Cygwin <cygwin AT cygwin DOT com>
Reply-To: cygwin AT cygwin DOT com
Cc: Corinna Vinschen <corinna-cygwin AT cygwin DOT com>,
Brian Inglis <Brian DOT Inglis AT shaw DOT ca>,
Gionatan Danti <g DOT danti AT assyoma DOT it>
Errors-To: cygwin-bounces+archive-cygwin=delorie DOT com AT cygwin DOT com
Sender: "Cygwin" <cygwin-bounces+archive-cygwin=delorie DOT com AT cygwin DOT com>
X-MIME-Autoconverted: from base64 to 8bit by delorie.com id 33EKLEWT032360

On Apr 14 13:54, Brian Inglis via Cygwin wrote:
> On 2023-04-14 13:00, Corinna Vinschen via Cygwin wrote:
> > On Apr 14 19:53, Gionatan Danti via Cygwin wrote:
> > > [1] https://sourceware.org/legacy-ml/cygwin/2009-11/msg00043.html
> 
> > While this patch would have fixed your problem, a later followup patch
> > broke your usage of U+F020 (space replacement) and, FWIW, of U+F02E
> > (dot replacement) again:
> > 	https://cygwin.com/cgit/newlib-cygwin/commit/?id=8802178fddfd
> > This was done to accomodate filesystems implementing the idiotic
> > approach to support only DOS filenames, i.e. not allowing leading or
> > trailing spaces and not allowing trailing dots. These are Netapp and
> > Novell Netware filesystems. See the last paragraph of
> > https://cygwin.com/cygwin-ug-net/using-specialnames.html#pathnames-specialchars
> > Any chance you can just rename the files?
> 
> UCSUR Under-ConScript Unicode Registry and its predecessor ConScript Unicode
> Registry CSUR
> 
> 	https://www.kreativekorp.com/ucsur/
> 
> 	http://www.evertype.com/standards/csur/
> 
> unofficially register Unicode PUA glyphs for academic, artificial,
> constructed, historical, invented, and minority language scripts, some of
> which have made it into Unicode e.g.
> 
> 	Script		CSUR		Unicode
> 	PHAISTOS DISC	U+E6D0-U+E6FF	U+101D0-U+101DF
> 	SHAVIAN		U+E700-U+E72F	U+10450-U+1047F
> 	DESERET		U+E830-U+E88F	U+10400-U+1044F
> 
> and maintain their own Unidata e.g.
> 
> 	https://www.kreativekorp.com/ucsur/UNIDATA/Blocks.txt
> 
> and some Unicode fonts have -CSUR addition files (like -Italic etc.) that
> support BMP and SMP PUA glyphs.
> 
> For Cygwin purposes:
> 
> F000−F7FF	unassigned	Reserved for hacks and corporate use
> 
> so Cygwin's special Windows file name characters mappings are clear:
> 
For completeness sake, starting with commit 8802178fddfd:

        F020    <space>
> 	F022	"
> 	F02A	*
        F02E    .
> 	F03A	:
> 	F03C	<
> 	F03E	>
> 	F03F	?
> 	F07C	|


Corinna

-- 
Problem reports:      https://cygwin.com/problems.html
FAQ:                  https://cygwin.com/faq/
Documentation:        https://cygwin.com/docs.html
Unsubscribe info:     https://cygwin.com/ml/#unsubscribe-simple

- Raw text -


  webmaster     delorie software   privacy  
  Copyright © 2019   by DJ Delorie     Updated Jul 2019