X-Recipient: archive-cygwin AT delorie DOT com DKIM-Filter: OpenDKIM Filter v2.11.0 sourceware.org D0FE43858C2C DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=cygwin.com; s=default; t=1681503651; bh=Yr4tGanzhWOwvIh/S8QScfN3GXfqSviO/Z928UEQ9vs=; h=Date:To:Subject:References:In-Reply-To:List-Id:List-Unsubscribe: List-Archive:List-Post:List-Help:List-Subscribe:From:Reply-To:Cc: From; b=T57z3SZJR5i8s2j8vfy2RynoeuxX4I9mZVbva8Z30ggss9binj3BuHa/pQEolY4Io wZGzZ5YjQP1Y0pH43TSVVQspHrGYEJEHtH3dxkcyDmAxLZ3dpRvwhnJfoONEz/kXo5 dPEaWI9ZLez5xZtdmuQzWoeZtj2sMa6jXIUtKQkI= X-Original-To: cygwin AT cygwin DOT com Delivered-To: cygwin AT cygwin DOT com DKIM-Filter: OpenDKIM Filter v2.11.0 sourceware.org 3BB983858C54 Date: Fri, 14 Apr 2023 22:20:14 +0200 To: Brian Inglis via Cygwin Subject: Re: Can not stat file with utf char U+F020 Message-ID: Mail-Followup-To: Brian Inglis via Cygwin , Brian Inglis , Gionatan Danti References: <992b3c28d7f1cfc17f7c9bb47b53f770 AT assyoma DOT it> MIME-Version: 1.0 Content-Disposition: inline In-Reply-To: X-BeenThere: cygwin AT cygwin DOT com X-Mailman-Version: 2.1.29 Precedence: list List-Id: General Cygwin discussions and problem reports List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , From: Corinna Vinschen via Cygwin Reply-To: cygwin AT cygwin DOT com Cc: Corinna Vinschen , Brian Inglis , Gionatan Danti Content-Type: text/plain; charset="utf-8" Errors-To: cygwin-bounces+archive-cygwin=delorie DOT com AT cygwin DOT com Sender: "Cygwin" Content-Transfer-Encoding: 8bit X-MIME-Autoconverted: from base64 to 8bit by delorie.com id 33EKLEWT032360 On Apr 14 13:54, Brian Inglis via Cygwin wrote: > On 2023-04-14 13:00, Corinna Vinschen via Cygwin wrote: > > On Apr 14 19:53, Gionatan Danti via Cygwin wrote: > > > [1] https://sourceware.org/legacy-ml/cygwin/2009-11/msg00043.html > > > While this patch would have fixed your problem, a later followup patch > > broke your usage of U+F020 (space replacement) and, FWIW, of U+F02E > > (dot replacement) again: > > https://cygwin.com/cgit/newlib-cygwin/commit/?id=8802178fddfd > > This was done to accomodate filesystems implementing the idiotic > > approach to support only DOS filenames, i.e. not allowing leading or > > trailing spaces and not allowing trailing dots. These are Netapp and > > Novell Netware filesystems. See the last paragraph of > > https://cygwin.com/cygwin-ug-net/using-specialnames.html#pathnames-specialchars > > Any chance you can just rename the files? > > UCSUR Under-ConScript Unicode Registry and its predecessor ConScript Unicode > Registry CSUR > > https://www.kreativekorp.com/ucsur/ > > http://www.evertype.com/standards/csur/ > > unofficially register Unicode PUA glyphs for academic, artificial, > constructed, historical, invented, and minority language scripts, some of > which have made it into Unicode e.g. > > Script CSUR Unicode > PHAISTOS DISC U+E6D0-U+E6FF U+101D0-U+101DF > SHAVIAN U+E700-U+E72F U+10450-U+1047F > DESERET U+E830-U+E88F U+10400-U+1044F > > and maintain their own Unidata e.g. > > https://www.kreativekorp.com/ucsur/UNIDATA/Blocks.txt > > and some Unicode fonts have -CSUR addition files (like -Italic etc.) that > support BMP and SMP PUA glyphs. > > For Cygwin purposes: > > F000−F7FF unassigned Reserved for hacks and corporate use > > so Cygwin's special Windows file name characters mappings are clear: > For completeness sake, starting with commit 8802178fddfd: F020 > F022 " > F02A * F02E . > F03A : > F03C < > F03E > > F03F ? > F07C | Corinna -- Problem reports: https://cygwin.com/problems.html FAQ: https://cygwin.com/faq/ Documentation: https://cygwin.com/docs.html Unsubscribe info: https://cygwin.com/ml/#unsubscribe-simple