DMARC-Filter: OpenDMARC Filter v1.4.2 delorie.com 4AOAUtbr1186468
Authentication-Results: delorie.com; dmarc=pass (p=none dis=none) header.from=cygwin.com
Authentication-Results: delorie.com; spf=pass smtp.mailfrom=cygwin.com
DKIM-Filter: OpenDKIM Filter v2.11.0 delorie.com 4AOAUtbr1186468
Authentication-Results: delorie.com;
	dkim=pass (1024-bit key, unprotected) header.d=cygwin.com header.i=@cygwin.com header.a=rsa-sha256 header.s=default header.b=mJw1bsWi
X-Recipient: archive-cygwin@delorie.com
DKIM-Filter: OpenDKIM Filter v2.11.0 sourceware.org 3154E385843D
DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=cygwin.com;
	s=default; t=1732444250;
	bh=Cwm4ukXua6Qs9+pqwZYlOxZlv4Mu+nlwKenPUoZpx30=;
	h=Date:Subject:To:References:In-Reply-To:List-Id:List-Unsubscribe:
	 List-Archive:List-Post:List-Help:List-Subscribe:From:Reply-To:
	 From;
	b=mJw1bsWiBsNSZEP6riwk66sWxCgLmjkUlLw6GiHB9eMAWiYTtEwH7wXS71pOLKcty
	 lQBLlzZhA5Dbr4XHMWNUTqanwD0SBehksRVA92oe/xrUZSF4pss3XlDc5alpRLBwvt
	 StFviyNLI/U87qjWY4PoPHBnOgJb+JBY35p1PXh8=
X-Original-To: cygwin@cygwin.com
Delivered-To: cygwin@cygwin.com
DMARC-Filter: OpenDMARC Filter v1.4.2 sourceware.org D07483858D29
ARC-Filter: OpenARC Filter v1.0.0 sourceware.org D07483858D29
ARC-Seal: i=1; a=rsa-sha256; d=sourceware.org; s=key; t=1732444224; cv=none;
 b=fd8ra1N7FKVWdDh9wn8Kt0o1uBPvGse3KKwYbX3MAgA+Ahq/mYP3yqVH99UeMZID32OzINtdHboOwQ0JAzUQ+vQcr9BWMbaG8Qf0od5gvDgm3HVTDHZ2H3yQ3dN+zGorX6n3EzQ+PsAJHGGhdp/7yweHL7onFz6xQJjhOtQqvaU=
ARC-Message-Signature: i=1; a=rsa-sha256; d=sourceware.org; s=key;
 t=1732444224; c=relaxed/simple;
 bh=5VQ2yAhBr+4KjkxJCeJ++XLR86hG2+ibjm7C8oeAy0k=;
 h=DKIM-Signature:Message-ID:Date:MIME-Version:Subject:To:From;
 b=iNi9Czm9Clt1P13O+27JACIM0VxteLqTQzpBfg6SaNiWsW3KMuYEQC7TnbCktXqxKFg9XfGQJtYUWMmIS5cMqSs7dIIS/6LJe1fDaHhfkpcC4MM3+8CM4wCfAMmFShiBMqMtRsdYmoOnLAojtBX2vvF464cWK2l6l7CDmyQaU7s=
ARC-Authentication-Results: i=1; server2.sourceware.org
DKIM-Filter: OpenDKIM Filter v2.11.0 sourceware.org D07483858D29
X-UI-Sender-Class: 55c96926-9e95-11ee-ae09-1f7a4046a0f6
Message-ID: <54ea0698-23d3-4bc7-8197-21624bb56c4e@towo.net>
Date: Sun, 24 Nov 2024 11:30:18 +0100
MIME-Version: 1.0
User-Agent: Mozilla Thunderbird
Subject: Re: /bin/ls -l cannot handle printable Unicode characters outside the
 BMP ...
To: cygwin@cygwin.com
References: <CALXu0UcnZnQBbJQcSsbianeKiyB2vkOmvE1weGN_-EQSU=RNrQ@mail.gmail.com>
 <CALXu0UfYmRP5yMG4J6znd4svqq1kbgEkpvHj-CWjB6APE8C3uw@mail.gmail.com>
 <7eaa3a6a-7997-edef-e30b-1d50fdc39330@t-online.de>
Autocrypt: addr=towo@towo.net; keydata=
 xsDNBGNaf3QBDACVevqudcTSevLThXKQPU1QpaDxtGuYjtwmr7i9wXxVGih4Y4oxOJN4PYlu
 KBX9IVAI4651dA+xYtXuyIkWOPZWyyzkGKavQOn3Q7dk09oj7bh2IwOndpxXXde337D408EQ
 bQEGbMHr9lOWhSAideowzgCeFIvGTf2AovbPh97HpexJn1/HCRiRAhTNlrkS1DByUgCAeEMK
 fEr6aGM/Ou29MT+eTnQwOIZTnl9Z9LxM2FtqqMH3MycC7I2OoW3XXhuL8BPQdyJUjWa0/J11
 Oo5jFkRXtWenIns6jGn18oW72jnDmo9jXwwS+iZWAV6Y51nhD7jSC+3xs9ORmPCdtHUSpTr1
 zh67UueUJ3DUUNVuA25Hn/9EJMJ2L60BGUEr88NEB6pcZhmcwdkurAQeYT6t+frzBz2ctsoN
 BoxP/Xc02yd+z7hXWRRMrJWh9WHlQHA3Z4FfmyNhyPhs3MgKTJ1E9QfzGquigAmF3/k/Dc1m
 7cSOKhGYhpEJdSpdXccJFKkAEQEAAc0cVGhvbWFzIFdvbGZmIDx0b3dvQHRvd28ubmV0PsLB
 BwQTAQgAMRYhBHUiRKsHn5d8BpWdP8bz0e72Bp0CBQJjWn93AhsDBAsJCAcFFQgJCgsFFgID
 AQAACgkQxvPR7vYGnQKSMAv8Di+8MXB2mcfsemRdShfLLKcLOv+d0CXAtPVaY3XKxbKpRvC9
 +AAT5wIHYjQft77/b2y87vGIh+nQ5hKLtNtQPSDtqG/Igkb5jAXpLi28fSUzgM96DvARmwve
 5wSnAU3prxH+Y63YpOpslEcGMRoEtYCDy1ANMYPcEZT/YvDd4CplyyEai4VYrw3/LsESDYlY
 GK6uMQzZ1jl2cNOUFu6BwLUeZIcwaqGto8n4R4nbf4jxUEpa21bWBPqE+Jf49uipjPr/iJ72
 5HbdWuuCfyTTJEJjfNEBigWP2RXM9iNDcO61V3aEjh76tThfBK2MMlLWfZkQaQziu24x8R4B
 I0efJYWBX2Sv2qnsH/EWj7FUIZjRqGG7LnWHLShfG6yjSOTOWYi8BbsvoftpaLWgZX28aGX4
 uzuSZ5L0caXh/pr/gSgqoH/YbuFIgqtQH4seOBgTybd22Vpe78rnc+8450pN8qwchHAZaJka
 UxS0SpYxXzXmHUKILA4C43s0U/z2Mez9zsDNBGNaf3cBDADeJ7paMrb6f1+k8wM7tyk0/Ded
 KX/pOejt/D20Ceerw2iL/4tUmBL+A3ic2yjiSFUSsEfHwgCVwKrn4MwZtkesdiphm2lk6xWc
 k1ENCQy44QwQT6UZ/mHWYWcj5LS6ua183x1zdn9iF3lv150nm/ssw56D7USz/ap1Vh0lf5te
 D+CIheGLocVDqxWiu7rHP8jKRWFgq/+OU6HKX8p2Yv1oYsykh9qF2bFzawLDS+S1VbfRicfD
 G0RtceL/BAf7b6UE5u9TGdfrFEa2TKZeS/FS/ViKUfwsXQIki1sWt2FQENbuDY28vxyR46ZZ
 0gixDCFUoBw5pkmOGVQa+1RQYrRqlN4X0CAgp7mFVeEHl5NTgiL1bemkQVmHOUDG+CzNg+Lk
 UGoedAtT672l3JjrnSs4j8zNshpgV2OfAhAC+V9XvqCjMnxzVfXkVlbuWpPfUWQeFclLGg8P
 agpQUE0Ux+VV4DoeQCxYEnRCf/n7n+IRfILj5+2l6Zw4M7zSu6ii0tUAEQEAAcLA9gQYAQgA
 IBYhBHUiRKsHn5d8BpWdP8bz0e72Bp0CBQJjWn97AhsMAAoJEMbz0e72Bp0CQr4L/REdT0SF
 mbapnZIe92THCdtAUgwEv8VdNiNFBJelz8P/fuXuNPtisYvQQD4e64zpWe2UC4Cxo9DUk/pW
 6Qci1xaXRKEiSPjHdSGGVB1PFIcqiS75GCf/ga/Dnfsy0Y4Uh6OGTQnkvZLBCe3vvcVLDQ7F
 PuV79zA9/eOeOW6aGoO6bq/wH+z96f9LyTITkQDy07fm6JYTGuzAoJE2AEboU1mgbtlx+tAa
 QFkpAQkp2g1Vhc3A7k4vntlHOrjMC+uVFh7QTGFfIlLRF6izUjSe6EZ06LErzlIiE05RP3yF
 FSRWidW0wze26peYlxYVgH1+T9wMTW2oiTBybfAMHBAxUP7Gr1WUo/oJEr0srWhatz8AwydP
 y7NwFbdpYn0NcFBaIlLW/JL11Eovwlivow+oGpzGFuuzSuflp2q9s2JWtn4EhW0kEs93D0LP
 iuJWvRaCZ6aD3uF3FMW8wyVWZYsLrzune2jH8w/uKMprDEOGOm+BcyhEFedTyY1ygbZKl+0G kQ==
In-Reply-To: <7eaa3a6a-7997-edef-e30b-1d50fdc39330@t-online.de>
X-Provags-ID: V03:K1:pC/oBgzy9BogwFfGQpy//1voAF5KeTsWjdQOrDjIOZpZH9z7hu6
 1fgAjc3AkVXXoydubtSnjyT0sBFnG2MMy5/RqJrVW7q8JqM74m0qjJnYqzaXHKgnihbrwiY
 voTXElJrY5rG6vGYSlvz64WgW10bRREriDtcxZ8CGa2kT5n/zCaTRSFAXRNyB3xp5fspLIn
 oPZH8yaYgMhHz2Pkj5Wxg==
UI-OutboundReport: notjunk:1;M01:P0:ETgt3zbEzfo=;P6P1IL0pvkhkm58GT7308aKl120
 fXZa9ruGA2HGWoo4Kd4/xyqWnJflgwiUcvolU18Jt+TlvhEkdsx8cj09EKDBlz3sHYz1UU8nx
 vJbKSo2+f1fLKewKQBihBqUzHyM0olXWJvBsXXzjLu7FhDoY0Trb56+lr4y7zchVXMEAndWBO
 ycpr0RxJIrOBXWlv6FlMGVNu4zOUa62Isk9B/e7wf0sBDdWkPcyOmXNcpedTf/oRKgFNz2J3S
 C8LIz6vDmrOO1JMGGEoLSxjSrp5Eodq8Z34TkCKDzxLK0MOqYMnjyLT6M2DZeGcLDwWfp+BV1
 +0IOhkdfDi0nt0U8IFEF8aH6ekiCNoL0EZn7Uk46uhnFDD2Wwbh1bkREIbgD7vj/d9GOgPQRZ
 qN9cZZZuXsPDMy5Mkiv/jh3GrMafbEx0vZ3yqAqPoBpHbHUCaAEjd2VdO+VQro9MYnGeG2JXR
 VeL5+3T8pjjCDEZu0dnbvZCWYChfS3lxHXOa/JscpK1uyVMoZmsV66ADYaSRsRwx8SSnJDP5u
 lA9iLMBb/0mPWtfA7Ww/GBo0KrDenmBLqT+74tENrCzgVjqffkpZcuz7jVsk6Ez2zfbSURRzc
 /Xg0pr9TeUfJO8h4oHsdgUzx2xgRXii3iVFDQ5KH29b7WTL5hRmUsTPKhGhoF0xD73z4iJ53H
 WUtrfb43Ypsm5vx9cs2QnaVOj/D1E3AowddMngvP69X7GeJToXJW6fN7vLzDpP6WjV89uDmRK
 HjLq5CNc1WqxsfxZt1ruUUwIwqBTJ7DvhC9zzmLxB3/oiVEzrdSarQoAN+anA5N2sifwagNel
 OQ4SMXPByLJVg9eU081BIqI2Sgndy1sOJPYK1LLjxbXcsaaetjsgtjOkXglYP5xO8o
X-BeenThere: cygwin@cygwin.com
X-Mailman-Version: 2.1.30
Precedence: list
List-Id: General Cygwin discussions and problem reports <cygwin.cygwin.com>
List-Unsubscribe: <https://cygwin.com/mailman/options/cygwin>,
 <mailto:cygwin-request@cygwin.com?subject=unsubscribe>
List-Archive: <https://cygwin.com/pipermail/cygwin/>
List-Post: <mailto:cygwin@cygwin.com>
List-Help: <mailto:cygwin-request@cygwin.com?subject=help>
List-Subscribe: <https://cygwin.com/mailman/listinfo/cygwin>,
 <mailto:cygwin-request@cygwin.com?subject=subscribe>
From: Thomas Wolff via Cygwin <cygwin@cygwin.com>
Reply-To: Thomas Wolff <towo@towo.net>
Content-Type: text/plain; charset="utf-8"; Format="flowed"
Errors-To: cygwin-bounces~archive-cygwin=delorie.com@cygwin.com
Sender: "Cygwin" <cygwin-bounces~archive-cygwin=delorie.com@cygwin.com>
Content-Transfer-Encoding: 8bit
X-MIME-Autoconverted: from base64 to 8bit by delorie.com id 4AOAUtbr1186468


Am 23.11.2024 um 15:01 schrieb Christian Franke via Cygwin:
> Cedric Blancher via Cygwin wrote:
>> On Sat, 23 Nov 2024 at 11:44, Cedric Blancher
>> <cedric.blancher@gmail.com> wrote:
>>> Good morning!
>>>
>>> /bin/ls -l cannot handle printable Unicode characters outside the BMP
>>>
>>> Example using '𝒯'
>>> bash -c 'printf "\U0001D4AF\n"' # MATHEMATICAL SCRIPT CAPITAL T
>>> (yes, our mathematicians want to use THAT as file name)
>>>
>>> On Linux:
>>> LC_ALL=en_US.UTF-8 bash -c 't="$(printf "\U0001D4AF\n")" ; touch
>>> "$t" "$t$t"'
>>> ls -la
>>> total 8
>>> -rw-r--r--  1 ced staden  0 Nov 23 11:29 ööööööö
>>> -rw-r--r--  2 ced staden  4 Nov 23 11:31 𝒯
>>> -rw-r--r--  2 ced staden  4 Nov 23 11:31𝒯𝒯
>>>
>>> On Cygwin:
>>> LC_ALL=en_US.UTF-8 bash -c 't="$(printf "\U0001D4AF\n")" ; touch
>>> "$t" "$t$t"'
>>> $ ls -la
>>> -rw-r--r-- 1 ced staden  0 Nov 23 11:29  ööööööö
>>> -rw-r--r-- 2 ced staden  4 Nov 23 11:31 ''$'\360\235\222\257'
>>> -rw-r--r-- 2 ced staden  4 Nov 23 11:31
>>> ''$'\360\235\222\257\360\235\222\257'
>>>
>>> Looks like the Cygwin locale has a problem with non-BMP chars.
>> find(1) is even worse:
>> $ find .
>> .
>> ./ööööööö
>> ./????
>> ./x??x
Workaround: ls ... | cat ; find ... | cat

>>
>> The Microsoft Explorer GUI shows the file names correctly, so IMO this
>> is not a Windows or Win32 API problem.
>
> Slightly different filename problem which may be related or not:
> https://sourceware.org/pipermail/cygwin/2024-September/256451.html
>


-- 
Problem reports:      https://cygwin.com/problems.html
FAQ:                  https://cygwin.com/faq/
Documentation:        https://cygwin.com/docs.html
Unsubscribe info:     https://cygwin.com/ml/#unsubscribe-simple

