delorie.com/archives/browse.cgi   search  
Mail Archives: cygwin/2024/11/23/09:02:20

DMARC-Filter: OpenDMARC Filter v1.4.2 delorie.com 4ANE2JYW820376
Authentication-Results: delorie.com; dmarc=pass (p=none dis=none) header.from=cygwin.com
Authentication-Results: delorie.com; spf=pass smtp.mailfrom=cygwin.com
DKIM-Filter: OpenDKIM Filter v2.11.0 delorie.com 4ANE2JYW820376
Authentication-Results: delorie.com;
dkim=pass (1024-bit key, unprotected) header.d=cygwin.com header.i=@cygwin.com header.a=rsa-sha256 header.s=default header.b=juPxsj3Q
X-Recipient: archive-cygwin AT delorie DOT com
DKIM-Filter: OpenDKIM Filter v2.11.0 sourceware.org 995F33858C42
DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=cygwin.com;
s=default; t=1732370538;
bh=ShacZw44Lgu9qK9VQVmjmvNQMe3fDRXz7XdL3RbYR3Q=;
h=Subject:To:References:Date:In-Reply-To:List-Id:List-Unsubscribe:
List-Archive:List-Post:List-Help:List-Subscribe:From:Reply-To:Cc:
From;
b=juPxsj3QylCAfNr1BKqNnjEfWpYpopi2+QnMlEdo5GjJ/WMa6vYKkuf39mWAPBwCF
mNZmJ52dsdMv6lMg6YhJf7+RPNPeCwB14jkGQ+/VB92C7Yt71BV00zNthLswtXF23s
dipmJ9yJqxg7G2TrvX2PQyMqgd1+O+l415erlZmM=
X-Original-To: cygwin AT cygwin DOT com
Delivered-To: cygwin AT cygwin DOT com
DMARC-Filter: OpenDMARC Filter v1.4.2 sourceware.org CFE7C3858D37
ARC-Filter: OpenARC Filter v1.0.0 sourceware.org CFE7C3858D37
ARC-Seal: i=1; a=rsa-sha256; d=sourceware.org; s=key; t=1732370512; cv=none;
b=o0SfiWKcnuQTDnCEZDeRRwYjl0f/ddpHR8hrfCD7fZjZBzskCPKxba0QFnV+MPA305anZBW+uWWVNrFNewg4wDemklZtj/mO8QnWKUgFvvjDYTKRQrPAiFY/zljnWaeOmw6/2f6zkc69wF4udJC9Dkbx8btwEVGQC3hKjp5BXH0=
ARC-Message-Signature: i=1; a=rsa-sha256; d=sourceware.org; s=key;
t=1732370512; c=relaxed/simple;
bh=MdIC87FGVXYDXvvpgn7tqpjnbOitBEr84pfjmuXnGI4=;
h=Subject:To:From:Message-ID:Date:MIME-Version;
b=Dtf22ZsGOjF+e4uHb3uL8ikmqkL7bensTbi6Fi0LBqXzFf+wTKOO/RjlFrR/vwgLz/oQIyx6eYnx6tqvMGrlRZQu2izSVteywCmHABy1UCSTpxaw1Jvq5I6ByOajaHayZcUH09RY11ljragrPc6+514G+Ll+fs1Yg5MrUYXEK8o=
ARC-Authentication-Results: i=1; server2.sourceware.org
DKIM-Filter: OpenDKIM Filter v2.11.0 sourceware.org CFE7C3858D37
Subject: Re: /bin/ls -l cannot handle printable Unicode characters outside the
BMP ...
To: cygwin AT cygwin DOT com
References: <CALXu0UcnZnQBbJQcSsbianeKiyB2vkOmvE1weGN_-EQSU=RNrQ AT mail DOT gmail DOT com>
<CALXu0UfYmRP5yMG4J6znd4svqq1kbgEkpvHj-CWjB6APE8C3uw AT mail DOT gmail DOT com>
Message-ID: <7eaa3a6a-7997-edef-e30b-1d50fdc39330@t-online.de>
Date: Sat, 23 Nov 2024 15:01:47 +0100
User-Agent: Mozilla/5.0 (Windows NT 10.0; Win64; x64; rv:91.0) Gecko/20100101
SeaMonkey/2.53.19
MIME-Version: 1.0
In-Reply-To: <CALXu0UfYmRP5yMG4J6znd4svqq1kbgEkpvHj-CWjB6APE8C3uw@mail.gmail.com>
X-TOI-EXPURGATEID: 150726::1732370507-5EFEB712-50D29658/0/0 CLEAN NORMAL
X-TOI-MSGID: c6b8a97a-a071-45ce-8ad1-834f197ec0af
X-BeenThere: cygwin AT cygwin DOT com
X-Mailman-Version: 2.1.30
List-Id: General Cygwin discussions and problem reports <cygwin.cygwin.com>
List-Unsubscribe: <https://cygwin.com/mailman/options/cygwin>,
<mailto:cygwin-request AT cygwin DOT com?subject=unsubscribe>
List-Archive: <https://cygwin.com/pipermail/cygwin/>
List-Post: <mailto:cygwin AT cygwin DOT com>
List-Help: <mailto:cygwin-request AT cygwin DOT com?subject=help>
List-Subscribe: <https://cygwin.com/mailman/listinfo/cygwin>,
<mailto:cygwin-request AT cygwin DOT com?subject=subscribe>
From: Christian Franke via Cygwin <cygwin AT cygwin DOT com>
Reply-To: cygwin AT cygwin DOT com
Cc: Christian Franke <Christian DOT Franke AT t-online DOT de>
Errors-To: cygwin-bounces~archive-cygwin=delorie DOT com AT cygwin DOT com
Sender: "Cygwin" <cygwin-bounces~archive-cygwin=delorie DOT com AT cygwin DOT com>
X-MIME-Autoconverted: from base64 to 8bit by delorie.com id 4ANE2JYW820376

Cedric Blancher via Cygwin wrote:
> On Sat, 23 Nov 2024 at 11:44, Cedric Blancher <cedric DOT blancher AT gmail DOT com> wrote:
>> Good morning!
>>
>> /bin/ls -l cannot handle printable Unicode characters outside the BMP
>>
>> Example using '𝒯'
>> bash -c 'printf "\U0001D4AF\n"' # MATHEMATICAL SCRIPT CAPITAL T
>> (yes, our mathematicians want to use THAT as file name)
>>
>> On Linux:
>> LC_ALL=en_US.UTF-8 bash -c 't="$(printf "\U0001D4AF\n")" ; touch "$t" "$t$t"'
>> ls -la
>> total 8
>> -rw-r--r--  1 ced staden  0 Nov 23 11:29 ΓΆΓΆΓΆΓΆΓΆΓΆΓΆ
>> -rw-r--r--  2 ced staden  4 Nov 23 11:31 𝒯
>> -rw-r--r--  2 ced staden  4 Nov 23 11:31𝒯𝒯
>>
>> On Cygwin:
>> LC_ALL=en_US.UTF-8 bash -c 't="$(printf "\U0001D4AF\n")" ; touch "$t" "$t$t"'
>> $ ls -la
>> -rw-r--r-- 1 ced staden  0 Nov 23 11:29  ΓΆΓΆΓΆΓΆΓΆΓΆΓΆ
>> -rw-r--r-- 2 ced staden  4 Nov 23 11:31 ''$'\360\235\222\257'
>> -rw-r--r-- 2 ced staden  4 Nov 23 11:31 ''$'\360\235\222\257\360\235\222\257'
>>
>> Looks like the Cygwin locale has a problem with non-BMP chars.
> find(1) is even worse:
> $ find .
> .
> ./ΓΆΓΆΓΆΓΆΓΆΓΆΓΆ
> ./????
> ./x??x
>
> The Microsoft Explorer GUI shows the file names correctly, so IMO this
> is not a Windows or Win32 API problem.

Slightly different filename problem which may be related or not:
https://sourceware.org/pipermail/cygwin/2024-September/256451.html

-- 
Regards,
Christian


-- 
Problem reports:      https://cygwin.com/problems.html
FAQ:                  https://cygwin.com/faq/
Documentation:        https://cygwin.com/docs.html
Unsubscribe info:     https://cygwin.com/ml/#unsubscribe-simple

- Raw text -


  webmaster     delorie software   privacy  
  Copyright © 2019   by DJ Delorie     Updated Jul 2019