Date: Sat, 23 Nov 2024 11:44:00 +0100
Message-ID: <CALXu0UcnZnQBbJQcSsbianeKiyB2vkOmvE1weGN_-EQSU=RNrQ@mail.gmail.com>
Subject: /bin/ls -l cannot handle printable Unicode characters outside the BMP
 ...
To: cygwin@cygwin.com
From: Cedric Blancher via Cygwin <cygwin@cygwin.com>
Reply-To: Cedric Blancher <cedric.blancher@gmail.com>

Good morning!

On Cygwin, /bin/ls -l cannot display printable Unicode characters outside
the BMP (Basic Multilingual Plane); it prints them as octal escapes instead.

Example using '𝒯' (U+1D4AF):
bash -c 'printf "\U0001D4AF\n"' # MATHEMATICAL SCRIPT CAPITAL T
(yes, our mathematicians want to use THAT as a file name)
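
For readers unsure what "outside the BMP" means in practice, here is a minimal sketch in plain bash arithmetic. It checks that U+1D4AF lies above U+FFFF and computes the surrogate pair it needs in UTF-16. The variable names are illustrative only, and whether 16-bit wide characters are what trips up ls on Cygwin is an assumption, not something this report establishes.

```shell
#!/bin/bash
# Sketch: classify U+1D4AF and compute its UTF-16 surrogate pair.
cp=$((0x1D4AF))            # MATHEMATICAL SCRIPT CAPITAL T

# The BMP ends at U+FFFF; anything above it is a supplementary character.
if (( cp > 0xFFFF )); then plane=supplementary; else plane=bmp; fi

# UTF-16 surrogate pair: subtract 0x10000, then split into two 10-bit halves.
v=$(( cp - 0x10000 ))
high=$(printf '%04X' $(( 0xD800 + (v >> 10) )))
low=$(printf '%04X' $(( 0xDC00 + (v & 0x3FF) )))

echo "U+$(printf '%04X' "$cp") is $plane, UTF-16: $high $low"
```

Any code that looks at one 16-bit unit at a time sees two surrogates here rather than one printable character.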

On Linux:
LC_ALL=en_US.UTF-8 bash -c 't="$(printf "\U0001D4AF\n")" ; touch "$t" "$t$t"'
ls -la
total 8
-rw-r--r--  1 ced staden  0 Nov 23 11:29 ööööööö
-rw-r--r--  2 ced staden  4 Nov 23 11:31 𝒯
-rw-r--r--  2 ced staden  4 Nov 23 11:31 𝒯𝒯

On Cygwin:
LC_ALL=en_US.UTF-8 bash -c 't="$(printf "\U0001D4AF\n")" ; touch "$t" "$t$t"'
$ ls -la
-rw-r--r-- 1 ced staden  0 Nov 23 11:29  ööööööö
-rw-r--r-- 2 ced staden  4 Nov 23 11:31 ''$'\360\235\222\257'
-rw-r--r-- 2 ced staden  4 Nov 23 11:31 ''$'\360\235\222\257\360\235\222\257'

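ls on Cygwin prints the UTF-8 bytes of the name as octal escapes, so it
apparently treats these characters as non-printable. Looks like the Cygwin
locale has a problem with non-BMP chars.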
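
A quick way to confirm that the file names themselves are intact is to compare the octal escapes from the Cygwin listing with the UTF-8 encoding of U+1D4AF. The sketch below does this with printf, od and shell arithmetic only. The variable names are made up for illustration, and the encoding is computed by hand rather than taken from the locale, so the result does not depend on the locale under test.

```shell
#!/bin/bash
# Bytes exactly as ls printed them on Cygwin (octal escapes), dumped as hex.
shown=$(printf '\360\235\222\257' | od -An -tx1 | tr -d ' \n')

# Expected 4-byte UTF-8 encoding of U+1D4AF, computed by hand:
# 11110xxx 10xxxxxx 10xxxxxx 10xxxxxx
cp=$((0x1D4AF))
expected=$(printf '%02x%02x%02x%02x' \
    $(( 0xF0 | (cp >> 18) )) \
    $(( 0x80 | ((cp >> 12) & 0x3F) )) \
    $(( 0x80 | ((cp >> 6) & 0x3F) )) \
    $(( 0x80 | (cp & 0x3F) )))

if [ "$shown" = "$expected" ]; then verdict=match; else verdict=mismatch; fi
echo "ls showed $shown, UTF-8 of U+1D4AF is $expected: $verdict"
```

If the two values match, the name on disk is correct and only the decision about what counts as printable differs between the two systems.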

Ced
-- 
Cedric Blancher <cedric.blancher@gmail.com>
[https://plus.google.com/u/0/+CedricBlancher/]
Institute Pasteur
