DMARC-Filter: OpenDMARC Filter v1.4.2 delorie.com 4ANAja7n762138 Authentication-Results: delorie.com; dmarc=pass (p=none dis=none) header.from=cygwin.com Authentication-Results: delorie.com; spf=pass smtp.mailfrom=cygwin.com DKIM-Filter: OpenDKIM Filter v2.11.0 delorie.com 4ANAja7n762138 Authentication-Results: delorie.com; dkim=pass (1024-bit key, unprotected) header.d=cygwin.com header.i=@cygwin.com header.a=rsa-sha256 header.s=default header.b=S5PJYdns X-Recipient: archive-cygwin AT delorie DOT com DKIM-Filter: OpenDKIM Filter v2.11.0 sourceware.org 8780D3857C6E DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=cygwin.com; s=default; t=1732358734; bh=C6wMnewqJYGwiPH42ERVyZ0HSO4+HQNwP3MgQOD+6ew=; h=Date:Subject:To:List-Id:List-Unsubscribe:List-Archive:List-Post: List-Help:List-Subscribe:From:Reply-To:From; b=S5PJYdnsESGHK3I8yHqtBE6f5bgAvoCCSVMGeiuogXE1xQbJCQgxOHE1a5ji+RDGA 3M2qAjBItaKjKc/i2+bhVvZFJ2rg6BC0K+bDRnKgJg5mKe/Nec9eobOBqs+62UZnzJ +mXel/4TNtpfHKgC8yyg5a7QW5hGPwUuJ3ueOSaQ= X-Original-To: cygwin AT cygwin DOT com Delivered-To: cygwin AT cygwin DOT com DMARC-Filter: OpenDMARC Filter v1.4.2 sourceware.org 236CE3858D37 ARC-Filter: OpenARC Filter v1.0.0 sourceware.org 236CE3858D37 ARC-Seal: i=1; a=rsa-sha256; d=sourceware.org; s=key; t=1732358679; cv=none; b=QFmt4qoVtEft3BIBQc4tIEicTbo+ErZNozjXEIq00fq6FHXlElJwjkKOu/LlMXoezpF5h4ig05TOUxtnz9s1pWRCw7D1YEOdTiThD5CEX9FOKhDFsPuIL7t6kesmW7NNSrxEjWf6cle1vUaQ9/WPwjz2zQ3huJr3Ix8hH8j76bQ= ARC-Message-Signature: i=1; a=rsa-sha256; d=sourceware.org; s=key; t=1732358679; c=relaxed/simple; bh=4Q4cgo3iC3KN08brt1wEytn9a4jYJfRSc4Okh67OcMA=; h=DKIM-Signature:MIME-Version:From:Date:Message-ID:Subject:To; b=H/R72kGBqDrHzspWZhPuI/xy+bE4C5x64nTI3N4tDwWP1TPdU1G6QV/Rwxz0OU6aPfBlhRLtBUhEOSWD0E1XZ0Fiq9YmN+kmfkblSO4hRYHRHeQmNxEyPw3kWW/VBqttYc4Y0fFkwCwwdfWObECLWchsIlXNn59FVluRtCL50X4= ARC-Authentication-Results: i=1; server2.sourceware.org DKIM-Filter: OpenDKIM Filter v2.11.0 sourceware.org 236CE3858D37 X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20230601; t=1732358676; x=1732963476; h=content-transfer-encoding:to:subject:message-id:date:from :mime-version:x-gm-message-state:from:to:cc:subject:date:message-id :reply-to; bh=jrM6vJEPuCrcfUOt7eGGSeKECoHtXrw0/BIfAUScVs8=; b=TGxSk17TPmRt4vUkU8tGWJIV/jYeIqvPKXY1EHQmp2v9HbVRo7u/avvS9ydfRBfaSE +X89zXHrvApsojkWCmQOnBgBRm0TNpLtpewsWEnMzpJ1cXcRh/26UI6vAaPHjoMcHpLU ibAGHVYSv7xKp4v+cc1BdoCwU6q7eRTSzSS7vzf1oQjIwQWLtqBPPDbXEYG3Q7tjRguX bld2JcidzKGQO3Cut26llH2ppDYnbfrAvc+ESQaD+HfuygJcRUfM62skgCpwgjp2CKb9 aK1stP5/LDMwhLOSrvIC0l735fKhvLtXfCQKCSs0VebodRJnCpNtjW30kx9GM5A1IN7+ ZIWg== X-Gm-Message-State: AOJu0YzMGbFsQFUl5Zhat4Q1u4k0uXlDYS9crOYh7wTm1Vi+RI1j/0R/ YC6uXjq1G14yvLjT677+WEUnXF/B4lBBkyJ7kS+98RmfygIE4mJKThR6YnFJP5rRAxhV6sEZ3NH VI/DNx+W64R5LYcIugKbvUSLCAItdxHgy X-Gm-Gg: ASbGnctyqKAPyXTI2/KYT5DoJOaBbqA8FOYjkg6N2s/grGv6cFBHabeVd6i7n1tIvko pglxhgjSKdQfaP3+56w19bwnzhXlZYlo= X-Google-Smtp-Source: AGHT+IHKBbkFIz3uKDR4miv6mio8RnC08lwBLli/pPPToJCIomJ0A6yvSidI3xRh/y9BKU0JmYqu/Su9FMwS+uPRZRY= X-Received: by 2002:a05:6402:3888:b0:5cf:be01:d7f4 with SMTP id 4fb4d7f45d1cf-5d020629f37mr4837385a12.19.1732358676056; Sat, 23 Nov 2024 02:44:36 -0800 (PST) MIME-Version: 1.0 Date: Sat, 23 Nov 2024 11:44:00 +0100 Message-ID: Subject: /bin/ls -l cannot handle printable Unicode characters outside the BMP ... To: cygwin AT cygwin DOT com X-BeenThere: cygwin AT cygwin DOT com X-Mailman-Version: 2.1.30 List-Id: General Cygwin discussions and problem reports List-Archive: List-Post: List-Help: List-Subscribe: , From: Cedric Blancher via Cygwin Reply-To: Cedric Blancher Content-Type: text/plain; charset="utf-8" Sender: "Cygwin" Content-Transfer-Encoding: 8bit X-MIME-Autoconverted: from base64 to 8bit by delorie.com id 4ANAja7n762138 Good morning! /bin/ls -l cannot handle printable Unicode characters outside the BMP Example using '𝒯' bash -c 'printf "\U0001D4AF\n"' # MATHEMATICAL SCRIPT CAPITAL T (yes, our mathematicians want to use THAT as file name) On Linux: LC_ALL=en_US.UTF-8 bash -c 't="$(printf "\U0001D4AF\n")" ; touch "$t" "$t$t"' ls -la total 8 -rw-r--r-- 1 ced staden 0 Nov 23 11:29 ΓΆΓΆΓΆΓΆΓΆΓΆΓΆ -rw-r--r-- 2 ced staden 4 Nov 23 11:31 𝒯 -rw-r--r-- 2 ced staden 4 Nov 23 11:31𝒯𝒯 On Cygwin: LC_ALL=en_US.UTF-8 bash -c 't="$(printf "\U0001D4AF\n")" ; touch "$t" "$t$t"' $ ls -la -rw-r--r-- 1 ced staden 0 Nov 23 11:29 ΓΆΓΆΓΆΓΆΓΆΓΆΓΆ -rw-r--r-- 2 ced staden 4 Nov 23 11:31 ''$'\360\235\222\257' -rw-r--r-- 2 ced staden 4 Nov 23 11:31 ''$'\360\235\222\257\360\235\222\257' Looks like the Cygwin locale has a problem with non-BMP chars. Ced -- Cedric Blancher [https://plus.google.com/u/0/+CedricBlancher/] Institute Pasteur -- Problem reports: https://cygwin.com/problems.html FAQ: https://cygwin.com/faq/ Documentation: https://cygwin.com/docs.html Unsubscribe info: https://cygwin.com/ml/#unsubscribe-simple