| delorie.com/archives/browse.cgi | search |
| DMARC-Filter: | OpenDMARC Filter v1.4.2 delorie.com 56OFbDAd1498479 |
| Authentication-Results: | delorie.com; dmarc=pass (p=none dis=none) header.from=cygwin.com |
| Authentication-Results: | delorie.com; spf=pass smtp.mailfrom=cygwin.com |
| DKIM-Filter: | OpenDKIM Filter v2.11.0 delorie.com 56OFbDAd1498479 |
| X-Recipient: | archive-cygwin AT delorie DOT com |
| X-Original-To: | cygwin AT cygwin DOT com |
| Delivered-To: | cygwin AT cygwin DOT com |
| DKIM-Filter: | OpenDKIM Filter v2.11.0 sourceware.org 6B24B3856DF0 |
| Date: | Thu, 24 Jul 2025 17:36:38 +0200 |
| To: | cygwin AT cygwin DOT com |
| Subject: | Re: readdir() returns inaccessible name if file was created with |
| invalid UTF-8 | |
| Message-ID: | <aIJTBuRjlVNDNWD7@calimero.vinschen.de> |
| Mail-Followup-To: | cygwin AT cygwin DOT com |
| References: | <aF5y15iQ840LxLYJ AT calimero DOT vinschen DOT de> |
| <ca205dbd-907f-4552-9e5c-2cb0050f83a3 AT towo DOT net> | |
| <aH-MtwqARmDmLwoo AT calimero DOT vinschen DOT de> | |
| <91f26856-72b0-483b-8d04-bd90a27b6be0 AT towo DOT net> | |
| <4ab2c1b7-3164-4556-ba36-29814ecf5766 AT towo DOT net> | |
| <68f65634-8f4e-436b-ba6a-d30bdf882aaa AT towo DOT net> | |
| <aICVBQzWUiCYwnL2 AT calimero DOT vinschen DOT de> | |
| <11282182-60d1-4841-bf78-5ef78cf30060 AT towo DOT net> | |
| <aIILWiKsr99DOaI8 AT calimero DOT vinschen DOT de> | |
| <aec69850-227c-4c37-8aa9-6ea97dbec25b AT systematicsw DOT ab DOT ca> | |
| MIME-Version: | 1.0 |
| In-Reply-To: | <aec69850-227c-4c37-8aa9-6ea97dbec25b@systematicsw.ab.ca> |
| X-BeenThere: | cygwin AT cygwin DOT com |
| X-Mailman-Version: | 2.1.30 |
| List-Id: | General Cygwin discussions and problem reports <cygwin.cygwin.com> |
| List-Unsubscribe: | <https://cygwin.com/mailman/options/cygwin>, |
| <mailto:cygwin-request AT cygwin DOT com?subject=unsubscribe> | |
| List-Archive: | <https://cygwin.com/pipermail/cygwin/> |
| List-Post: | <mailto:cygwin AT cygwin DOT com> |
| List-Help: | <mailto:cygwin-request AT cygwin DOT com?subject=help> |
| List-Subscribe: | <https://cygwin.com/mailman/listinfo/cygwin>, |
| <mailto:cygwin-request AT cygwin DOT com?subject=subscribe> | |
| From: | Corinna Vinschen via Cygwin <cygwin AT cygwin DOT com> |
| Reply-To: | cygwin AT cygwin DOT com |
| Cc: | Corinna Vinschen <corinna-cygwin AT cygwin DOT com> |
| Errors-To: | cygwin-bounces~archive-cygwin=delorie DOT com AT cygwin DOT com |
| Sender: | "Cygwin" <cygwin-bounces~archive-cygwin=delorie DOT com AT cygwin DOT com> |
On Jul 24 09:28, Brian Inglis via Cygwin wrote: > On 2025-07-24 04:30, Corinna Vinschen via Cygwin wrote: > > Or shall simply go along with CESU-8 when converting back to multibyte > > to keep the string the same as with wcstombs? > > There are 15 * SMP as BMP characters, so many non-Western and emoji > characters will be expanded from 4 UTF-8 bytes to 6 CESU-8 bytes, and this > is not supported anywhere as a string representation, designed for internal > use only per the TR. We're only talking about invalid sequences, not using CESU-8 throughout. Corinna -- Problem reports: https://cygwin.com/problems.html FAQ: https://cygwin.com/faq/ Documentation: https://cygwin.com/docs.html Unsubscribe info: https://cygwin.com/ml/#unsubscribe-simple
| webmaster | delorie software privacy |
| Copyright © 2019 by DJ Delorie | Updated Jul 2019 |