delorie.com/archives/browse.cgi   search  
Mail Archives: cygwin/2006/11/14/17:38:04

X-Spam-Check-By: sourceware.org
Message-ID: <455A453A.8060409@tlinx.org>
Date: Tue, 14 Nov 2006 14:37:46 -0800
From: Linda Walsh <cygwin AT tlinx DOT org>
User-Agent: Thunderbird 1.5.0.8 (Windows/20061025)
MIME-Version: 1.0
To: cygwin AT cygwin DOT com
Subject: Cygwin util replacing win-util for UCS-2 -> UTF-8; utf8 support (redux)
X-IsSubscribed: yes
Mailing-List: contact cygwin-help AT cygwin DOT com; run by ezmlm
List-Subscribe: <mailto:cygwin-subscribe AT cygwin DOT com>
List-Archive: <http://sourceware.org/ml/cygwin/>
List-Post: <mailto:cygwin AT cygwin DOT com>
List-Help: <mailto:cygwin-help AT cygwin DOT com>, <http://sourceware.org/ml/#faqs>
Sender: cygwin-owner AT cygwin DOT com
Mail-Followup-To: cygwin AT cygwin DOT com
Delivered-To: mailing list cygwin AT cygwin DOT com

Right now, to convert an NT text file in UCS-2 format, in bash,
I use:

mode.com codepage select=65001
reg export hklm\\software hklm-sw.reg5
cmd /c type hklm-sw.reg5  > hklm-sw-utf8.txt

It isn't perfect -- any UCS-2 entries that are not valid UTF-16
won't get converted properly (since they don't represent a valid
text string).

You can convert to other codepages, of course, by selecting
an alternate code page with "mode".

What ever happened to the UTF-8 compatibility layer that
someone wrote a patch for a while back?  I don't recall seeing
the issue discussed on the list.  I heard mention that it might
have been on another list, but it seems as a cygwin issue, it
might have gotten more people interested in the discussion had it
been discussed here.

It's troublesome to have filenames that are valid under
windows be inaccessible with cygwin utils (like rsync).
It's a pain that I can't rsync my music directory (which
contains World-beat music) to my mp3 player without getting
multiple "file not found" errors. (due to rsync not
understanding International filenames).

Can't even "ls" in some directories:

/m/World/Omar/Süleyan the Magnificient> ll -gG
ls: 01 Istanbul'dan Görüntüler.mp3: No such file or directory
ls: 17 Istanbul'dan Görüntüler (Reprise).mp3: No such file or directory
ls: 10 Süleyman?n Öyküsü.mp3: No such file or directory
ls: 02 Gögü Yedi Kat?.mp3: No such file or directory
ls: 08 Topkap?n?n Bahçesi.mp3: No such file or directory
ls: 11 Teke Z?plamas?.mp3: No such file or directory
ls: 13 Segâh Pesrev.mp3: No such file or directory
ls: 15 Hicaz Pesrev.mp3: No such file or directory
ls: 05 Ussak Semai.mp3: No such file or directory
total 23088
-rw-r----- 1 2447657 Mar  8  2003 03 Egeli Gemici.mp3
-rw-r----- 1 3253625 Mar  8  2003 04 Aya Sofya.mp3
-rw-r----- 1 1311799 Mar  8  2003 06 Nihâvend Fantazi.mp3
-rw-r----- 1 1733289 Mar  8  2003 07 Kuzeydeki Köy.mp3
-rw-r----- 1 6059476 Mar  8  2003 09 Rast Medhal.mp3
-rw-r----- 1 3791275 Mar  8  2003 12 Mevlânâ.mp3
-rw-r----- 1 1371208 Mar  8  2003 14 Hicaz Taksimi.mp3
-rw-r----- 1 3654052 Mar  8  2003 16 Makber.mp3
/m/World/Omar/Süleyan the Magnificient>


--
Unsubscribe info:      http://cygwin.com/ml/#unsubscribe-simple
Problem reports:       http://cygwin.com/problems.html
Documentation:         http://cygwin.com/docs.html
FAQ:                   http://cygwin.com/faq/

- Raw text -


  webmaster     delorie software   privacy  
  Copyright © 2019   by DJ Delorie     Updated Jul 2019