Mail Archives: cygwin/2009/09/25/05:15:53

X-Recipient: archive-cygwin AT delorie DOT com
X-Spam-Check-By: sourceware.org
Date: Fri, 25 Sep 2009 11:15:25 +0200
From: Corinna Vinschen <corinna-cygwin AT cygwin DOT com>
To: cygwin AT cygwin DOT com
Subject: Re: [1.7] UTF-8, find vs. tar
Message-ID: <20090925091525.GL30851@calimero.vinschen.de>
Reply-To: cygwin AT cygwin DOT com
Mail-Followup-To: cygwin AT cygwin DOT com
References: <4ABC3BB3 DOT 9020906 AT users DOT sourceforge DOT net>
MIME-Version: 1.0
In-Reply-To: <4ABC3BB3.9020906@users.sourceforge.net>
User-Agent: Mutt/1.5.19 (2009-02-20)
Mailing-List: contact cygwin-help AT cygwin DOT com; run by ezmlm
List-Id: <cygwin.cygwin.com>
List-Unsubscribe: <mailto:cygwin-unsubscribe-archive-cygwin=delorie DOT com AT cygwin DOT com>
List-Subscribe: <mailto:cygwin-subscribe AT cygwin DOT com>
List-Archive: <http://sourceware.org/ml/cygwin/>
List-Post: <mailto:cygwin AT cygwin DOT com>
List-Help: <mailto:cygwin-help AT cygwin DOT com>, <http://sourceware.org/ml/#faqs>
Sender: cygwin-owner AT cygwin DOT com
Delivered-To: mailing list cygwin AT cygwin DOT com

On Sep 24 22:40, Yaakov S wrote:
> I'm having some difficulty with a package containing a file with a UTF-8  
> character:
>
> wget http://downloads.sourceforge.net/klavaro/klavaro-1.3.1.tar.bz2
> tar jxf klavaro-1.3.1.tar.bz2
> cd klavaro-1.3.1
> tar jcf TEST.tar.bz2 data/dvorak_fr*
> tar jtf TEST.tar.bz2 > tmptar.out
> find data/ -name 'dvorak_fr*' > tmpfind.out
> diff -u tmpfind.out tmptar.out
>
> The character in question is 'é' (aka U+00E9, small e with acute)[1].  
> The difference in rendering is throwing cygport off at the "checking  
> packages for missing/duplicate files" stage.
>
> What do I need to do to get these to match?

Nothing but wait.  The reason that tar doesn't print the characters
while find does is probably that find calls setlocale and tar
doesn't.  I hope to get this fixed in the next couple of days.
We're discussing the entire locale stuff on the cygwin-developers
list right now, see the threads starting at
http://cygwin.com/ml/cygwin-developers/2009-09/msg00009.html
and
http://cygwin.com/ml/cygwin-developers/2009-09/msg00017.html
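
In the meantime, a byte-level dump of the two listings may show more
than diff -u does, since how a terminal renders the name depends on
the locale as well.  The following is only a minimal sketch, assuming
od and tr are available; hex_bytes is a hypothetical helper name, and
tmpfind.out and tmptar.out are the files from the commands quoted
above.

```shell
#!/bin/sh
# hex_bytes (hypothetical helper): print stdin as a continuous hex string.
hex_bytes() {
    od -An -tx1 | tr -d ' \n'
}

# U+00E9 encoded as UTF-8 is the two bytes 0xC3 0xA9.
printf '\303\251' | hex_bytes; echo    # prints c3a9

# Dump both listings, if present, to see which bytes each tool wrote.
for f in tmpfind.out tmptar.out; do
    if [ -f "$f" ]; then
        printf '%s: ' "$f"
        hex_bytes < "$f"
        echo
    fi
done
```

If one dump contains c3a9 where the other contains something else,
the two tools emitted different bytes for the file name, rather than
the terminal merely displaying the same bytes differently.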

My current locally patched DLL doesn't have that problem anymore,
so hopefully we're on the right track.


Corinna

-- 
Corinna Vinschen                  Please, send mails regarding Cygwin to
Cygwin Project Co-Leader          cygwin AT cygwin DOT com
Red Hat

