delorie.com/archives/browse.cgi   search  
Mail Archives: cygwin/2008/03/03/22:00:31

X-Recipient: archive-cygwin AT delorie DOT com
X-Spam-Check-By: sourceware.org
Date: Tue, 4 Mar 2008 03:59:37 +0100
From: Paul-Kenji Cahier <pkc AT F1-Photo DOT com>
X-Mailer: The Bat! (v3.99.24) Professional
Reply-To: Paul-Kenji Cahier <pkc AT F1-Photo DOT com>
Message-ID: <447314643.20080304035937@F1-Photo.com>
To: cygwin AT cygwin DOT com
Subject: Re[2]: ZSH - UTF-8
In-Reply-To: <20080304091535.168@blackhawk>
References: <13510047732 DOT 20080303203325 AT F1-Photo DOT com> <20080304091535 DOT 168 AT blackhawk>
MIME-Version: 1.0
Mailing-List: contact cygwin-help AT cygwin DOT com; run by ezmlm
List-Id: <cygwin.cygwin.com>
List-Unsubscribe: <mailto:cygwin-unsubscribe-archive-cygwin=delorie DOT com AT cygwin DOT com>
List-Subscribe: <mailto:cygwin-subscribe AT cygwin DOT com>
List-Archive: <http://sourceware.org/ml/cygwin/>
List-Post: <mailto:cygwin AT cygwin DOT com>
List-Help: <mailto:cygwin-help AT cygwin DOT com>, <http://sourceware.org/ml/#faqs>
Sender: cygwin-owner AT cygwin DOT com
Mail-Followup-To: cygwin AT cygwin DOT com
Delivered-To: mailing list cygwin AT cygwin DOT com

Here things display okay, but trying to erase a multi-byte character ends up only deleting one byte;
ie (in pseudo code):
"echo test:é<backspace><enter>"->"test:<0x3>"
Which clearly shows the editing is not being utf-8 aware.
It also displays badly if you start doing multi-line commands:
the term&shell get highly confused as the zsh is non-utf-8 enabled.
That's with the cygwin default zsh 4.3.4.

Now, compiling a 4.3.5 using a simple ./configure && make
I simply dont get any utf-8 displaying on the line editor anymore.
Anything non-ascii is displayed as one-byte control codes.
I also tried ./configure --enable-multibyte with the same result.
LC_ALL set to C.UTF-8 in all cases. Cygwin freshly installed.
Also tried LC_ALL=en_US.UTF-8(which shouldnt exist) and it did
not work either.

Setting LC_ALL=C restores the cygwin zsh 4.3.4 behaviour: it displays fine non-ascii codes
but fails when trying to erase/move around them(obviously it's considering them as one byte
but the terminal is seeing them as utf-8, which creates confusion).


>?I am using: zsh 4.3.5 (i686-pc-cygwin)  ( compiled from src )
>?I just did $export LC_ALL=C.UTF-8

>?cmd line editting didn't seem ill affected and worked fine.

>?Paul-Kenji Cahier <pkc AT F1-Photo DOT com> wrote:
>>... (Note: I'm talking of having zsh display&edit properly utf-8 in the line editor,
>>including the completion system).
>>................

>?If you could be me some tests to try I'd be happy to do it. Otherwise I am not quite sure how that should be done.

>?--
>?Unsubscribe info:   http://cygwin.com/ml/#unsubscribe-simple
>?Problem reports:   http://cygwin.com/problems.html
>?Documentation:    http://cygwin.com/docs.html
>?FAQ:         http://cygwin.com/faq/


--
Unsubscribe info:      http://cygwin.com/ml/#unsubscribe-simple
Problem reports:       http://cygwin.com/problems.html
Documentation:         http://cygwin.com/docs.html
FAQ:                   http://cygwin.com/faq/

- Raw text -


  webmaster     delorie software   privacy  
  Copyright © 2019   by DJ Delorie     Updated Jul 2019