Mail Archives: cygwin/2009/12/09/06:48:50

In-Reply-To: <4B1F8BA3.3070302@towo.net>
References: <4B1E4D22 DOT 9020406 AT hones DOT org DOT uk> <416096c60912080520l49cbf64bwa4a2c4510b13a9f0 AT mail DOT gmail DOT com> <4B1E56D1 DOT 2090501 AT hones DOT org DOT uk> <416096c60912081305k3d386207q5d6fb6cdc77c47a AT mail DOT gmail DOT com> <4B1F29F0 DOT 7020701 AT hones DOT org DOT uk> <416096c60912082136x316686b3gab4275c9ac5736de AT mail DOT gmail DOT com> <4B1F8BA3 DOT 3070302 AT towo DOT net>
Date: Wed, 9 Dec 2009 11:48:35 +0000
Message-ID: <416096c60912090348u4a2180e6w4d613346cbbf92bd@mail.gmail.com>
Subject: Re: UTF-8 in Cygwin console on 1.7
From: Andy Koppe <andy DOT koppe AT gmail DOT com>
To: cygwin AT cygwin DOT com

2009/12/9 Thomas Wolff:
> So the #define IGN should go as well...

Yep.

> Actually, I don't really understand this code part:
> * What is the purpose of the lookup table for the switch rather than just
> switching on a few special control characters?

Good question.
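
To make the two alternatives concrete, here is a minimal sketch of each:
a 256-entry lookup table feeding a switch, and a plain switch on the few
special control characters. It is an illustration only, not the console
code. The action names merely echo the NOR/IGN labels from this thread,
and which byte maps to which action is assumed for the example.

```c
/* Hypothetical action codes. NOR is 0 so that table entries that are
   not listed explicitly default to "normal character". */
enum action { NOR = 0, IGN, BEL, BAK, TAB, LF, CR, ESC };

/* Variant 1: lookup table indexed by the byte value (assumed mapping).
   C99 designated initializers; unlisted entries are zero, i.e. NOR. */
static const unsigned char action_table[256] = {
    [0x00] = IGN,  /* NUL: ignored */
    [0x07] = BEL,  /* bell */
    [0x08] = BAK,  /* backspace */
    [0x09] = TAB,  /* horizontal tab */
    [0x0a] = LF,   /* line feed */
    [0x0d] = CR,   /* carriage return */
    [0x1b] = ESC,  /* start of an escape sequence */
};

enum action classify_table(unsigned char c)
{
    return (enum action) action_table[c];
}

/* Variant 2: switch directly on the special control characters. */
enum action classify_switch(unsigned char c)
{
    switch (c) {
    case 0x00: return IGN;
    case 0x07: return BEL;
    case 0x08: return BAK;
    case 0x09: return TAB;
    case 0x0a: return LF;
    case 0x0d: return CR;
    case 0x1b: return ESC;
    default:   return NOR;
    }
}

/* Returns 1 if both variants classify every byte value identically. */
int classifiers_agree(void)
{
    for (int c = 0; c < 256; c++)
        if (classify_table((unsigned char) c) !=
            classify_switch((unsigned char) c))
            return 0;
    return 1;
}
```

With a mapping like this the two variants are interchangeable, so the
table buys nothing beyond a single indexed load in place of whatever the
compiler generates for the switch.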

> * What is the idea of (randomly?) assigning ERR to some control characters
> and NOR to others?

I wondered about that too.

> * How does the case NOR *work* actually?

It works because of the code around line 1530 that deals with multibyte
chars. That further deepens the mystery regarding the ERR/NOR distinction,
though.
Do only certain control chars (if any) appear in doublebyte encodings?

> Maybe after clarifying this, the whole snippet should be cleaned up a
> little bit.

Sounds like a good addition to your console patch.

Andy

