delorie.com/archives/browse.cgi   search  
Mail Archives: cygwin/2010/01/12/17:32:48

X-Recipient: archive-cygwin AT delorie DOT com
X-SWARE-Spam-Status: No, hits=-2.0 required=5.0 tests=AWL,BAYES_00,SARE_MSGID_LONG40,SPF_PASS
X-Spam-Check-By: sourceware.org
MIME-Version: 1.0
In-Reply-To: <2BF01EB27B56CC478AD6E5A0A28931F297966C@A1DAL1SWPES19MB.ams.acs-inc.net>
References: <4B4CD234 DOT 9000103 AT bopp DOT net> <2BF01EB27B56CC478AD6E5A0A28931F297966C AT A1DAL1SWPES19MB DOT ams DOT acs-inc DOT net>
Date: Tue, 12 Jan 2010 22:32:31 +0000
Message-ID: <416096c61001121432w55706227k4a034623c4eedaaa@mail.gmail.com>
Subject: Re: various problems resulting from version 1.7.1
From: Andy Koppe <andy DOT koppe AT gmail DOT com>
To: cygwin AT cygwin DOT com
X-IsSubscribed: yes
Mailing-List: contact cygwin-help AT cygwin DOT com; run by ezmlm
List-Id: <cygwin.cygwin.com>
List-Unsubscribe: <mailto:cygwin-unsubscribe-archive-cygwin=delorie DOT com AT cygwin DOT com>
List-Subscribe: <mailto:cygwin-subscribe AT cygwin DOT com>
List-Archive: <http://sourceware.org/ml/cygwin/>
List-Post: <mailto:cygwin AT cygwin DOT com>
List-Help: <mailto:cygwin-help AT cygwin DOT com>, <http://sourceware.org/ml/#faqs>
Sender: cygwin-owner AT cygwin DOT com
Mail-Followup-To: cygwin AT cygwin DOT com
Delivered-To: mailing list cygwin AT cygwin DOT com

2010/1/12 Nellis, Kenneth:
> Thank you, Jeremy. BTW, I deal with legacy machines that don't
> know from SSH. :-(
>
> Well this gets more interesting, at least to me...
> With LANG=UTF-8, man uses the Unicode HYPHEN character (U+2010)
> to break long words at the right margin, and uses the Unicode
> MINUS character (U+2212) to prefix command line options (e.g.,
> ls -l). The Windows Command Prompt window, used by Cygwin.bat,
> doesn't display U+2010 correctly with the Lucida Console font,
> but it works correctly with the default FixedSys font.

The FixedSys font is not a Unicode font, so what happens is that the
console display switches into codepage mode and automatically maps all
Unicode characters not in the font's codepage to some approximation of
them, where possible. For hyphen and minus that works well, but for
most characters it doesn't. You'll just get question marks instead.


> Mintty,
> OTOH, doesn't display U+2212 correctly with the default FixedSys
> font, but does with the Lucida Console font. So, if I can
> tolerate the ugly FixedSys font, then I can use Cygwin.bat and
> the Windows FTP and Telnet tools without (anticipated) problems.
>
> (I wonder if the Mintty developer would want to update Mintty to
> support U+2212.)

It is supported just fine. Mintty does have a special hack to map
U+2010 to plain '-', because most fixed-width Windows seem to be
missing that for some reason, but U+2212 is present in all the fonts
I've tried, except of course in non-Unicode fonts. For example,
Consolas, Courier New, and DejaVu Sans Mono all have it.

Bottom line: use a Unicode font for UTF-8.

Andy

--
Problem reports:       http://cygwin.com/problems.html
FAQ:                   http://cygwin.com/faq/
Documentation:         http://cygwin.com/docs.html
Unsubscribe info:      http://cygwin.com/ml/#unsubscribe-simple

- Raw text -


  webmaster     delorie software   privacy  
  Copyright © 2019   by DJ Delorie     Updated Jul 2019