delorie.com/archives/browse.cgi   search  
Mail Archives: cygwin/2017/12/14/14:20:46

X-Recipient: archive-cygwin AT delorie DOT com
DomainKey-Signature: a=rsa-sha1; c=nofws; d=sourceware.org; h=list-id
:list-unsubscribe:list-subscribe:list-archive:list-post
:list-help:sender:date:from:reply-to:message-id:to:subject
:in-reply-to:references:mime-version:content-type
:content-transfer-encoding; q=dns; s=default; b=BvN3nRejSU03cWCw
H03sNuC5TLDwOA0wbSxm8ONHdgaTPfZArO9LZ7E8JEtFQl/TdOSTdhqY845QiKK7
YLPQ+GKctFW5EHNJMRFGWDPPE2i5e85r3QKATWTmh8UVv5SGVppdPGztPz8PtzYq
s/PKRzq7I6245XFZMuF9xI7aljc=
DKIM-Signature: v=1; a=rsa-sha1; c=relaxed; d=sourceware.org; h=list-id
:list-unsubscribe:list-subscribe:list-archive:list-post
:list-help:sender:date:from:reply-to:message-id:to:subject
:in-reply-to:references:mime-version:content-type
:content-transfer-encoding; s=default; bh=a0q6Eq37koQo2a6a6CTNLx
ZS9LE=; b=KvcAyj+Kg0uo7HTIAm7r4UP5b1Y/FybNEY5pJDyY4OZNS/94XrWpnl
Jsg9aRhMEhbHDX0qsqrxMuikb/hQh+jUnXVvGgLxeCA/HLSOrgDThWTse8PPLnAH
O/6rBhAXMa48s6JDH+yb4AxwId8hPGAoLUsQSf2UYyrIvBAa7pDw8=
Mailing-List: contact cygwin-help AT cygwin DOT com; run by ezmlm
List-Id: <cygwin.cygwin.com>
List-Subscribe: <mailto:cygwin-subscribe AT cygwin DOT com>
List-Archive: <http://sourceware.org/ml/cygwin/>
List-Post: <mailto:cygwin AT cygwin DOT com>
List-Help: <mailto:cygwin-help AT cygwin DOT com>, <http://sourceware.org/ml/#faqs>
Sender: cygwin-owner AT cygwin DOT com
Mail-Followup-To: cygwin AT cygwin DOT com
Delivered-To: mailing list cygwin AT cygwin DOT com
Authentication-Results: sourceware.org; auth=none
X-Virus-Found: No
X-Spam-SWARE-Status: No, score=4.8 required=5.0 tests=BAYES_50,FREEMAIL_FROM,KAM_INFOUSMEBIZ,KAM_THEBAT,LIKELY_SPAM_SUBJECT,MIME_BASE64_BLANKS,RCVD_IN_DNSWL_LOW,SPF_PASS autolearn=no version=3.3.2 spammy=UD:index.htm, indexhtm, index.htm, H*F:D*yandex.ru
X-HELO: forward103j.mail.yandex.net
Authentication-Results: smtp4o.mail.yandex.net; dkim=pass header.i=@yandex.ru
Date: Thu, 14 Dec 2017 22:19:46 +0300
From: Andrey Repin <anrdaemon AT yandex DOT ru>
Reply-To: cygwin AT cygwin DOT com
Message-ID: <1534444625.20171214221946@yandex.ru>
To: Thomas Taylor <tayloth AT gmail DOT com>, cygwin AT cygwin DOT com
Subject: Re: Need help with multibyte UTF-8 characters
In-Reply-To: <1909177a-3f35-52d5-1717-9007d6efaa71@gmail.com>
References: <626a3c06-e9f2-1932-f1f3-47ddb2051215 AT gmail DOT com> <9d3b73ff-f596-51a2-909a-30a767e3e9b3 AT gmail DOT com> <1909177a-3f35-52d5-1717-9007d6efaa71 AT gmail DOT com>
MIME-Version: 1.0
X-IsSubscribed: yes
X-MIME-Autoconverted: from base64 to 8bit by delorie.com id vBEJKjaV006694

Greetings, Thomas Taylor!

> I believe that Cygwin displays certain UTF-8 characters incorrectly.  To 
> see the problem, first save the attached "utf-8_test.sed" text file to 
> your desktop. 

First, your "NBSP" is actually http://www.fileformat.info/info/unicode/char/23b5/index.htm

> Then run "mintty," and set its options by right clicking
> in its title bar, selecting "Options" and then "Text." 

I just keep them clear.

> On the Text page
> set "Locale" to "en_US" and "Character set" to "UTF-8," and then 
> "Save."  Now exit and restart mintty.  Change directory to your desktop 
> and run the editor "vim" on the utf-8_test.sed file.  Once inside vim do 
> a ":set fileencoding=utf-8".  You should now see that vim displays 
> correctly a sample of one-, two-, and three-byte UTF-8 character 
> encodings in the test file.  Vim fails, however, on the three-byte 
> encodings for the "en" dash, the "em" dash, and the ellipsis, each of 
> which displays incorrectly as a filled-in rectangle.  Now exit vim and 
> do a "less" or "cat" on the utf-8_test.sed file.  You should see most of 
> the sample UTF-8 encoded characters displayed correctly, except once 
> again for the en dash, em dash, and ellipsis. 

All displayed correctly. Lucida Console 11pt.

> So it looks like a problem in the underlying Cygwin run-time libraries
> rather than in vim, less, or cat.  I haven't tested this on four-byte UTF-8
> character encodings, but assume Cygwin will have similar problems.

I don't have a good console font for mb4, but I presume it will be displaed
just fine.


-- 
With best regards,
Andrey Repin
Thursday, December 14, 2017 21:59:07

Sorry for my terrible english...
--
Problem reports:       http://cygwin.com/problems.html
FAQ:                   http://cygwin.com/faq/
Documentation:         http://cygwin.com/docs.html
Unsubscribe info:      http://cygwin.com/ml/#unsubscribe-simple


- Raw text -


  webmaster     delorie software   privacy  
  Copyright © 2019   by DJ Delorie     Updated Jul 2019