delorie.com/archives/browse.cgi   search  
Mail Archives: cygwin/2017/12/14/19:32:43

X-Recipient: archive-cygwin AT delorie DOT com
DomainKey-Signature: a=rsa-sha1; c=nofws; d=sourceware.org; h=list-id
:list-unsubscribe:list-subscribe:list-archive:list-post
:list-help:sender:reply-to:subject:to:references:from:message-id
:date:mime-version:in-reply-to:content-type
:content-transfer-encoding; q=dns; s=default; b=WByHj2GYfGuubpc/
C8MUd18Z4eYE12DlU7wK+RK1D0LLDJEzF07rqUSBUn8bZuhJsf5eINDdO91oIdoY
Q0t+kT930TZ3YssH1dgrH9CKodd1JuzQIoFxGE1d2T8Q/e/hBGeBUclVV+MsRm/l
ZZNNBF9XYW9YCXWQ4o5nGnygxFI=
DKIM-Signature: v=1; a=rsa-sha1; c=relaxed; d=sourceware.org; h=list-id
:list-unsubscribe:list-subscribe:list-archive:list-post
:list-help:sender:reply-to:subject:to:references:from:message-id
:date:mime-version:in-reply-to:content-type
:content-transfer-encoding; s=default; bh=cMVKF7UG188+ppi8RskcRJ
JbIAM=; b=bW4zxo0lbaXUrV4USw1VPoEponKtjlQXs/Ax9mi9Ls31ApQrDLZuhe
pYqiIlRWcVrBVjU0SrH0HF0Zusa9rOGf+GcUcfR78gHbiJH3JMeKeZPblc8YAo9b
WfudWwsHTdM9qf3AkbgeafIRWy/iHcOfuna5yztANG2mppjiFwVxc=
Mailing-List: contact cygwin-help AT cygwin DOT com; run by ezmlm
List-Id: <cygwin.cygwin.com>
List-Subscribe: <mailto:cygwin-subscribe AT cygwin DOT com>
List-Archive: <http://sourceware.org/ml/cygwin/>
List-Post: <mailto:cygwin AT cygwin DOT com>
List-Help: <mailto:cygwin-help AT cygwin DOT com>, <http://sourceware.org/ml/#faqs>
Sender: cygwin-owner AT cygwin DOT com
Mail-Followup-To: cygwin AT cygwin DOT com
Delivered-To: mailing list cygwin AT cygwin DOT com
Authentication-Results: sourceware.org; auth=none
X-Virus-Found: No
X-Spam-SWARE-Status: No, score=2.4 required=5.0 tests=AWL,BAYES_50,KAM_LAZY_DOMAIN_SECURITY,LIKELY_SPAM_SUBJECT,RCVD_IN_DNSWL_LOW autolearn=no version=3.3.2 spammy=calgary, Calgary, alberta, Alberta
X-HELO: smtp-out-so.shaw.ca
X-Authority-Analysis: v=2.2 cv=NKylwwyg c=1 sm=1 tr=0 a=MVEHjbUiAHxQW0jfcDq5EA==:117 a=MVEHjbUiAHxQW0jfcDq5EA==:17 a=N659UExz7-8A:10 a=KsUnjp9KtFXNGl9f_ekA:9 a=pILNOxqGKmIA:10
Reply-To: Brian DOT Inglis AT SystematicSw DOT ab DOT ca
Subject: Re: Need help with multibyte UTF-8 characters
To: cygwin AT cygwin DOT com
References: <626a3c06-e9f2-1932-f1f3-47ddb2051215 AT gmail DOT com> <9d3b73ff-f596-51a2-909a-30a767e3e9b3 AT gmail DOT com> <1909177a-3f35-52d5-1717-9007d6efaa71 AT gmail DOT com>
From: Brian Inglis <Brian DOT Inglis AT SystematicSw DOT ab DOT ca>
Message-ID: <fd791c61-e5ae-051d-c401-16e20274464c@SystematicSw.ab.ca>
Date: Thu, 14 Dec 2017 17:32:27 -0700
User-Agent: Mozilla/5.0 (Windows NT 10.0; WOW64; rv:52.0) Gecko/20100101 Thunderbird/52.5.0
MIME-Version: 1.0
In-Reply-To: <1909177a-3f35-52d5-1717-9007d6efaa71@gmail.com>
X-CMAE-Envelope: MS4wfCEJAF5VPYxyq8QZlNZnlY49hZZAwKCzQarIvJJzqZbaddn04Z07k+2OaBfO0PgLdwNCSYEACJxFKFB0nQZVTFVa5dSvOKnPYeMHoLGnPaMKGPecg+fc Kk4/byUiOfO8VQygvpHFfqD3+kSllLh6pRaWjMbIgQnxY+knVeDPF7JSLfS+CjHnotQqQuRNE9oTmg==
X-IsSubscribed: yes

On 2017-12-12 12:42, Thomas Taylor wrote:
> I believe that Cygwin displays certain UTF-8 characters incorrectly.  To see the
> problem, first save the attached "utf-8_test.sed" text file to your desktop. 
> Then run "mintty," and set its options by right clicking in its title bar,
> selecting "Options" and then "Text."  On the Text page set "Locale" to "en_US"
> and "Character set" to "UTF-8," and then "Save."  Now exit and restart mintty. 
> Change directory to your desktop and run the editor "vim" on the utf-8_test.sed
> file.  Once inside vim do a ":set fileencoding=utf-8".  You should now see that
> vim displays correctly a sample of one-, two-, and three-byte UTF-8 character
> encodings in the test file.  Vim fails, however, on the three-byte encodings for
> the "en" dash, the "em" dash, and the ellipsis, each of which displays
> incorrectly as a filled-in rectangle.  Now exit vim and do a "less" or "cat" on
> the utf-8_test.sed file.  You should see most of the sample UTF-8 encoded
> characters displayed correctly, except once again for the en dash, em dash, and
> ellipsis.  So it looks like a problem in the underlying Cygwin run-time
> libraries rather than in vim, less, or cat.  I haven't tested this on four-byte
> UTF-8 character encodings, but assume Cygwin will have similar problems.

Like many others -- no problems visible -- all UTF-8 characters displayed
correctly in gvim/X, vim, less, cat from mintty.

-- 
Take care. Thanks, Brian Inglis, Calgary, Alberta, Canada

--
Problem reports:       http://cygwin.com/problems.html
FAQ:                   http://cygwin.com/faq/
Documentation:         http://cygwin.com/docs.html
Unsubscribe info:      http://cygwin.com/ml/#unsubscribe-simple

- Raw text -


  webmaster     delorie software   privacy  
  Copyright © 2019   by DJ Delorie     Updated Jul 2019