Date: Thu, 14 Dec 2017 22:19:46 +0300
From: Andrey Repin
Reply-To: cygwin AT cygwin DOT com
Message-ID: <1534444625.20171214221946@yandex.ru>
To: Thomas Taylor, cygwin AT cygwin DOT com
Subject: Re: Need help with multibyte UTF-8 characters
In-Reply-To: <1909177a-3f35-52d5-1717-9007d6efaa71@gmail.com>
References: <626a3c06-e9f2-1932-f1f3-47ddb2051215 AT gmail DOT com> <9d3b73ff-f596-51a2-909a-30a767e3e9b3 AT gmail DOT com> <1909177a-3f35-52d5-1717-9007d6efaa71 AT gmail DOT com>

Greetings, Thomas Taylor!

> I believe that Cygwin displays certain UTF-8 characters incorrectly.  To
> see the problem, first save the attached "utf-8_test.sed" text file to
> your desktop.

First, your "NBSP" is actually
http://www.fileformat.info/info/unicode/char/23b5/index.htm

> Then run "mintty," and set its options by right clicking
> in its title bar, selecting "Options" and then "Text."

I just keep them clear.

> On the Text page
> set "Locale" to "en_US" and "Character set" to "UTF-8," and then
> "Save."  Now exit and restart mintty.  Change directory to your desktop
> and run the editor "vim" on the utf-8_test.sed file.  Once inside vim do
> a ":set fileencoding=utf-8".  You should now see that vim displays
> correctly a sample of one-, two-, and three-byte UTF-8 character
> encodings in the test file.  Vim fails, however, on the three-byte
> encodings for the "en" dash, the "em" dash, and the ellipsis, each of
> which displays incorrectly as a filled-in rectangle.  Now exit vim and
> do a "less" or "cat" on the utf-8_test.sed file.  You should see most of
> the sample UTF-8 encoded characters displayed correctly, except once
> again for the en dash, em dash, and ellipsis.

All displayed correctly. Lucida Console 11pt.

> So it looks like a problem in the underlying Cygwin run-time libraries
> rather than in vim, less, or cat.  I haven't tested this on four-byte UTF-8
> character encodings, but assume Cygwin will have similar problems.

I don't have a good console font for mb4, but I presume it will be displayed
just fine.
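
One way to check which code point a character in a test file really is, and how many bytes it takes in UTF-8, is a small Python sketch like the one below. The function name `describe` and the sample list are illustrative assumptions, not part of the original test file; the samples are just the characters mentioned in this thread (NBSP, U+23B5, en dash, em dash, ellipsis).

```python
import unicodedata


def describe(ch):
    """Return (code point, Unicode name, UTF-8 bytes as hex) for one character."""
    encoded = ch.encode("utf-8")
    hex_bytes = " ".join(f"{b:02x}" for b in encoded)
    return (f"U+{ord(ch):04X}", unicodedata.name(ch, "<unnamed>"), hex_bytes)


# Hypothetical sample set: the characters discussed in this thread.
samples = ["\u00a0", "\u23b5", "\u2013", "\u2014", "\u2026"]

for ch in samples:
    code_point, name, hex_bytes = describe(ch)
    print(f"{code_point}  {name:<24} {hex_bytes}")
```

A real no-break space (U+00A0) encodes as two bytes, while U+23B5 and the dashes and ellipsis each encode as three, so the byte column makes it easy to tell them apart regardless of how the terminal font renders them.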
-- 
With best regards,
Andrey Repin
Thursday, December 14, 2017 21:59:07

Sorry for my terrible english...

--
Problem reports:       http://cygwin.com/problems.html
FAQ:                   http://cygwin.com/faq/
Documentation:         http://cygwin.com/docs.html
Unsubscribe info:      http://cygwin.com/ml/#unsubscribe-simple