Mail Archives: cygwin/2017/12/13/00:21:57

Mailing-List: contact cygwin-help AT cygwin DOT com; run by ezmlm
List-Id: <cygwin.cygwin.com>
List-Subscribe: <mailto:cygwin-subscribe AT cygwin DOT com>
List-Archive: <http://sourceware.org/ml/cygwin/>
List-Post: <mailto:cygwin AT cygwin DOT com>
List-Help: <mailto:cygwin-help AT cygwin DOT com>, <http://sourceware.org/ml/#faqs>
Sender: cygwin-owner AT cygwin DOT com
Mail-Followup-To: cygwin AT cygwin DOT com
Delivered-To: mailing list cygwin AT cygwin DOT com
Reply-To: Brian DOT Inglis AT SystematicSw DOT ab DOT ca
Subject: Re: Need help with multibyte UTF-8 characters
To: cygwin AT cygwin DOT com
References: <626a3c06-e9f2-1932-f1f3-47ddb2051215 AT gmail DOT com>
From: Brian Inglis <Brian DOT Inglis AT SystematicSw DOT ab DOT ca>
Message-ID: <b190e8bc-a60e-2a30-5caa-a2f67a0b91ce@SystematicSw.ab.ca>
Date: Tue, 12 Dec 2017 22:21:37 -0700

On 2017-12-04 18:23, Thomas Taylor wrote:
> I want to use multibyte UTF-8 characters in 64-bit Cygwin under Windows 7.  The
> "vim" editor running in mintty displays the two-byte characters correctly, but
> not the three- (and I assume four-) byte characters, which instead display as
> rectangular filled-in blocks.  The "less" program doesn't even display two-byte
> characters correctly, but instead displays them as <A1> to <FF>, depending on
> the character in question, in reverse color in the terminal window.  The "cat"
> program is even worse, replacing every two-byte character with a character that
> looks like three horizontal bars stacked one above the other.  I've read the
> "Internationalization" page in the Cygwin online manual, but am still baffled. 
> My LANG environment variable is set to "en_US.UTF-8".  Can anyone help?

Your Windows Regional settings and the Language and Character Set under mintty
Options/Text should match; otherwise your system or mintty will have to convert
every character.
The profile commands below set the Cygwin locale to your Windows Regional
settings with the UTF-8 charset; on Unix, where "locale -fU" is not supported,
they take the locale from the output of plain "locale" instead.

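# Set user-defined locale.
# On Cygwin, "locale -fU" prints the locale of the Windows regional format
# settings with ".UTF-8" appended; if that fails, fall back to the last of
# LANG, LC_CTYPE, or LC_ALL reported by plain "locale", with quotes stripped.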
locale -fU > /dev/null 2>&1     \
        && LC_ALL=$(locale -fU) \
        || LC_ALL=$(locale |    \
                sed '/^LANG=\|^LC_CTYPE=\|^LC_ALL=/{s///;h};$!d;x;s/"//g')
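
To check the result afterwards, a minimal sketch, assuming a POSIX shell: it
prints the charmap the locale reports and one sample character for each UTF-8
encoded length. The helper name utf8_sample and the sample characters are
illustrative choices, not anything from Cygwin or mintty.

```shell
# utf8_sample N: print a sample character whose UTF-8 encoding is N bytes long.
# Octal escapes are used so the output does not depend on this file's encoding.
utf8_sample() {
    case "$1" in
        2) printf '\303\251' ;;          # U+00E9  e with acute accent
        3) printf '\342\202\254' ;;      # U+20AC  euro sign
        4) printf '\360\237\230\200' ;;  # U+1F600 emoji
        *) return 1 ;;
    esac
}

# The charmap should read UTF-8 once the locale is set as intended.
printf 'charmap: %s\n' "$(locale charmap 2>/dev/null)"

# Each line should show a single glyph if the terminal decodes UTF-8.
for n in 2 3 4; do
    printf '%s-byte: %s\n' "$n" "$(utf8_sample "$n")"
done
```

If the charmap is UTF-8 but a glyph still shows as a filled block, the font
selected in the terminal may simply lack that character.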

-- 
Take care. Thanks, Brian Inglis, Calgary, Alberta, Canada
