X-Recipient: archive-cygwin AT delorie DOT com DomainKey-Signature: a=rsa-sha1; c=nofws; d=sourceware.org; h=list-id :list-unsubscribe:list-subscribe:list-archive:list-post :list-help:sender:reply-to:subject:to:references:from:message-id :date:mime-version:in-reply-to:content-type :content-transfer-encoding; q=dns; s=default; b=sZJ7Ta9vlUN8lUVD t8Xvh064JRZJ/xpQOFjtTRjvQ5XNZcXbGjLWatYQpgCy7Th0xe2k1Nm8/w2OjTNQ Qm6ak8Bx5Ym481bZGTnWauer5US/PiNCWmuosCsLClgWP7+HdTBB6d7VQ2T01s+d YQWXdfJVnnsmJYlcu9f902OF4uw= DKIM-Signature: v=1; a=rsa-sha1; c=relaxed; d=sourceware.org; h=list-id :list-unsubscribe:list-subscribe:list-archive:list-post :list-help:sender:reply-to:subject:to:references:from:message-id :date:mime-version:in-reply-to:content-type :content-transfer-encoding; s=default; bh=xKvaclGs3m+5E6UwCKPZQp GK25w=; b=k4D7dquHVNZn1s8kcqA3rCi4ihsV5AGtjpNRkwg6OdwWQye+oDSbOR f2IGP1WAeGe+xcU5jNsy9z0tB2hlR+3oIwklKRBSHQlGuWJtEmoUVQu0HJUpTtiE f0mquqwfPmy96+/wwoSMsLNCanVENsj92IHKT+KRHeDvcs63hp4XM= Mailing-List: contact cygwin-help AT cygwin DOT com; run by ezmlm List-Id: List-Subscribe: List-Archive: List-Post: List-Help: , Sender: cygwin-owner AT cygwin DOT com Mail-Followup-To: cygwin AT cygwin DOT com Delivered-To: mailing list cygwin AT cygwin DOT com Authentication-Results: sourceware.org; auth=none X-Virus-Found: No X-Spam-SWARE-Status: No, score=2.3 required=5.0 tests=AWL,BAYES_50,KAM_LAZY_DOMAIN_SECURITY,LIKELY_SPAM_SUBJECT,RCVD_IN_DNSWL_LOW autolearn=no version=3.3.2 spammy=regional, H*R:D*ca, mintty, calgary X-HELO: smtp-out-so.shaw.ca X-Authority-Analysis: v=2.2 cv=JuuBlIwC c=1 sm=1 tr=0 a=MVEHjbUiAHxQW0jfcDq5EA==:117 a=MVEHjbUiAHxQW0jfcDq5EA==:17 a=N659UExz7-8A:10 a=eWUNByC9m1pz4WcI4cYA:9 a=pILNOxqGKmIA:10 Reply-To: Brian DOT Inglis AT SystematicSw DOT ab DOT ca Subject: Re: Need help with multibyte UTF-8 characters To: cygwin AT cygwin DOT com References: <626a3c06-e9f2-1932-f1f3-47ddb2051215 AT gmail DOT com> From: Brian Inglis Message-ID: <3317b062-e0b1-03dd-b289-97823217dd27@SystematicSw.ab.ca> Date: Mon, 4 Dec 2017 20:48:06 -0700 User-Agent: Mozilla/5.0 (Windows NT 10.0; WOW64; rv:52.0) Gecko/20100101 Thunderbird/52.5.0 MIME-Version: 1.0 In-Reply-To: <626a3c06-e9f2-1932-f1f3-47ddb2051215@gmail.com> Content-Type: text/plain; charset=windows-1252 Content-Transfer-Encoding: 8bit X-CMAE-Envelope: MS4wfBpSDYXIc6S+I1uEq6jYw1PxW0gsDvRCWml2img8z/QtT+RBZEPlM8TgOwk4QEDbMrtjnGFdzh4/DcQqkROa68TxmkFWmDGdcvDhOrmpipa1vp7K2VEM kZiVgWFWDy89PeI9Zbc6poYANWOAS8MYTTT9egp5NKke3ubA+7/mwRk/1/1ssVr0XdYr7yGb50Lg0g== X-IsSubscribed: yes On 2017-12-04 18:23, Thomas Taylor wrote: > I want to use multibyte UTF-8 characters in 64-bit Cygwin under Windows 7.  The > "vim" editor running in mintty displays the two-byte characters correctly, but > not the three- (and I assume four-) byte characters, which instead display as > rectangular filled-in blocks.  The "less" program doesn't even display two-byte > characters correctly, but instead displays them as to , depending on > the character in question, in reverse color in the terminal window.  The "cat" > program is even worse, replacing every two-byte character with a character that > looks like three horizontal bars stacked one above the other.  I've read the > "Internationalization" page in the Cygwin online manual, but am still baffled.  > My LANG environment variable is set to "en_US.UTF-8".  Can anyone help? Check mintty/Options/Text/Locale[en_US]/Character set[UTF-8]/Apply/Save. Then exit and restart mintty and your shell. To see what locale Cygwin thinks you are set to, run: $ locale To check all Windows locale settings, you can run: $ for o in -s -u -n -i -f ''; do locale $o; done The first two should show your Windows install locale, the rest should show anything you have set up, or the same locale. If any settings don't match LANG, you may have to set LC_ALL=$LANG to force the setting. I use the following profile stanza across all systems for consistency: # Set user-defined locale - use regional settings if available locale -fU > /dev/null 2>&1 \ && LC_ALL=`locale -fU` \ || LC_ALL=`locale | \ /bin/sed '/^LANG=\|^LC_CTYPE=\|^LC_ALL=/{s///;h};$!d;x;s/"//g'` export LC_ALL -- Take care. Thanks, Brian Inglis, Calgary, Alberta, Canada -- Problem reports: http://cygwin.com/problems.html FAQ: http://cygwin.com/faq/ Documentation: http://cygwin.com/docs.html Unsubscribe info: http://cygwin.com/ml/#unsubscribe-simple