X-Recipient: archive-cygwin@delorie.com
DomainKey-Signature: a=rsa-sha1; c=nofws; d=sourceware.org; h=list-id
	:list-unsubscribe:list-subscribe:list-archive:list-post
	:list-help:sender:subject:to:references:from:message-id:date
	:mime-version:in-reply-to:content-type
	:content-transfer-encoding; q=dns; s=default; b=MXza1fhEz1vcdX3Y
	dgmb/AdrPkk9D/vfNpl/RgcCuxqcG1Rh1GUJbTCCK7BbDFXmbvMCWX8nG026MGTq
	5ewDbDqhOGEqbDP50odc3xg7oMjSbmOFF3RXDVe0H8UxcagAFkd2fHrobKRDWpEa
	lCUiw+vG+M58xNcQP3C6VypF3dQ=
DKIM-Signature: v=1; a=rsa-sha1; c=relaxed; d=sourceware.org; h=list-id
	:list-unsubscribe:list-subscribe:list-archive:list-post
	:list-help:sender:subject:to:references:from:message-id:date
	:mime-version:in-reply-to:content-type
	:content-transfer-encoding; s=default; bh=lxaspBddplhhls9PLUzzHP
	IqJKY=; b=SieQbiOM4YPXbfcpC7kM/JMGJXqcwpxlTQciBZj1jtK/lbv8GVEhjx
	RiBtLnHcuFz73GPlCkCG3TRnNAjLW3FIsTqPC0uYmZv3rgTVoRZ3XoJNVfCyNeut
	ib67Kp9iaRU3G9WFgNeeKhNITGRcNxY6RnW/i8kT7r1v/uTPYshdQ=
Mailing-List: contact cygwin-help@cygwin.com; run by ezmlm
List-Id: <cygwin.cygwin.com>
List-Subscribe: <mailto:cygwin-subscribe@cygwin.com>
List-Archive: <http://sourceware.org/ml/cygwin/>
List-Post: <mailto:cygwin@cygwin.com>
List-Help: <mailto:cygwin-help@cygwin.com>, <http://sourceware.org/ml/#faqs>
Sender: cygwin-owner@cygwin.com
Mail-Followup-To: cygwin@cygwin.com
Delivered-To: mailing list cygwin@cygwin.com
Authentication-Results: sourceware.org; auth=none
X-Virus-Found: No
X-Spam-SWARE-Status: No, score=3.5 required=5.0 tests=AWL,BAYES_50,KAM_LAZY_DOMAIN_SECURITY,LIKELY_SPAM_SUBJECT,RCVD_IN_DNSWL_NONE autolearn=no version=3.3.2 spammy=bars, filledin, filled-in, horizontal
X-HELO: mout.kundenserver.de
Subject: Re: Need help with multibyte UTF-8 characters
To: cygwin@cygwin.com
References: <626a3c06-e9f2-1932-f1f3-47ddb2051215@gmail.com> <b190e8bc-a60e-2a30-5caa-a2f67a0b91ce@SystematicSw.ab.ca>
From: Thomas Wolff <towo@towo.net>
Message-ID: <89a7d883-e3e5-42d7-2d22-020cdf6ccbdf@towo.net>
Date: Wed, 13 Dec 2017 08:50:15 +0100
User-Agent: Mozilla/5.0 (Windows NT 10.0; WOW64; rv:52.0) Gecko/20100101 Thunderbird/52.5.0
MIME-Version: 1.0
In-Reply-To: <b190e8bc-a60e-2a30-5caa-a2f67a0b91ce@SystematicSw.ab.ca>
Content-Type: text/plain; charset=windows-1252; format=flowed
Content-Transfer-Encoding: 8bit
X-UI-Out-Filterresults: notjunk:1;V01:K0:i/zodH+nb3U=:m+TSQS8rknoO9VBCES93d4 PYE80J6PusqoNPkxydl+ScNwDE8VX4LcABau2pwO3/N8LpFnSdll1HXyTqj63FB/sjkdKEK5M S4DkWRYvLEJ5mslc9bNl2LzhboYlxvd5S0X0cPMtu9F6d5GF+HYBMOk9D9eSst0uWkOBP67zO EpcA7b9tSWzxFJzkZ57Wzsw5/YwWuzTAL26bgE6gCy9nJw1hgU9djSekn1N4AaKKS7xZ70nRp AKaSN5yv7TGaPxEPF0nTln5A5WrL5ERo1q3A0H5/TvyX7mCAca/anRHDc3psnIR+8BRm5SkJk 8ixH/eUCzoPqFxXqzLUICpsZoTKSjPpk9JmiB9gNN8z32vWZNukavCZ3ur1AAUvyD1J54oPHA 818+NAND04LKifRhfrtlWbrae/5Zjoqv3dxFzvtCsPeIq7Yuo+ddXLv4oKdIlpGuLZ/bYje2F jEwkYw6qiSiz31ep5UECzafoeM7zrUTm1sXJKjo6r9616ANEM+xAJ9bvB/v2PDbYcO0YQcvtK 1/Tmg/CBwHcqz0jChhnuI/XDZwu/FNiNRk6WuJm+aG6IC1RARXKOENZROatXAvMmjR40KAGBy dev7msYnyBHagiciMRLfyk+yzf9L/sAzmuT8Sf+E46/bq/jJc/SBhLXUTiAgrPKYI/9IK5l9U dsmNs+BwhYk8dZAH9V6EO76UpB5RNnq6tgsqlGL4dJSiW8tWbZDNMpPDKP137fR4fQP8i0tyD GVPTJHtDbi8YvJFmZnE47sFuxCeLI9NX5GSebwjJ+ZigYU4zKawFT9NFh3c=
X-IsSubscribed: yes

Hi Brian,

Am 13.12.2017 um 06:21 schrieb Brian Inglis:
> On 2017-12-04 18:23, Thomas Taylor wrote:
>> I want to use multibyte UTF-8 characters in 64-bit Cygwin under Windows 7.  The
>> "vim" editor running in mintty displays the two-byte characters correctly, but
>> not the three- (and I assume four-) byte characters, which instead display as
>> rectangular filled-in blocks.  The "less" program doesn't even display two-byte
>> characters correctly, but instead displays them as <A1> to <FF>, depending on
>> the character in question, in reverse color in the terminal window.  The "cat"
>> program is even worse, replacing every two-byte character with a character that
>> looks like three horizontal bars stacked one above the other.  I've read the
>> "Internationalization" page in the Cygwin online manual, but am still baffled.
>> My LANG environment variable is set to "en_US.UTF-8".  Can anyone help?
> Your Windows Regional settings and your mintty/Options/Text/Language and
> Character Set should be set to match.
> The profile commands below set Cygwin locale to your Windows Regional settings
> and charset to UTF-8, or Unix locale to your system locale.
> Otherwise your system or mintty is going to be doing conversions on each character.
I am not aware that mintty character display and Windows regional 
settings would interfere in any way you indicated.
Can you elaborate on this please?
Thomas

> # Set user-defined locale
> locale -fU > /dev/null 2>&1     \
>          && LC_ALL=$(locale -fU) \
>          || LC_ALL=$(locale |    \
>                  sed '/^LANG=\|^LC_CTYPE=\|^LC_ALL=/{s///;h};$!d;x;s/"//g')
>


--
Problem reports:       http://cygwin.com/problems.html
FAQ:                   http://cygwin.com/faq/
Documentation:         http://cygwin.com/docs.html
Unsubscribe info:      http://cygwin.com/ml/#unsubscribe-simple

