Mailing-List: contact cygwin-help AT cygwin DOT com; run by ezmlm
Sender: cygwin-owner AT cygwin DOT com
Subject: Re: Need help with multibyte UTF-8 characters
To: cygwin AT cygwin DOT com
References: <626a3c06-e9f2-1932-f1f3-47ddb2051215 AT gmail DOT com>
From: Thomas Wolff
Message-ID: <89a7d883-e3e5-42d7-2d22-020cdf6ccbdf@towo.net>
Date: Wed, 13 Dec 2017 08:50:15 +0100
MIME-Version: 1.0
Content-Type: text/plain; charset=windows-1252; format=flowed
Content-Transfer-Encoding: 8bit

Hi Brian,

On 13.12.2017 at 06:21, Brian Inglis wrote:
> On 2017-12-04 18:23, Thomas Taylor wrote:
>> I want to use multibyte UTF-8 characters in 64-bit Cygwin under Windows 7.  The
>> "vim" editor running in mintty displays the two-byte characters correctly, but
>> not the three- (and I assume four-) byte characters, which instead display as
>> rectangular filled-in blocks.  The "less" program doesn't even display two-byte
>> characters correctly, but instead displays them as to , depending on
>> the character in question, in reverse color in the terminal window.  The "cat"
>> program is even worse, replacing every two-byte character with a character that
>> looks like three horizontal bars stacked one above the other.  I've read the
>> "Internationalization" page in the Cygwin online manual, but am still baffled.
>> My LANG environment variable is set to "en_US.UTF-8".  Can anyone help?
> Your Windows Regional settings and your mintty/Options/Text/Language and
> Character Set should be set to match.
> The profile commands below set Cygwin locale to your Windows Regional settings
> and charset to UTF-8, or Unix locale to your system locale.
> Otherwise your system or mintty is going to be doing conversions on each character.

I am not aware that mintty character display and Windows regional settings
would interfere in any way you indicated. Can you elaborate on this please?

Thomas

> # Set user-defined locale
> locale -fU > /dev/null 2>&1 \
>     && LC_ALL=$(locale -fU) \
>     || LC_ALL=$(locale | \
>         sed '/^LANG=\|^LC_CTYPE=\|^LC_ALL=/{s///;h};$!d;x;s/"//g')
>

--
Problem reports:       http://cygwin.com/problems.html
FAQ:                   http://cygwin.com/faq/
Documentation:         http://cygwin.com/docs.html
Unsubscribe info:      http://cygwin.com/ml/#unsubscribe-simple