X-Recipient: archive-cygwin@delorie.com
X-SWARE-Spam-Status: No, hits=-2.4 required=5.0 	tests=AWL,BAYES_00,SPF_HELO_PASS,SPF_PASS
X-Spam-Check-By: sourceware.org
Message-ID: <26777598.post@talk.nabble.com>
Date: Mon, 14 Dec 2009 05:19:24 -0800 (PST)
From: =?UTF-8?Q?LiuYan_=E5=88=98=E7=A0=94?= <lovetide@21cn.com>
To: cygwin@cygwin.com
Subject: Re: Cygwin 1.7: Empty/white-space output when display Chinese      characters in GBK charset encoding?
In-Reply-To: <20091214123509.GI8059@calimero.vinschen.de>
MIME-Version: 1.0
Content-Type: text/plain; charset=us-ascii
Content-Transfer-Encoding: 7bit
References: <26774467.post@talk.nabble.com> <20091214105915.GF8059@calimero.vinschen.de> <26776766.post@talk.nabble.com> <20091214123509.GI8059@calimero.vinschen.de>
X-IsSubscribed: yes
Mailing-List: contact cygwin-help@cygwin.com; run by ezmlm
Precedence: bulk
List-Id: <cygwin.cygwin.com>
List-Unsubscribe: <mailto:cygwin-unsubscribe-archive-cygwin=delorie.com@cygwin.com>
List-Subscribe: <mailto:cygwin-subscribe@cygwin.com>
List-Archive: <http://sourceware.org/ml/cygwin/>
List-Post: <mailto:cygwin@cygwin.com>
List-Help: <mailto:cygwin-help@cygwin.com>, <http://sourceware.org/ml/#faqs>
Sender: cygwin-owner@cygwin.com
Mail-Followup-To: cygwin@cygwin.com
Delivered-To: mailing list cygwin@cygwin.com


The only unexpected thing is the empty output of GBK-encoded chinese
characters when LANG is xxxx.UTF-8.

I mean it should display something visible(malformed characters, squares or
whatever visible characters) regardless the LANG setting.

echo -e "Chinese characters in GBK encoding: [\xba\xba\xd7\xd6]  -- Output
will be [    ] in Cygwin 1.7"
echo -e "Chinese characters in UTF-8 encoding: [\xe6\xb1\x89\xe5\xad\x97]"

Cygwin windows is a DOS prompt window, and this window has a code page, it's
code page 936(GBK) in Simplified Chinese Windows OS. But Cygwin 1.7 is
*simulating* UTF-8 environment, will this difference cause the empty output
issue?


Corinna Vinschen-2 wrote:
> 
> On Dec 14 04:07, LiuYan ?????? wrote:
>> 
>> Hi Corinna:
>>   I didn't set LANG environment variable, and the result of 'export'
>> command
>> is:
>> ...
>> declare -x LANG="C.UTF-8"
>> ...
>> 
>>   And, if I change LANG to zh_CN.GBK, the cc-gbk.txt file can be
>> displayed
>> correctly (act same behaviour like Cygwin 1.5).
> 
> So everything's as expected.  Good to know.
> 
> 
> Corinna
> 
> -- 
> Corinna Vinschen                  Please, send mails regarding Cygwin to
> Cygwin Project Co-Leader          cygwin AT cygwin DOT com
> Red Hat
> 
> --
> Problem reports:       http://cygwin.com/problems.html
> FAQ:                   http://cygwin.com/faq/
> Documentation:         http://cygwin.com/docs.html
> Unsubscribe info:      http://cygwin.com/ml/#unsubscribe-simple
> 
> 
> 

-- 
View this message in context: http://old.nabble.com/Cygwin-1.7%3A-Empty-white-space-output-when-display-Chinese-characters-in-GBK-charset-encoding--tp26774467p26777598.html
Sent from the Cygwin list mailing list archive at Nabble.com.


--
Problem reports:       http://cygwin.com/problems.html
FAQ:                   http://cygwin.com/faq/
Documentation:         http://cygwin.com/docs.html
Unsubscribe info:      http://cygwin.com/ml/#unsubscribe-simple

