X-Recipient: archive-cygwin AT delorie DOT com X-SWARE-Spam-Status: No, hits=-1.9 required=5.0 tests=BAYES_00,SPF_NEUTRAL X-Spam-Check-By: sourceware.org Message-ID: <4ACB6309.9020609@cornell.edu> Date: Tue, 06 Oct 2009 11:32:25 -0400 From: Ken Brown User-Agent: Thunderbird 2.0.0.22 (Windows/20090605) MIME-Version: 1.0 To: cygwin AT cygwin DOT com Subject: Re: [ANNOUNCEMENT] [1.7] Updated: cygwin-1.7.0-62 References: In-Reply-To: Content-Type: multipart/mixed; boundary="------------040002080806050804010804" X-IsSubscribed: yes Mailing-List: contact cygwin-help AT cygwin DOT com; run by ezmlm List-Id: List-Subscribe: List-Archive: List-Post: List-Help: , Sender: cygwin-owner AT cygwin DOT com Mail-Followup-To: cygwin AT cygwin DOT com Delivered-To: mailing list cygwin AT cygwin DOT com Note-from-DJ: This may be spam --------------040002080806050804010804 Content-Type: text/plain; charset=ISO-8859-1; format=flowed Content-Transfer-Encoding: 7bit On 10/3/2009 9:59 AM, Corinna Vinschen wrote: > Apart from bugfixes, this patch contains a change to the > internationalization efforts in Cygwin which cristalized out of a couple > of longish discussions on the cygwin and cygwin-developer lists. > > Here's how it's supposed to work in future: [...] > - The "C" locale's default charset is UTF-8. Does this mean that non-ASCII characters are supposed to display OOTB, or is some user configuration expected? Here's a test case. I've tried to view the attached file (extracted from the output of fc-list) in various ways, and here's what I've found (running XP in the U.S., with no language-related customization): - Using emacs under X, emacs recognizes the file as UTF-8 and displays the foreign characters correctly. - 'cat temp.txt' in the cygwin console produces lots of question marks. - 'cat temp.txt' in xterm or mintty produces lots of garbage. The garbage changes in mintty if I change the choice of codepage in the options, but I haven't been able to get rid of the garbage. - If I set LANG=C.UTF-8 before starting xterm, I get correct display of the foreign characters as in emacs (under X). But this doesn't seem to work for the cygwin console or mintty (or at least I haven't figured out how to make it work). Ken P.S. This post is related to the discussion started in http://cygwin.com/ml/cygwin-developers/2009-10/msg00062.html. But I'm approaching the question as a user, so I didn't think I should reply there. (I'm not subscribed anyway.) --------------040002080806050804010804 Content-Type: text/plain; name="temp.txt" Content-Transfer-Encoding: 8bit Content-Disposition: inline; filename="temp.txt" obyčejné Κανονικά Normál Обычный Normálne --------------040002080806050804010804 Content-Type: text/plain; charset=us-ascii -- Problem reports: http://cygwin.com/problems.html FAQ: http://cygwin.com/faq/ Documentation: http://cygwin.com/docs.html Unsubscribe info: http://cygwin.com/ml/#unsubscribe-simple --------------040002080806050804010804--