X-Recipient: archive-cygwin AT delorie DOT com X-SWARE-Spam-Status: No, hits=-1.8 required=5.0 tests=AWL,BAYES_00,SARE_MSGID_LONG40,SPF_PASS X-Spam-Check-By: sourceware.org MIME-Version: 1.0 In-Reply-To: <4AA8DAA3.8060508@fmail.co.uk> References: <4AA8DAA3 DOT 8060508 AT fmail DOT co DOT uk> Date: Thu, 10 Sep 2009 17:45:31 +0100 Message-ID: <416096c60909100945x796ee335n2130b15359582b75@mail.gmail.com> Subject: Re: Help needed getting unicode working in bash From: Andy Koppe To: cygwin AT cygwin DOT com Content-Type: text/plain; charset=UTF-8 Content-Transfer-Encoding: quoted-printable X-IsSubscribed: yes Mailing-List: contact cygwin-help AT cygwin DOT com; run by ezmlm Precedence: bulk List-Id: List-Unsubscribe: List-Subscribe: List-Archive: List-Post: List-Help: , Sender: cygwin-owner AT cygwin DOT com Mail-Followup-To: cygwin AT cygwin DOT com Delivered-To: mailing list cygwin AT cygwin DOT com 2009/9/10 Kit Johnson: > Thanks so much for taking the time to help. =C2=A0This is the first time = I've > used a mailing list so I hope I've replied correctly. Yep, except you replied to me instead of the list. ;) > I understand charactersets and locales better now. =C2=A0I followed your > recommendations plus those of the cygwin FAQ and internationalistation > pages. > > However I still get > "ls: cannot access ?????????.xls: no such file or directory" > instead of Thai characters when I type 'ls' in bash. > I've researched the ISO 639-1 and ISO 3166 codes for my location, and wou= ld > like to use UTF-8. > > Here are the contents of my cygwin.bat file: > @echo off > > C: > chdir C:\cygwin\bin > set LC_CTYPE=3Dth_TH.UTF-8 > bash --login -i Hmm, that should do the job. Are you running the Cygwin 1.7 beta? 1.5 doesn't support locales. ('uname -r' will tell you.) > in my .bashrc file: > export LANG=3D"th_TH.UTF-8" You could set LANG instead of LC_CTYPE in cygwin.bat. The difference is that setting LANG affects all locale-specific behaviours, e.g. it will enable Thai user interfaces and messages in programs that have translations for it. LC_CTYPE only sets the encoding and a couple of other things regarding character processing. If LANG is set, you don't need LC_CTYPE. > export OUTPUT_CHARSET=3D"UTF-8" I don't know whether anything actually uses this. You'd probably be fine without it. Andy -- Problem reports: http://cygwin.com/problems.html FAQ: http://cygwin.com/faq/ Documentation: http://cygwin.com/docs.html Unsubscribe info: http://cygwin.com/ml/#unsubscribe-simple