From: lemkemch AT t-online DOT de
To: "cygwin AT cygwin DOT com"
Date: Fri, 27 Nov 2009 23:12:34 +0100
Subject: [1.7] Accented characters don't work

What am I doing wrong with my first tries of 1.7?

In Windows Explorer I created a directory Ébène and, in it, a file très. When I look at it with ls in an rxvt window I don't see the accented characters, only the two UTF-8 bytes for each. Hm.

I then created the same directory from tcsh (my standard shell) and from bash, and the result is really weird: when I look at those directories in Windows Explorer, they have Chinese characters in place of the accented ones.

This is with all LC_* variables and LANG unset, on XP Pro SP3 in a pure US English locale (Windows-wise). The 1.7 installation is parallel to a 1.5 install, but I use the same home directory and therefore the same .tcshrc for both.

Here's the summary (from a tcsh shell):

orion> ls -Rl
.:
total 0
drwxr-x---+ 1 michael None 0 2009-11-27 23:03 Ébène_made_with_explorer
drwxr-x---+ 1 michael None 0 2009-11-27 01:40 ?b?ne_made_with_bash
drwxr-x---+ 1 michael None 0 2009-11-27 01:32 ?b?ne_made_with_tcsh

./Ébène_made_with_explorer:
total 0
-rwxr-x--- 1 michael None 0 2009-11-27 01:39 très

./?b?ne_made_with_bash:
total 0

./?b?ne_made_with_tcsh:
total 0
-rw-r----- 1 michael None 0 2009-11-27 01:31 stuff
-rw-r----- 1 michael None 0 2009-11-27 01:32 tr?s

orion> ls -R | od -xc
0000000    3a2e    c30a    6289    a8c3    656e    6d5f    6461    5f65
          .   :  \n 303 211   b 303 250   n   e   _   m   a   d   e   _
0000020    6977    6874    655f    7078    6f6c    6572    0a72    62c9
          w   i   t   h   _   e   x   p   l   o   r   e   r  \n 311   b
0000040    6ee8    5f65    616d    6564    775f    7469    5f68    6162
        350   n   e   _   m   a   d   e   _   w   i   t   h   _   b   a
0000060    6873    c90a    e862    656e    6d5f    6461    5f65    6977
          s   h  \n 311   b 350   n   e   _   m   a   d   e   _   w   i
0000100    6874    745f    7363    0a68    2e0a    c32f    6289    a8c3
          t   h   _   t   c   s   h  \n  \n   .   / 303 211   b 303 250
0000120    656e    6d5f    6461    5f65    6977    6874    655f    7078
          n   e   _   m   a   d   e   _   w   i   t   h   _   e   x   p
0000140    6f6c    6572    3a72    740a    c372    73a8    0a0a    2f2e
          l   o   r   e   r   :  \n   t   r 303 250   s  \n  \n   .   /
0000160    62c9    6ee8    5f65    616d    6564    775f    7469    5f68
        311   b 350   n   e   _   m   a   d   e   _   w   i   t   h   _
0000200    6162    6873    0a3a    2e0a    c92f    e862    656e    6d5f
          b   a   s   h   :  \n  \n   .   / 311   b 350   n   e   _   m
0000220    6461    5f65    6977    6874    745f    7363    3a68    730a
          a   d   e   _   w   i   t   h   _   t   c   s   h   :  \n   s
0000240    7574    6666    740a    e872    0a73
          t   u   f   f  \n   t   r 350   s  \n
0000252

orion> stty -a
speed 38400 baud; rows 52; columns 80; line = 0;
intr = ^C; quit = ^Y; erase = ^?; kill = ^U; eof = ^Z; eol = ; eol2 = ;
swtch = ; start = ^Q; stop = ^S; susp = ^D; rprnt = ^R; werase = ^W;
lnext = ^V; flush = ^O; min = 1; time = 0;
-parenb -parodd cs8 -hupcl -cstopb cread -clocal -crtscts
-ignbrk brkint ignpar -parmrk -inpck -istrip -inlcr -igncr icrnl ixon -ixoff
-iuclc -ixany imaxbel
opost -olcuc -ocrnl onlcr -onocr -onlret -ofill -ofdel nl0 cr0 tab0 bs0 vt0 ff0
isig icanon iexten echo echoe echok -echonl -noflsh -tostop echoctl echoke

orion> uname -a
CYGWIN_NT-5.1 orion 1.7.0(0.218/5/3) 2009-11-25 13:41 i686 Cygwin
orion>

There's more strange stuff going on with wildcards, but I don't think we should go into that at this point. Something basic must be wrong here.

Any ideas?

Michael
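
One way to read the od dump above is to decode the byte values it shows. The following is only a sketch in Python, not something from the original session. It assumes the octal bytes are copied correctly from the dump; the helper name try_decode and the choice of ISO-8859-1 as the second candidate encoding are illustrative assumptions. It does not explain what Windows Explorer displays.

```python
# Byte sequences copied from the `ls -R | od -xc` dump, using the same
# octal escapes that od printed.
explorer_name = b"\303\211b\303\250ne_made_with_explorer"
shell_name = b"\311b\350ne_made_with_bash"


def try_decode(raw, encoding):
    """Hypothetical helper: return the decoded text, or None if the
    bytes are not valid in the given encoding."""
    try:
        return raw.decode(encoding)
    except UnicodeDecodeError:
        return None


explorer_utf8 = try_decode(explorer_name, "utf-8")
shell_utf8 = try_decode(shell_name, "utf-8")
shell_latin1 = try_decode(shell_name, "iso-8859-1")

# ascii() keeps the output independent of the terminal's own encoding.
print("explorer name as UTF-8:   ", ascii(explorer_utf8))
print("shell name as UTF-8:      ", ascii(shell_utf8))
print("shell name as ISO-8859-1: ", ascii(shell_latin1))
```

Under these assumptions, the Explorer-made name decodes cleanly as UTF-8, while the shell-made name is not valid UTF-8 and only decodes as a single-byte encoding such as ISO-8859-1. That would mean the two names are stored with different byte encodings for the same accented letters.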