delorie.com/archives/browse.cgi   search  
Mail Archives: cygwin/2010/06/29/08:52:25

X-Recipient: archive-cygwin AT delorie DOT com
X-SWARE-Spam-Status: No, hits=0.2 required=5.0 tests=AWL,BAYES_00,DKIM_SIGNED,DKIM_VALID,DKIM_VALID_AU,FREEMAIL_FROM,MIME_CHARSET_FARAWAY,RCVD_IN_DNSWL_NONE,SARE_SUB_ENC_UTF8,TW_BJ,TW_XD,T_TO_NO_BRKTS_FREEMAIL
X-Spam-Check-By: sourceware.org
MIME-Version: 1.0
In-Reply-To: <i0cb2o$m64$1@dough.gmane.org>
References: <hvn70o$n5q$1 AT dough DOT gmane DOT org> <AANLkTimrIRAaUp-xP8ZUe9ew7Wob3EBiUL-hkv-VjY2a AT mail DOT gmail DOT com> <i0cb2o$m64$1 AT dough DOT gmane DOT org>
Date: Tue, 29 Jun 2010 14:52:11 +0200
Message-ID: <AANLkTil80itd3rHEYl5eAJ6Bfra-Mw-aNunC04canB1-@mail.gmail.com>
Subject: Re: LANG=ru_RU.cp1251, but binutils use UTF-8.
From: Csaba Raduly <rcsaba AT gmail DOT com>
To: cygwin AT cygwin DOT com
Mailing-List: contact cygwin-help AT cygwin DOT com; run by ezmlm
List-Id: <cygwin.cygwin.com>
List-Unsubscribe: <mailto:cygwin-unsubscribe-archive-cygwin=delorie DOT com AT cygwin DOT com>
List-Subscribe: <mailto:cygwin-subscribe AT cygwin DOT com>
List-Archive: <http://sourceware.org/ml/cygwin/>
List-Post: <mailto:cygwin AT cygwin DOT com>
List-Help: <mailto:cygwin-help AT cygwin DOT com>, <http://sourceware.org/ml/#faqs>
Sender: cygwin-owner AT cygwin DOT com
Mail-Followup-To: cygwin AT cygwin DOT com
Delivered-To: mailing list cygwin AT cygwin DOT com

On Tue, Jun 29, 2010 at 10:32 AM, Oleksandr Gavenko wrote:
> =9A$ objdump --help | head -n 2 | hexl
> 00000000: d098 d181 d0bf d0be d0bb d18c d0b7 d0be =9A................
(snipped)

These bytes sure look like UTF-8. According to
http://software.hixie.ch/utilities/cgi/unicode-decoder/utf8-decoder

U+0418 CYRILLIC CAPITAL LETTER I character (&#x0418;)
U+0441 CYRILLIC SMALL LETTER ES character (&#x0441;)
U+043F CYRILLIC SMALL LETTER PE character (&#x043F;)
U+043E CYRILLIC SMALL LETTER O character (&#x043E;)
U+043B CYRILLIC SMALL LETTER EL character (&#x043B;)
U+044C CYRILLIC SMALL LETTER SOFT SIGN character (&#x044C;)
U+0437 CYRILLIC SMALL LETTER ZE character (&#x0437;)
(snipped)

As raw characters:

=E9=D3=D0=CF=CC=D8=DA=CF=D7=C1=CE=C9=C5: objdump <=CF=D0=C3=C9=C9> <=C6=C1=
=CA=CC(=D9)>
 =EF=D4=CF=C2=D2=C1=D6=C1=C5=D4 =C9=CE=C6=CF=D2=CD=C1=C3=C9=C0 =C9=DA =CF=
=C2=DF=C5=CB=D4=C1 <=C6=C1=CA=CC(=D9)>.

But I can't get that output out of
CYGWIN_NT-6.0 1.7.5(0.225/5/3) 2010-04-12 19:07 i686 Cygwin
and
GNU objdump (GNU Binutils) 2.20.51.20100410

However, I did manage to get apparently correct cp1251 (see below;
also utf8 with LANG=3Dru_RU.utf8) out of
CYGWIN_NT-6.0 1.5.25(0.156/4/2) 2008-06-12 19:34 i686 Cygwin
and
GNU objdump (GNU Binutils) 2.18.50.20080625

14:40:47 $ LANG=3Dru_RU.cp1251 objdump --help | head -2 | hexdump -C
00000000  c8 f1 ef ee eb fc e7 ee  e2 e0 ed e8 e5 3a 20 6f  |.............:=
 o|
00000010  62 6a 64 75 6d 70 20 3c  ee ef f6 e8 e8 3e 20 3c  |bjdump <.....>=
 <|
(snip)

--=20
Life is complex, with real and imaginary parts.
"Ok, it boots. Which means it must be bug-free and perfect. " -- Linus Torv=
alds
"People disagree with me. I just ignore them." -- Linus Torvalds

--
Problem reports:       http://cygwin.com/problems.html
FAQ:                   http://cygwin.com/faq/
Documentation:         http://cygwin.com/docs.html
Unsubscribe info:      http://cygwin.com/ml/#unsubscribe-simple

- Raw text -


  webmaster     delorie software   privacy  
  Copyright © 2019   by DJ Delorie     Updated Jul 2019