Mailing-List: contact cygwin-help@cygwin.com; run by ezmlm
List-Subscribe: <mailto:cygwin-subscribe@cygwin.com>
List-Archive: <http://sources.redhat.com/ml/cygwin/>
List-Post: <mailto:cygwin@cygwin.com>
List-Help: <mailto:cygwin-help@cygwin.com>, <http://sources.redhat.com/ml/#faqs>
Sender: cygwin-owner@cygwin.com
Mail-Followup-To: cygwin@cygwin.com
Delivered-To: mailing list cygwin@cygwin.com
To: cygwin@cygwin.com
Subject: Re: Non-trivial Cyrillic question
References: <400D3F1B.4010504@contact.bg>
From: Roman Belenov <rbelenov@yandex.ru>
Date: Tue, 20 Jan 2004 18:27:18 +0300
In-Reply-To: <400D3F1B.4010504@contact.bg> (Alexander Shopov's message of "Tue, 20 Jan 2004 16:45:47 +0200")
Message-ID: <ufzeabpyh.fsf@intel.com>
User-Agent: Gnus/5.1006 (Gnus v5.10.6) Emacs/21.2 (windows-nt)
MIME-Version: 1.0
Content-Type: text/plain; charset=us-ascii
X-Scanned-By: MIMEDefang 2.31 (www . roaringpenguin . com / mimedefang)

Alexander Shopov <ash@contact.bg> writes:

> However when I pipe it to less
>
> cat myfile | less
>
> it gets broken
> I get things like: <CF><D0><C5><CF><C8><D1><CA><C0>

You should tell less what characters are supposed to be printable
using LESSCHARSET or LESSCHARDEF environment variable (man less for
details).

> ls *
>
> it is displayed like
> ????.txt

ls uses current locale to determine which characters are
printable. AFAIK cygwin isn't locale-aware (I can be mistaken); the
possible workaround is to use --show-control-chars switch to print all
characters literally.

BTW Is cp1251 the only 8-bit encoding used by Bulgarian locale in MS
Windows ? In Russia two encoding are used - cp866 as OEM (used, e.g.,
in console), cp1251 as ANSI.

-- 
 							With regards, Roman.


--
Unsubscribe info:      http://cygwin.com/ml/#unsubscribe-simple
Problem reports:       http://cygwin.com/problems.html
Documentation:         http://cygwin.com/docs.html
FAQ:                   http://cygwin.com/faq/

