delorie.com/archives/browse.cgi   search  
Mail Archives: cygwin/2013/09/24/16:01:25

X-Recipient: archive-cygwin AT delorie DOT com
DomainKey-Signature: a=rsa-sha1; c=nofws; d=sourceware.org; h=list-id
:list-unsubscribe:list-subscribe:list-archive:list-post
:list-help:sender:message-id:date:from:mime-version:to:subject
:content-type:content-transfer-encoding; q=dns; s=default; b=ad0
FAmd8zXEbY6a+w7UgRhDZu6o4VFUo9V8wzcWXBE+yVoUXtJ2umiuuZntPP68TSt9
ljFWZft69GPfOBHxosqbUgxcQ5VcXUr9P208+NX5jF8OWeQFwKThmNgETwOEXapk
kd63nMwQGkSBoGbVvscjcz5Mv49JZrX9Vw1mtiqk=
DKIM-Signature: v=1; a=rsa-sha1; c=relaxed; d=sourceware.org; h=list-id
:list-unsubscribe:list-subscribe:list-archive:list-post
:list-help:sender:message-id:date:from:mime-version:to:subject
:content-type:content-transfer-encoding; s=default; bh=UflzYNEpV
NcU31E1DnO112jStbc=; b=BfMfpHyB9Nw1wasv32dqkzhi/eZ2ij4wO53WwRkJT
NfmnMr/aMq8bspW6wlGqI1VMv+Ro4v0Wm970J/kX8oj9JlGrv9SaccEReaCq1S1Y
uwbrDlbUlFb+PGw+d/e6azSKwlBHyY0nXfbq8WyIhbctX1pYjaiy359SmjZm0RSG
sg=
Mailing-List: contact cygwin-help AT cygwin DOT com; run by ezmlm
List-Id: <cygwin.cygwin.com>
List-Subscribe: <mailto:cygwin-subscribe AT cygwin DOT com>
List-Archive: <http://sourceware.org/ml/cygwin/>
List-Post: <mailto:cygwin AT cygwin DOT com>
List-Help: <mailto:cygwin-help AT cygwin DOT com>, <http://sourceware.org/ml/#faqs>
Sender: cygwin-owner AT cygwin DOT com
Mail-Followup-To: cygwin AT cygwin DOT com
Delivered-To: mailing list cygwin AT cygwin DOT com
Authentication-Results: sourceware.org; auth=none
X-Virus-Found: No
X-Spam-SWARE-Status: No, score=-2.9 required=5.0 tests=ALL_TRUSTED,AWL,BAYES_40,RP_MATCHES_RCVD autolearn=ham version=3.3.2
X-HELO: smtp-vbr2.xs4all.nl
Message-ID: <5241EF7D.9080309@xs4all.nl>
Date: Tue, 24 Sep 2013 22:01:01 +0200
From: Erwin Waterlander <waterlan AT xs4all DOT nl>
User-Agent: Mozilla/5.0 (Windows NT 6.0; rv:17.0) Gecko/20130801 Thunderbird/17.0.8
MIME-Version: 1.0
To: cygwin AT cygwin DOT com
Subject: How does Cygwin handle non-Latin1 man pages? (move to UTF-8?)
X-IsSubscribed: yes

Hi,

As far as I see it, on Cygwin it is assumed that man pages are encoded 
in Latin-1 (ISO-8859-1).
For instance the man pages of vim.

/usr/share/man/fr/vim.1.gz is encoded in Latin-1.

$ export LANG=fr_FR.UTF-8
$ man vim

This will show the French man page correctly. Latin-1 is converted to UTF-8.

For the Russian translation of the vim manual I see two files:
/usr/share/man/ru.UTF-8/man1/vim.1.gz
/usr/share/man/ru.KOI8-R/man1/vim.1.gz


When I type
$ export LANG=ru_RU.UTF-8
$ man vim

I get the English man page, instead of the Russian man page.
I think because there is no /usr/share/man/ru/man1/vim.1.gz present.

When I type

$ export LANG=ru_RU.UTF-8
$ export LANGUAGE=ru.UTF-8
$ man vim

The Russian man page is displayed, but all Russian characters are 
wrongly displayed.
I think because it is assumed the man page is in Latin-1.

To get a correct display of the Russian man page I need to change 
/etc/man.config
I change the line with NROFF to:
NROFF         /usr/bin/preconv | /usr/bin/nroff -c -mandoc 2>/dev/null

Now the Russian man page displays correctly, but now all the Latin-1 
pages display wrongly.

So I undo my change in /etc/man.conf


On Linux the trend is to convert all man pages to UTF-8 encoding.
Will Cygwin follow this trend?

regards,

-- 
Erwin Waterlander
http://waterlan.home.xs4all.nl/


--
Problem reports:       http://cygwin.com/problems.html
FAQ:                   http://cygwin.com/faq/
Documentation:         http://cygwin.com/docs.html
Unsubscribe info:      http://cygwin.com/ml/#unsubscribe-simple

- Raw text -


  webmaster     delorie software   privacy  
  Copyright © 2019   by DJ Delorie     Updated Jul 2019