delorie.com/archives/browse.cgi   search  
Mail Archives: cygwin/2011/12/07/12:38:56

X-Recipient: archive-cygwin AT delorie DOT com
X-Spam-Check-By: sourceware.org
Date: Wed, 7 Dec 2011 18:38:08 +0100
From: Corinna Vinschen <corinna-cygwin AT cygwin DOT com>
To: cygwin AT cygwin DOT com
Subject: Re: [ANNOUNCEMENT] CALL FOR TESTING: Cygwin 1.7.10
Message-ID: <20111207173808.GA25743@calimero.vinschen.de>
Reply-To: cygwin AT cygwin DOT com
Mail-Followup-To: cygwin AT cygwin DOT com
References: <announce DOT 20111206093746 DOT GA6222 AT calimero DOT vinschen DOT de> <566vd7hfmi3j980ic4m64d7bv91b5qm6uh AT 4ax DOT com>
MIME-Version: 1.0
In-Reply-To: <566vd7hfmi3j980ic4m64d7bv91b5qm6uh@4ax.com>
User-Agent: Mutt/1.5.21 (2010-09-15)
Mailing-List: contact cygwin-help AT cygwin DOT com; run by ezmlm
List-Id: <cygwin.cygwin.com>
List-Unsubscribe: <mailto:cygwin-unsubscribe-archive-cygwin=delorie DOT com AT cygwin DOT com>
List-Subscribe: <mailto:cygwin-subscribe AT cygwin DOT com>
List-Archive: <http://sourceware.org/ml/cygwin/>
List-Post: <mailto:cygwin AT cygwin DOT com>
List-Help: <mailto:cygwin-help AT cygwin DOT com>, <http://sourceware.org/ml/#faqs>
Sender: cygwin-owner AT cygwin DOT com
Mail-Followup-To: cygwin AT cygwin DOT com
Delivered-To: mailing list cygwin AT cygwin DOT com

On Dec  7 18:00, Bengt Larsson wrote:
> Corinna Vinschen wrote:
> 
> >- cygwin_conv_path and cygwin_conv_path_list: In CCP_WIN_A_TO_POSIX and
> >  CCP_POSIX_TO_WIN_A conversions, use the current Windows ANSI or OEM
> >  charset, depending on the return value of AreFileApisANSI.  Up to Cygwin
> >  1.7.9, both conversions used the current Cygwin charset for the conversion.
> 
> Is that the right thing to do? I have LANG=C.UTF-8. If I pass a
> Windows-style filename on the command line, it's passed as UTF-8. How do
> I then convert that to Unix-style, UTF-8?

First of all, don't do that.  Use POSIX paths.

Second, it's not passed as UTF-8 if the called application is a
non-Cygwin application.  In fact, Cygwin calls CreateProcessW, so all
strings are converted to UTF-16 (aka UNICODE) when starting a non-Cygwin
child process.

Third, as for Cygwin apps, don't use WIN_A, use WIN_W instead, because
that's encoding agnostic:

  main (int argc, char **argv)
  {
    size_t len;
    wchar_t *w32_path;
    char *psx_path;

    setlocale (LC_ALL, "");

    len = mbstowcs (NULL, argv[1], 0) + 1;
    w32_path = (wchar_t *) malloc (len * sizeof (wchar_t));
    mbstowcs (w32_path, argv[1], len);
    psx_path = (char *) cygwin_create_path (CCP_WIN_W_TO_POSIX, w32_path);
    [...]
  }


Corinna

-- 
Corinna Vinschen                  Please, send mails regarding Cygwin to
Cygwin Project Co-Leader          cygwin AT cygwin DOT com
Red Hat

--
Problem reports:       http://cygwin.com/problems.html
FAQ:                   http://cygwin.com/faq/
Documentation:         http://cygwin.com/docs.html
Unsubscribe info:      http://cygwin.com/ml/#unsubscribe-simple

- Raw text -


  webmaster     delorie software   privacy  
  Copyright © 2019   by DJ Delorie     Updated Jul 2019