Mail Archives: cygwin/2010/01/24/04:55:53
On Jan 24 10:37, Corinna Vinschen wrote:
> On Jan 23 22:31, Andy Koppe wrote:
> > Corinna Vinschen:
> > > I applied a patch which handles the characters 0x5c and 0cfe differently
> > > if the charset is set to "SJIS"
> >
> > Something's going seriously wrong with this, and I'd suspect it's to
> > do with turning backslashes into yen symbols.
>
> Right. It occured to me tonight that this will not work from a
> filesystem point-of-view. The people who decided to overload backslash
> and tilde in the ASCII range with different symbols in SJIS still need
> some serious knock on their heads. No wonder the Microsoft guys kept
> the binary values of characters intact, especially due to the backslash
> problem.
>
> > Not sure what could be done about it. Remove SJIS support in favour of CP932?
>
> In theory, we could be able to keep SJIS support in. The
> Cygwin-internal function converting multibyte strings to Unicode
> filenames would have to use CP932. Only on the application level the
> conversion would use SJIS.
>
> There's no system API which takes wchar_t strings, so all strings are
> exchanged between application and system using multibyte strings. Since
> the multibytes strings are the same, that should give a round-trip which
> still works for Win32 filenames:
>
> Input string: "\x5e\xfe"
>
> Application: mbstowcs ("\x5e\xfe") ==> L"\x00a5\x203e"
> wcstombs (L"\x00a5\x203e") ==> "x5e\xfe"
>
> Cygwin sys_mbstowcs ("\x5e\xfe") ==> L"\x005e\x007e"
> sys_wcstombs (L"\x005e\x007e") ==> "x5e\xfe"
...and, if we implement it that way, do we really still need support
for a "CP932" charset?
--
Corinna Vinschen Please, send mails regarding Cygwin to
Cygwin Project Co-Leader cygwin AT cygwin DOT com
Red Hat
--
Problem reports: http://cygwin.com/problems.html
FAQ: http://cygwin.com/faq/
Documentation: http://cygwin.com/docs.html
Unsubscribe info: http://cygwin.com/ml/#unsubscribe-simple
- Raw text -