Mail Archives: djgpp-workers/1999/04/14/23:34:15
> 3) I18N
> ISO C/amd 1, wide chars, multibyte strings. Boring work not
> use widely, the commitees are still fighting for the *right*
> set of chars, UTF ? Unicode ? pfff .. !
I started on wide chars a while back, but other things then grabbed my
attention. Eventually, if no one else does it, I'll start on it again
eventually. The first thing that needs to be done is to make wchar_t an
unsigned long to match the 32-bit length of the UCS-4 Unicode
character and change wint_t to match. UTF-8 is a good candidate for
the multibyte encoding since it's in wide use already (its the charset of
HTML 4 for example) and converting between UCS-4 and UTF-8 is not
difficult.
---
Mark Elbrecht, snowball3 AT usa DOT net
http://snowball.digitalspace.net/
- Raw text -