delorie.com/archives/browse.cgi   search  
Mail Archives: djgpp-workers/2005/05/21/12:37:22

X-Authentication-Warning: delorie.com: mail set sender to djgpp-workers-bounces using -f
From: <ams AT ludd DOT ltu DOT se>
Message-Id: <200505211222.j4LCMQKW025118@speedy.ludd.ltu.se>
Subject: Re: wchar_t implementation and multibyte encoding
In-Reply-To: <428F543B.2060801@phekda.gotadsl.co.uk> "from Richard Dawe at May
21, 2005 04:31:07 pm"
To: djgpp-workers AT delorie DOT com
Date: Sat, 21 May 2005 14:22:26 +0200 (CEST)
X-Mailer: ELM [version 2.4ME+ PL78 (25)]
MIME-Version: 1.0
X-ltu-MailScanner-Information: Please contact the ISP for more information
X-ltu-MailScanner: Found to be clean
X-MailScanner-From: ams AT ludd DOT ltu DOT se
Reply-To: djgpp-workers AT delorie DOT com
Errors-To: nobody AT delorie DOT com
X-Mailing-List: djgpp-workers AT delorie DOT com
X-Unsubscribes-To: listserv AT delorie DOT com

According to Richard Dawe:
> You're confusing the codepoint, which is the numbering of characters, 
                                               ^^^^^^^^^^^^^^^^^^^^^^^^
> symbols, etc. with how you represent them. The codepoints are abstract.
  ^^^^^^^^^^^^

> When you talk about "Unicode encoding", this is UTF-32, a mapping of 
> 0x10ffff to a 32-bit integer. That may not seem like an encoding, but it 
> is, because of endianness in the encoded data.

Ok.

1. But suppose I decide to use the inverted Unicode codepoints (IUC),
which I just invented, where 
"IUC character value" == 0x10ffff - "Unicode chararcter value".

Now I have a different set of codepoints. To me, IUC and Unicode are
two different encodings (of characters).

2. I which way _isn't_ Unicode a "numbering of characters, symbols,
etc"?


Right,

						MartinS

- Raw text -


  webmaster     delorie software   privacy  
  Copyright © 2019   by DJ Delorie     Updated Jul 2019