Mail Archives: djgpp-workers/2005/05/21/12:37:22


X-Authentication-Warning: delorie.com: mail set sender to djgpp-workers-bounces using -f

From: <ams AT ludd DOT ltu DOT se>

Message-Id: <200505211222.j4LCMQKW025118@speedy.ludd.ltu.se>

Subject: Re: wchar_t implementation and multibyte encoding

In-Reply-To: <428F543B.2060801@phekda.gotadsl.co.uk> "from Richard Dawe at May

21, 2005 04:31:07 pm"

To: djgpp-workers AT delorie DOT com

Date: Sat, 21 May 2005 14:22:26 +0200 (CEST)

X-Mailer: ELM [version 2.4ME+ PL78 (25)]

MIME-Version: 1.0

X-ltu-MailScanner-Information: Please contact the ISP for more information

X-ltu-MailScanner: Found to be clean

X-MailScanner-From: ams AT ludd DOT ltu DOT se

Reply-To: djgpp-workers AT delorie DOT com

Errors-To: nobody AT delorie DOT com

X-Mailing-List: djgpp-workers AT delorie DOT com

X-Unsubscribes-To: listserv AT delorie DOT com

According to Richard Dawe:
> You're confusing the codepoint, which is the numbering of characters, 
                                               ^^^^^^^^^^^^^^^^^^^^^^^^
> symbols, etc. with how you represent them. The codepoints are abstract.
  ^^^^^^^^^^^^

> When you talk about "Unicode encoding", this is UTF-32, a mapping of 
> 0x10ffff to a 32-bit integer. That may not seem like an encoding, but it 
> is, because of endianness in the encoded data.

Ok.
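
To make the endianness point concrete, here is a minimal sketch in C of
one code point serialized as UTF-32 in both byte orders. The function
names utf32be_encode and utf32le_encode are invented for illustration;
they are not part of DJGPP or any library.

```c
#include <assert.h>
#include <stdint.h>

/* Write code point cp as UTF-32, most significant byte first. */
void utf32be_encode(uint32_t cp, unsigned char out[4])
{
    assert(cp <= 0x10FFFFu);            /* Unicode code point range */
    out[0] = (unsigned char)((cp >> 24) & 0xFFu);
    out[1] = (unsigned char)((cp >> 16) & 0xFFu);
    out[2] = (unsigned char)((cp >> 8) & 0xFFu);
    out[3] = (unsigned char)(cp & 0xFFu);
}

/* Write code point cp as UTF-32, least significant byte first. */
void utf32le_encode(uint32_t cp, unsigned char out[4])
{
    assert(cp <= 0x10FFFFu);
    out[0] = (unsigned char)(cp & 0xFFu);
    out[1] = (unsigned char)((cp >> 8) & 0xFFu);
    out[2] = (unsigned char)((cp >> 16) & 0xFFu);
    out[3] = (unsigned char)((cp >> 24) & 0xFFu);
}
```

The same abstract number yields two different byte sequences, which is
why UTF-32 counts as an encoding even though it looks like the identity
mapping.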

1. But suppose I decide to use the inverted Unicode codepoints (IUC),
which I just invented, where 
"IUC character value" == 0x10ffff - "Unicode character value".

Now I have a different set of codepoints. To me, IUC and Unicode are
two different encodings (of characters).
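
As a sketch of the IUC definition above, written in C: the names
UNICODE_MAX, iuc_from_unicode and unicode_from_iuc are made up for this
example, and the only thing taken from the message is the formula
itself.

```c
#include <assert.h>
#include <stdint.h>

#define UNICODE_MAX 0x10FFFFu   /* highest Unicode code point */

/* Map a Unicode code point to its hypothetical IUC number. */
uint32_t iuc_from_unicode(uint32_t cp)
{
    assert(cp <= UNICODE_MAX);
    return UNICODE_MAX - cp;
}

/* The same subtraction maps an IUC number back to Unicode. */
uint32_t unicode_from_iuc(uint32_t iuc)
{
    assert(iuc <= UNICODE_MAX);
    return UNICODE_MAX - iuc;
}
```

For example, 'A' is 0x41 in Unicode and 0x10ffbe in IUC. The mapping is
one-to-one and reversible, so both are numberings of the same set of
characters.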

2. In which way _isn't_ Unicode a "numbering of characters, symbols,
etc"?


Right,

						MartinS

