X-Authentication-Warning: delorie.com: mail set sender to djgpp-workers-bounces using -f Message-ID: <428F7B83.2010105@phekda.gotadsl.co.uk> Date: Sat, 21 May 2005 19:18:43 +0100 From: Richard Dawe User-Agent: Mozilla/5.0 (X11; U; Linux i686; en-US; rv:1.7.8) Gecko/20050513 Fedora/1.7.8-1.3.1 X-Accept-Language: en, de, fr MIME-Version: 1.0 To: djgpp-workers AT delorie DOT com Subject: Re: wchar_t implementation and multibyte encoding References: <200505211222 DOT j4LCMQKW025118 AT speedy DOT ludd DOT ltu DOT se> In-Reply-To: <200505211222.j4LCMQKW025118@speedy.ludd.ltu.se> Content-Type: text/plain; charset=us-ascii; format=flowed Content-Transfer-Encoding: 7bit Reply-To: djgpp-workers AT delorie DOT com Hello. ams AT ludd DOT ltu DOT se wrote: > According to Richard Dawe: > >>You're confusing the codepoint, which is the numbering of characters, > > ^^^^^^^^^^^^^^^^^^^^^^^^ > >>symbols, etc. with how you represent them. The codepoints are abstract. > > ^^^^^^^^^^^^ > > >>When you talk about "Unicode encoding", this is UTF-32, a mapping of >>0x10ffff to a 32-bit integer. That may not seem like an encoding, but it >>is, because of endianness in the encoded data. > > > Ok. > > 1. But suppose I decide to use the inverted Unicode codepoints (IUC), > which I just invented, where > "IUC character value" == 0x10ffff - "Unicode chararcter value". > > Now I have a different set of codepoints. To me, IUC and Unicode are > two different encodings (of characters). Well the Unicode codepoint is still the same. A value in your IUC is an encoding of the codepoint, not a codepoint. > 2. I which way _isn't_ Unicode a "numbering of characters, symbols, > etc"? Unicode is a numbering of characters, symbols, etc. I think I misunderstand your question. Bye, Rich =] -- Richard Dawe [ http://homepages.nildram.co.uk/~phekda/richdawe/ ] "You can't evaluate a man by logic alone." -- McCoy, "I, Mudd", Star Trek