X-Recipient: archive-cygwin AT delorie DOT com DomainKey-Signature: a=rsa-sha1; c=nofws; d=sourceware.org; h=list-id :list-unsubscribe:list-subscribe:list-archive:list-post :list-help:sender:mime-version:in-reply-to:references:from:date :message-id:subject:to:content-type; q=dns; s=default; b=v/KjQFC XeJv0Q8jiDCpeEmANBhpepsgkIR2oU8KIyGV2zwdpq9dWQBgbs04l7ZpAR0dHnHm FUEX4+j/uXj+97npdDaqS2NTtFPFC8JLMb5fM2M0vwDBYXv0/dvy8UxkGhRTZ7ou kmwDMQCry3CL8sbqSg5xavebyjIhYmy1Es1M= DKIM-Signature: v=1; a=rsa-sha1; c=relaxed; d=sourceware.org; h=list-id :list-unsubscribe:list-subscribe:list-archive:list-post :list-help:sender:mime-version:in-reply-to:references:from:date :message-id:subject:to:content-type; s=default; bh=K6G0OpZvBtPOC qnU6F6pq/5Ej60=; b=b9vQObXV/abh7u5UxkmMcp/dz+PeHK701RTahkwi2UkDc YRSwm0BOWGzJ2mgc1QWWJv9qdkh9d3U5IztICqRlBHqxmXQ2r0L3BZNFmFSvDGi8 J8jOJsLEvJj/uabmjJGdw5+kMl1I6Ozf7NZZZgIYTEGrU6NFC7h93RBcs5a/j0= Mailing-List: contact cygwin-help AT cygwin DOT com; run by ezmlm List-Id: List-Subscribe: List-Archive: List-Post: List-Help: , Sender: cygwin-owner AT cygwin DOT com Mail-Followup-To: cygwin AT cygwin DOT com Delivered-To: mailing list cygwin AT cygwin DOT com Authentication-Results: sourceware.org; auth=none X-Spam-SWARE-Status: No, score=-1.6 required=5.0 tests=AWL,BAYES_00,FREEMAIL_ENVFROM_END_DIGIT,FREEMAIL_FROM,RCVD_IN_DNSWL_NONE,SPF_PASS autolearn=no version=3.3.2 spammy= X-HELO: mail-io0-f177.google.com DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=20161025; h=mime-version:in-reply-to:references:from:date:message-id:subject:to; bh=8f3CgmRXsiDzRQZpf5Wj639sCPU1q/5k8l9EyPCAQIY=; b=bjQAnSJ0c7xNm6L4DF86j1lgQBWOhZ+o0uWWTqC8uO80UFvg7lMkOXAnzNa4TZOHs5 gb2+6DJoqFOjKxPnI/LzP07n0ZOcEDFApP1WlXUMDHQTkKG9kVxMtsICw2H+sGAcnvuf CR3oW6QhKi0tcS0KS4QCYVACm1kxDjIaV+QT9Gi3wO04yIaoK2xeULF7IUyGz2tdIJhV fBOuVri7tJp3/nB83EjRXEjhq5xJ2+7U32AWtBtJTgtpiII8nLK707+4t5OptqKoK+oJ szElM3Gb8JXkh/WdBpipA9Utbebn0O30dbS78vf9G6PERnQfal1t/ycl8sZHjsf6FdQa hiUQ== MIME-Version: 1.0 In-Reply-To: References: <1183751257 DOT 20180621042620 AT yandex DOT ru> <5B3045B1 DOT 4080504 AT tlinx DOT org> From: Lee Date: Wed, 27 Jun 2018 02:53:38 -0400 Message-ID: Subject: Re: UTF-8 character encoding To: cygwin AT cygwin DOT com Content-Type: text/plain; charset="UTF-8" X-IsSubscribed: yes On 6/26/18, Michael Enright wrote: > On Mon, Jun 25, 2018 at 11:33 AM, Lee wrote: >> I'm still trying to figure utf-8 out, but it seems to me that 0x0 - >> 0xff is part of the utf-8 encoding. > > I don't see how you arrived at this. I screwed up trying to do hex in my head. For whatever reason I didn't want to write 0 - 127 > An initial byte of 0xFF is not > the initial byte of any valid UTF-8 byte sequence. And it doesn't > conform with the statement you have later: right, I screwed up :) > The standards such as IETF RFC-3629 are easy enough to read, so I > recommend using them and citing them to others instead of trying to > summarize. Thanks for the RFC reference - I hadn't come across that one yet. Lee -- Problem reports: http://cygwin.com/problems.html FAQ: http://cygwin.com/faq/ Documentation: http://cygwin.com/docs.html Unsubscribe info: http://cygwin.com/ml/#unsubscribe-simple