delorie.com/archives/browse.cgi   search  
Mail Archives: cygwin/2018/09/04/19:29:13

X-Recipient: archive-cygwin AT delorie DOT com
DomainKey-Signature: a=rsa-sha1; c=nofws; d=sourceware.org; h=list-id
:list-unsubscribe:list-subscribe:list-archive:list-post
:list-help:sender:message-id:date:from:subject:to:references
:content-type; q=dns; s=default; b=VJZW9C3BPqPI4JapOl+NVh3ajRYKy
y8+bxS50PJX8dlFbuT2ONBizHAbYLJ0jJ6M08XdU/fdC5NIunx7f6H5fdrRPsk6I
kXCbtdPLhUQdOcwMfOGItVGGlNwEwpT20ihZiLKdrVQw9tlQf2Vy22njTweJYyaj
hQ2AzYZ8d4BHm0=
DKIM-Signature: v=1; a=rsa-sha1; c=relaxed; d=sourceware.org; h=list-id
:list-unsubscribe:list-subscribe:list-archive:list-post
:list-help:sender:message-id:date:from:subject:to:references
:content-type; s=default; bh=OoiPumi+cGZ7OfPPxrCN+d+9in4=; b=Cvh
C0iRBnPfL7Ul+4Moo6JZUDcSZzCBCrg25DhwBIqNKpGSR0ifMZMQPEhCXon6nG7d
Xz+3bGnazZiH0ZhlbiCvi2Lj6A/gMemCY7SzLrcsw+OBPJWYuVTwZQPf47X7xXg2
aE6fNteacMI35Xs3r35u6axy7+n6JQf/thxcnfNY=
Mailing-List: contact cygwin-help AT cygwin DOT com; run by ezmlm
List-Id: <cygwin.cygwin.com>
List-Subscribe: <mailto:cygwin-subscribe AT cygwin DOT com>
List-Archive: <http://sourceware.org/ml/cygwin/>
List-Post: <mailto:cygwin AT cygwin DOT com>
List-Help: <mailto:cygwin-help AT cygwin DOT com>, <http://sourceware.org/ml/#faqs>
Sender: cygwin-owner AT cygwin DOT com
Mail-Followup-To: cygwin AT cygwin DOT com
Delivered-To: mailing list cygwin AT cygwin DOT com
Authentication-Results: sourceware.org; auth=none
X-Spam-SWARE-Status: No, score=-1.9 required=5.0 tests=AWL,BAYES_00,FREEMAIL_FROM,RCVD_IN_DNSWL_NONE,SPF_PASS autolearn=ham version=3.3.2 spammy=recommendations, H*M:1c69fb81, WHITE, H*M:google
X-HELO: mail-oi0-f67.google.com
DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=20161025; h=message-id:date:from:subject:to:references:user-agent; bh=mdkZsGgqrZHz5GO6+C4ZvoNzPCM6gvuhSnzmLYH1NQ0=; b=TLPJsuJQQMfImRn3l2kJd85kvTop7FfelKXIrMyg/xAyCB8a7hCQK45drn7pEsr7TM XroKxE5FX+dWedd/iUIoQ+vFtb+iHy+r/aJtm2968MVjokxLtlIGSAb/s/PAd5hoq80f HzaTEKwDj3yHSL/hNx5aAMIyY76pBfyAVUuzRoYjWul867NRe+5F9M5g2yxFIH2avZFE kfVJeb4ic0IF5JoKqmgXTI8n20u1o/vtXjrpNSBALZFAsUOi1cqpUyLKcsjNuMYDWZPb kWXmJHXsbJXIYlKUurSqi65LuekKd+hMWi3h4ExAB05ubAJwnybgZaeMIX7GbMepS/77 8OUg==
Message-ID: <5b8f1539.1c69fb81.3d7fb.2575@mx.google.com>
Date: Tue, 04 Sep 2018 16:28:57 -0700 (PDT)
From: Steven Penny <svnpenn AT gmail DOT com>
Subject: Re: Cygwin fails to utilize Unicode replacement character
To: cygwin AT cygwin DOT com
References: <5c366e53-ad20-7ccc-5d76-c4fd5adefdf9 AT towo DOT net>
User-Agent: Tryst/2.8.0 (cup.github.io/tryst)

On Tue, 4 Sep 2018 23:43:16, Thomas Wolff wrote:
> Traditionally, many terminals used to display the DEL character as a 
> checkered block, which is more or less the MEDIUM SHADE.
> This makes the glyph appear somewhat "erroneous" by convention.

I see - now that Unicode has some dedicated characters for this, it would make
sense to use them, especially since linux is already using them:

1. U+FFFD: http://unicode.org/charts/nameslist/n_FFF0.html
2. U+25A1: http://unicode.org/charts/nameslist/n_25A0.html

> valid code point with no glyph in font -> .notdef glyph -> WHITE SQUARE

this is not true. "WHITE SQUARE" refers to U+25A1, which is an actual character
and different from the ".notdef" glyph. as has been discussed as length in this
thread, the ".notdef glyph" is not an actual character, but a glyph that exists
at position 0 in the font, and while its appearance is not strictly defined,
some recommendations exist:

- empty rectangle
- rectangle with a question mark
- rectangle with an X

> Now if you switch to FFFD REPLACEMENT CHARACTER for invalid code point, 
> and considering that it does not exist in most actual fonts and that the 
> console does not apply font fallback, it will resolve to WHITE SQUARE, thus:
> folding the two different use cases into the same appearance,
> which is bad.

no again, it will resolve to ".notdef glyph", as I put above. otherwise yes, you
do have a point. in the case of a font without U+FFFD, you have ultimately:

invalid code point: .notdef glyph
missing character: .notdef glyph

several ideas have been proposed:

1. keep U+FFFD
2. go back to U+2592
3. use U+25A1 instead
4. use U+FFFD if possible else fallback to U+2592 or U+25A1

if we choose option 1, people not happy with the ambiguity can simply install
"dejavu-fonts" or similar, which Cygwin provides.


--
Problem reports:       http://cygwin.com/problems.html
FAQ:                   http://cygwin.com/faq/
Documentation:         http://cygwin.com/docs.html
Unsubscribe info:      http://cygwin.com/ml/#unsubscribe-simple

- Raw text -


  webmaster     delorie software   privacy  
  Copyright © 2019   by DJ Delorie     Updated Jul 2019