From: Achim Gratz <Stromeko@nexgo.de>
To: cygwin@cygwin.com
Subject: Re: Perl Unidecode modules - which to use (if not Text::Unidecode)?
References: <d3342ff4-f717-f882-5c41-b27ab272dc03@cyberXpress.co.nz>
Date: Mon, 05 Apr 2021 08:43:19 +0200
In-Reply-To: <d3342ff4-f717-f882-5c41-b27ab272dc03@cyberXpress.co.nz> (Mark
 Aitchison's message of "Fri, 2 Apr 2021 09:35:31 +1300")
Message-ID: <87sg45tens.fsf@Rainer.invalid>

Mark Aitchison writes:
> I am writing perl programs that I'd like to know will work under both
> Linux and Cygwin, and have to deal with Unicode now.

Why not do it properly, i.e. actually work in Unicode?

> I had used Text::Unidecode happily in Linux but find no cygwin
> version. Possibly I am not looking in the right places for it, but
> possibly there are different Unicode-related modules that are
> well-supported under both cygwin and linux that I should be using
> instead, and I guess Unicode might be one of those things where it
> depends on the underlying o/s so it probably pays to go with whatever
> is the standard set of modules.

Text::Unidecode _strips_ Unicode characters down to ASCII so that
programs that are not Unicode-aware will not balk.  This may have been
useful in the past, but I no longer see the point when the standard
environment almost everywhere is either UTF-8 or UTF-16 these days.
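
To illustrate what gets lost when text is folded down to ASCII, here is a
rough sketch using only the core Unicode::Normalize module.  It is not the
Text::Unidecode algorithm (which uses transliteration tables and would do
better on non-Latin scripts); fold_to_ascii is a hypothetical helper name
chosen for this example.

```perl
use strict;
use warnings;
use Unicode::Normalize qw(NFKD);

# Crude ASCII folding, a stand-in built from core modules only.
# Assumption: this is NOT how Text::Unidecode works internally.
sub fold_to_ascii {
    my ($text) = @_;
    my $s = NFKD($text);           # split accented letters into base + marks
    $s =~ s/\p{Mn}+//g;            # drop the combining marks
    $s =~ s/[^\x00-\x7F]/?/g;      # anything still non-ASCII becomes "?"
    return $s;
}

my $with_umlaut    = fold_to_ascii("M\x{FC}ller");   # "Mueller" spelled with u-umlaut
my $without_umlaut = fold_to_ascii("Muller");

# After folding, the two different names can no longer be told apart.
print "$with_umlaut / $without_umlaut\n";
```

Once folded, two distinct names compare equal, which is the kind of
information loss that staying in Unicode avoids.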

[…]

See "perldoc perlunitut" and "perldoc perlunicode" for starters.
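
As a minimal sketch of what working in Unicode tends to look like in Perl:
decode bytes into characters on input, operate on characters, and encode
again on output.  It uses only the core Encode module, so it should behave
the same on Linux and Cygwin; the sample input string is made up for
illustration.

```perl
use strict;
use warnings;
use Encode qw(decode encode);

# Hypothetical input: raw UTF-8 bytes as they would arrive from a file or pipe.
my $bytes = "Z\xC3\xBCrich";           # "Zuerich" with u-umlaut, 7 bytes

# Decode at the input boundary: bytes become a string of characters.
my $text = decode('UTF-8', $bytes);

# Character-level operations now see 6 characters instead of 7 bytes.
my $chars = length $text;
my $upper = uc $text;                  # the umlaut is upper-cased as well

# Encode at the output boundary.
my $out = encode('UTF-8', $upper);
print "$chars characters: $out\n";
```

In a real script the explicit decode/encode calls are often replaced by I/O
layers, e.g. "use open qw(:std :encoding(UTF-8));", so that file handles do
the conversion.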

Regards,
Achim.