delorie.com/archives/browse.cgi | search |
X-Recipient: | archive-cygwin AT delorie DOT com |
X-Original-To: | cygwin AT cygwin DOT com |
Delivered-To: | cygwin AT cygwin DOT com |
DMARC-Filter: | OpenDMARC Filter v1.3.2 sourceware.org 7A556385782D |
Authentication-Results: | sourceware.org; dmarc=none (p=none dis=none) |
header.from=cyberXpress.co.nz | |
Authentication-Results: | sourceware.org; |
spf=pass smtp.mailfrom=M DOT Aitchison AT cyberXpress DOT co DOT nz | |
DKIM-Signature: | v=1; a=rsa-sha1; c=simple; d=plain.co.nz; h=to:from |
:subject:message-id:date:mime-version:content-type | |
:content-transfer-encoding; s=mail; bh=t8w4brQQ5sOP/86D0b37u3ckV | |
YM=; b=Z33Q07iS4w9TyuMK1Vp3ZvHZPGDXch9qRjsjhGfZ6AmnvlKtRSuEyWUz+ | |
YJ8WOlQ55oAjjSHPP5Xg4FYj5LAPtQvVfvk2wBDXOlMa/KmoJ3NEwFxyAHR+uLss | |
D7aRMIqwcl7Ab9UFFFmFXFVqBhteDmMLJOlhGS7bdk+dlErSnU= | |
To: | cygwin AT cygwin DOT com |
From: | Mark Aitchison <M DOT Aitchison AT cyberXpress DOT co DOT nz> |
Subject: | Perl Unidecode modules - which to use (if not Text::Unidecode)? |
Message-ID: | <d3342ff4-f717-f882-5c41-b27ab272dc03@cyberXpress.co.nz> |
Date: | Fri, 2 Apr 2021 09:35:31 +1300 |
User-Agent: | Mozilla/5.0 (X11; Linux x86_64; rv:78.0) Gecko/20100101 |
Thunderbird/78.8.0 | |
MIME-Version: | 1.0 |
X-Spam-Status: | No, score=0.1 required=5.0 tests=BAYES_00, BODY_8BITS, |
DKIM_SIGNED, DKIM_VALID, JMQ_SPF_NEUTRAL, SPF_HELO_NONE, | |
SPF_PASS autolearn=no autolearn_force=no version=3.4.2 | |
X-Spam-Checker-Version: | SpamAssassin 3.4.2 (2018-09-13) on |
server2.sourceware.org | |
X-BeenThere: | cygwin AT cygwin DOT com |
X-Mailman-Version: | 2.1.29 |
List-Id: | General Cygwin discussions and problem reports <cygwin.cygwin.com> |
List-Archive: | <https://cygwin.com/pipermail/cygwin/> |
List-Post: | <mailto:cygwin AT cygwin DOT com> |
List-Help: | <mailto:cygwin-request AT cygwin DOT com?subject=help> |
List-Subscribe: | <https://cygwin.com/mailman/listinfo/cygwin>, |
<mailto:cygwin-request AT cygwin DOT com?subject=subscribe> | |
Sender: | "Cygwin" <cygwin-bounces AT cygwin DOT com> |
X-MIME-Autoconverted: | from base64 to 8bit by delorie.com id 131KZfYA006352 |
I am writing perl programs that I'd like to know will work under both Linux and Cygwin, and have to deal with Unicode now. I had used Text::Unidecode happily in Linux but find no cygwin version. Possibly I am not looking in the right places for it, but possibly there are different Unicode-related modules that are well-supported under both cygwin and linux that I should be using instead, and I guess Unicode might be one of those things where it depends on the underlying o/s so it probably pays to go with whatever is the standard set of modules. 1. What perl Unicode modules should I consider, if not Text::Unidecode? The present need is to be able to convert those few "foreign" characters (like ÇĆĈĊçĉċĜĞĠĢĝģğġËÌÍÎÏÒÓÔÕ) that are basically ASCII with accent marks to their closest ASCII equivalents, but I'd like to do more with Unicode in the future, without going down any dead-ends as far as being able to run under cygwin is concerned. 2. I see some talk of Internationalization in Chapter 2 of "Setting up Cygwin", but cannot see anything relating to perl modules, and I don't see any easy way to search many months of the mailing list for a keyword... is there any information I should know about? Thanks, Mark Aitchison -- Problem reports: https://cygwin.com/problems.html FAQ: https://cygwin.com/faq/ Documentation: https://cygwin.com/docs.html Unsubscribe info: https://cygwin.com/ml/#unsubscribe-simple
webmaster | delorie software privacy |
Copyright 2019 by DJ Delorie | Updated Jul 2019 |