X-Recipient: archive-cygwin AT delorie DOT com DomainKey-Signature: a=rsa-sha1; c=nofws; d=sourceware.org; h=list-id :list-unsubscribe:list-subscribe:list-archive:list-post :list-help:sender:message-id:date:from:mime-version:to:subject :references:in-reply-to:content-type:content-transfer-encoding; q=dns; s=default; b=LsSLummR+YhXru59Oq38eDjBqkA7YOlk8gQO6keVvK/ Tuqb2FZHqdak+1iKmrY4ksjKoB8nQLs1JXlrv17ZDBXyOmTZmAGx2M9HvdJRfwxk kb3M5ZMptYA1ANd96Ih0uhGMg8PMzAGGUx2vLp9eEnYwHsU/UsMmwMgZiLmhgnhA = DKIM-Signature: v=1; a=rsa-sha1; c=relaxed; d=sourceware.org; h=list-id :list-unsubscribe:list-subscribe:list-archive:list-post :list-help:sender:message-id:date:from:mime-version:to:subject :references:in-reply-to:content-type:content-transfer-encoding; s=default; bh=48PDeQHbe4i0j9Yjx7JDNesKJQw=; b=HTVQsJdv1cNpLLQxf X0lBgUMRl2Zo4vAaV/XNp+C72ONzGriIA7+LBC/3+pUo2XlxsxFAQFpVbVrEdEoO USLLsLrP+0cX0Mmvm1/lKh1+CwM4mj7/GkV+W34Mjb5R5Bae5eKxWuhaDUeKpsij ElAUg5JEBziG2RoFtFwc8K/5ds= Mailing-List: contact cygwin-help AT cygwin DOT com; run by ezmlm List-Id: List-Subscribe: List-Archive: List-Post: List-Help: , Sender: cygwin-owner AT cygwin DOT com Mail-Followup-To: cygwin AT cygwin DOT com Delivered-To: mailing list cygwin AT cygwin DOT com Authentication-Results: sourceware.org; auth=none X-Virus-Found: No X-Spam-SWARE-Status: No, score=-4.2 required=5.0 tests=BAYES_50,GIT_PATCH_2 autolearn=ham version=3.3.2 spammy=UD:uk, H*r:8.14.7, UD:co.uk, OVER X-HELO: Ishtar.sc.tlinx.org Message-ID: <5B3045B1.4080504@tlinx.org> Date: Sun, 24 Jun 2018 18:30:25 -0700 From: L A Walsh User-Agent: Thunderbird MIME-Version: 1.0 To: cygwin AT cygwin DOT com Subject: Re: UTF-8 character encoding References: <1183751257 DOT 20180621042620 AT yandex DOT ru> In-Reply-To: Content-Type: text/plain; charset=UTF-8; format=flowed Content-Transfer-Encoding: 7bit X-IsSubscribed: yes Lee wrote: > So... keep it simple, set > LANG=en_US.UTF-8 > and use vi or something else that comes with cygwin to create the file > and I'll have a file with UTF-8 character encoding - correct? --- The first 127 characters of UTF-8 are identical to the first 127 characters of ASCII, and latin1 and iso-8859-1. If you don't use any characters that need accents or special symbols, then nothing will be encoded in UTF-8, because its only the characters OVER the first 127 (see chart @ http://www.babelstone.co.uk/Unicode/babelmap.html). The site also has a sw util (http://www.babelstone.co.uk/Software/BabelMap.html), that displays and helps config fonts to display all the characters in unicode, though it hasn't been updated to the changes that came out last month or so (Unicode 11). It's a cool little, *free*, utility...though if you find it useful you can always send in your registration. -- Problem reports: http://cygwin.com/problems.html FAQ: http://cygwin.com/faq/ Documentation: http://cygwin.com/docs.html Unsubscribe info: http://cygwin.com/ml/#unsubscribe-simple