X-Recipient: archive-cygwin AT delorie DOT com X-SWARE-Spam-Status: No, hits=-1.9 required=5.0 tests=AWL,BAYES_00,SARE_MSGID_LONG40,SPF_PASS X-Spam-Check-By: sourceware.org MIME-Version: 1.0 In-Reply-To: <4B27DAA0.2020406@cs.umass.edu> References: <4B27DAA0 DOT 2020406 AT cs DOT umass DOT edu> Date: Tue, 15 Dec 2009 19:49:06 +0000 Message-ID: <416096c60912151149i673f0757v383aeddfbbf58c77@mail.gmail.com> Subject: Re: UTF-related question From: Andy Koppe To: cygwin AT cygwin DOT com Content-Type: text/plain; charset=UTF-8 X-IsSubscribed: yes Mailing-List: contact cygwin-help AT cygwin DOT com; run by ezmlm Precedence: bulk List-Id: List-Unsubscribe: List-Subscribe: List-Archive: List-Post: List-Help: , Sender: cygwin-owner AT cygwin DOT com Mail-Followup-To: cygwin AT cygwin DOT com Delivered-To: mailing list cygwin AT cygwin DOT com 2009/12/15 Eliot Moss: > Following the guidelines related to cygwin 1.7, I have > generally been using LANG=en_US.UTF-8. But I found that > if I do "man " to get a man page, and then > search (I have man's "more" program set to "less") for > a string having a dash in it, say to search for -a in the > rsync man page to find the description of that flag, it > fails to match. Hmm, it's groff being overly clever, replacing ASCII's combined hyphen/minus (U+002D) with Unicode's specific hyphen (U+2010) or minus (U+2212) characters. I don't know what to do about it, but googling "groff minus" or "man page hyphen" shows the problem exists elsewhere too. Andy -- Problem reports: http://cygwin.com/problems.html FAQ: http://cygwin.com/faq/ Documentation: http://cygwin.com/docs.html Unsubscribe info: http://cygwin.com/ml/#unsubscribe-simple