X-Recipient: archive-cygwin AT delorie DOT com X-SWARE-Spam-Status: No, hits=-1.7 required=5.0 tests=AWL,BAYES_00,DKIM_SIGNED,DKIM_VALID,DKIM_VALID_AU,FREEMAIL_FROM,RCVD_IN_DNSWL_NONE,T_TO_NO_BRKTS_FREEMAIL X-Spam-Check-By: sourceware.org MIME-Version: 1.0 In-Reply-To: <4C993FCA.4090103@charter.net> References: <4C949BEA DOT 2090508 AT charter DOT net> <20100918112910 DOT GH14602 AT calimero DOT vinschen DOT de> <4C9886F1 DOT 7010309 AT charter DOT net> <4C993FCA DOT 4090103 AT charter DOT net> Date: Wed, 22 Sep 2010 06:52:46 +0100 Message-ID: Subject: Re: Instead of a gripe, a memory-jog. From: Andy Koppe To: cygwin AT cygwin DOT com Content-Type: text/plain; charset=UTF-8 Content-Transfer-Encoding: quoted-printable X-IsSubscribed: yes Mailing-List: contact cygwin-help AT cygwin DOT com; run by ezmlm Precedence: bulk List-Id: List-Unsubscribe: List-Subscribe: List-Archive: List-Post: List-Help: , Sender: cygwin-owner AT cygwin DOT com Mail-Followup-To: cygwin AT cygwin DOT com Delivered-To: mailing list cygwin AT cygwin DOT com On 22 September 2010 00:29, SJ Wright wrote: >> Yes. I noticed where I had the territory mis-cased the next time I ran >> wget. In the line that identified the file and URL for each download, >> double-quotes and other punctuation became garbage characters, where they >> hadn't been when I either had *no* LANG variable set or a correctly-writ= ten >> one. So now it's fixed. Thanks again. If LANG (and also LC_ALL and LC_CTYPE) aren't set, Cygwin defaults to UTF-8. It's better to have it set though, because some programs such as emacs default to plain ol' ASCII if the locale isn't set. That's why LANG is set to C.UTF-8 during login shell startup (by /etc/defaults/etc/profile.d/lang.sh). In other words, you shouldn't have to worry about it. > Spoke too soon on the wget matter. Since setting a LANG variable in the > first place (and evidently the right place, or else this wouldn't be a > "matter"), I've been seeing garbage text -- I prefer to call it "drone te= xt" > -- in place of quotation marks during normal (non-verbose and not set to > "quiet") downloads. Here's a sample: >> >> Saving to: =C3=A2=E2=82=AC=C5=93gae77-7748-244-958stck.jpg=C3=A2=E2=82= =AC That looks like wget is using UTF-8 yet your terminal is using ISO-8859-1. The Cygwin console as well as all the terminals shipped with Cygwin (except for rxvt) use UTF-8 by default. With other terminals, you might have to select it somewhere in their options. Andy -- Problem reports: http://cygwin.com/problems.html FAQ: http://cygwin.com/faq/ Documentation: http://cygwin.com/docs.html Unsubscribe info: http://cygwin.com/ml/#unsubscribe-simple