X-Recipient: archive-cygwin AT delorie DOT com X-SWARE-Spam-Status: No, hits=-1.9 required=5.0 tests=AWL,BAYES_00,SARE_MSGID_LONG40,SPF_PASS X-Spam-Check-By: sourceware.org MIME-Version: 1.0 In-Reply-To: <4AE281BC.1040200@cwilson.fastmail.fm> References: <4AE235E4 DOT 2060005 AT gmail DOT com> <84fc9c000910231559y194a9ccfyfb9414f8ed04a361 AT mail DOT gmail DOT com> <4AE24BE4 DOT 8020207 AT gmail DOT com> <4AE281BC DOT 1040200 AT cwilson DOT fastmail DOT fm> Date: Sat, 24 Oct 2009 06:47:43 +0100 Message-ID: <416096c60910232247tb0ed351l2d542125bf566d7e@mail.gmail.com> Subject: Re: dg-error vs. i18n? From: Andy Koppe To: cygwin AT cygwin DOT com Content-Type: text/plain; charset=UTF-8 Content-Transfer-Encoding: quoted-printable X-IsSubscribed: yes Mailing-List: contact cygwin-help AT cygwin DOT com; run by ezmlm Precedence: bulk List-Id: List-Unsubscribe: List-Subscribe: List-Archive: List-Post: List-Help: , Sender: cygwin-owner AT cygwin DOT com Mail-Followup-To: cygwin AT cygwin DOT com Delivered-To: mailing list cygwin AT cygwin DOT com 2009/10/24 Charles Wilson: > [cross-posted to cygwin list] > > Background for cygwin list: Dave discovered a problem running some of > the gcc tests. =C2=A0The tests were run in the "C" locale, but in so doing > they assumed an ascii encoding (specifically, that "'" would match ' in > test patterns -- but the program actually emitted those fancy curled > quotes which did not match '). Do you mean they explicitly set the "C" locale? Hmm. Now that we've got the "C.UTF-8" default, "C" could actually go back to mean ASCII. With no locale variables set, the console and filesystem would use UTF-8 anyway, as would applications that call setlocale(,""). Only applications that don't call setlocale() would be using the "C" locale and hence ASCII, as but that'd be fine as either they don't care about it or they actually expect to be using ASCII. > Dave Korn wrote: >> Thanks, that was it. =C2=A0Had to use "C.CP437" in the end, apparently w= e have >> charset encoding names for lots of OEM code pages but none for plain van= illa >> ASCII. Actually, we do: "C.ASCII". Except it's 8-bit rather than 7-bit as elsewhere, making it practically the same as "C.ISO-8859-1". Andy -- Problem reports: http://cygwin.com/problems.html FAQ: http://cygwin.com/faq/ Documentation: http://cygwin.com/docs.html Unsubscribe info: http://cygwin.com/ml/#unsubscribe-simple