X-Recipient: archive-cygwin AT delorie DOT com X-SWARE-Spam-Status: No, hits=-2.5 required=5.0 tests=AWL,BAYES_00,SPF_PASS X-Spam-Check-By: sourceware.org Message-ID: <4AE297CB.7040200@gmail.com> Date: Sat, 24 Oct 2009 06:59:39 +0100 From: Dave Korn User-Agent: Thunderbird 2.0.0.17 (Windows/20080914) MIME-Version: 1.0 To: Charles Wilson CC: Dave Korn , Cygwin Mailing List Subject: Re: dg-error vs. i18n? References: <4AE235E4 DOT 2060005 AT gmail DOT com> <84fc9c000910231559y194a9ccfyfb9414f8ed04a361 AT mail DOT gmail DOT com> <4AE24BE4 DOT 8020207 AT gmail DOT com> <4AE281BC DOT 1040200 AT cwilson DOT fastmail DOT fm> In-Reply-To: <4AE281BC.1040200@cwilson.fastmail.fm> Content-Type: text/plain; charset=ISO-8859-1 Content-Transfer-Encoding: 7bit Mailing-List: contact cygwin-help AT cygwin DOT com; run by ezmlm List-Id: List-Subscribe: List-Archive: List-Post: List-Help: , Sender: cygwin-owner AT cygwin DOT com Mail-Followup-To: cygwin AT cygwin DOT com Delivered-To: mailing list cygwin AT cygwin DOT com Charles Wilson wrote: > [cross-posted to cygwin list] [ Cross-post broken and CC list trimmed; I don't think we need trouble the GCC list with this again until we have a patch that says what kind of target-dependent changes we want to make to the testsuite files to set LANG and LC_ALL correctly for our platform. ] > > Background for cygwin list: Dave discovered a problem running some of > the gcc tests. The tests were run in the "C" locale, but in so doing > they assumed an ascii encoding (specifically, that "'" would match ' in > test patterns -- but the program actually emitted those fancy curled > quotes which did not match '). > > Dave Korn wrote: >> Thanks, that was it. Had to use "C.CP437" in the end, apparently we have >> charset encoding names for lots of OEM code pages but none for plain vanilla >> ASCII. > > That's interesting. I had thought "ascii" was a fairly common encoding > name; I know I've seen both 'encoding="ascii"' and 'encoding="us-ascii"' > in XML documents. Maybe we (cygwin) should add an explicit > plain-old-ascii encoding name? This was tangentially referenced in the recent thread "Re: "C" UTF-8 trouble" on the -developers list. cheers, DaveK -- Problem reports: http://cygwin.com/problems.html FAQ: http://cygwin.com/faq/ Documentation: http://cygwin.com/docs.html Unsubscribe info: http://cygwin.com/ml/#unsubscribe-simple