From: Andy Koppe
To: cygwin AT cygwin DOT com
Date: Wed, 23 Sep 2009 13:34:59 +0100
Subject: Re: [1.7] Invalid UTF8 while creating a file -> cannot delete?

2009/9/23 Corinna Vinschen:
> I have a local patch ready to use the ANSI codepage by default in the
> "C" locale.  It appears to work nicely and has the additional positive
> side effect to simplify the code in a few places.
>
> If I only knew that eastern language users could happily live with
> this change as well!

Here's an idea to circumvent the DBCS troubles: default to UTF-8 when
no charset is specified in the locale and the ANSI charset isn't
single-byte. This is based on the following grounds:

- Full CJK support (and more) out of the box.
- DBCSs can't have worked very well in 1.5 in the first place, because
  the shell and most applications weren't aware of double-byte
  characters. Hence backward compatibility is less of an issue here.
- Applications that don't (yet) work with UTF-8 are also unlikely to
  work correctly with DBCSs.
- Iwamuro Motonori asked for it.

Andy

--
Problem reports:       http://cygwin.com/problems.html
FAQ:                   http://cygwin.com/faq/
Documentation:         http://cygwin.com/docs.html
Unsubscribe info:      http://cygwin.com/ml/#unsubscribe-simple
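
The default proposed above, combined with the quoted patch's ANSI default, could be sketched in C roughly as follows. This is only an illustration, not Cygwin's actual code: the function name `default_charset` and its three parameters are hypothetical, and the caller is assumed to already know the ANSI charset name and its maximum bytes per character (on Windows, `GetCPInfo` reports this as `MaxCharSize`).

```c
#include <stddef.h>

/* Hypothetical helper, not part of Cygwin.
 *
 * locale_charset:     charset named explicitly in the locale, or NULL/""
 *                     if the locale does not specify one.
 * ansi_charset:       name of the system ANSI charset, e.g. "CP1252".
 * ansi_max_char_size: maximum bytes per character in the ANSI charset
 *                     (1 means single-byte; 2 means a DBCS).
 *
 * Returns the charset to use by default.
 */
const char *
default_charset (const char *locale_charset,
                 const char *ansi_charset,
                 int ansi_max_char_size)
{
  /* An explicitly specified charset always wins. */
  if (locale_charset != NULL && locale_charset[0] != '\0')
    return locale_charset;

  /* No charset given: keep the ANSI charset if it is single-byte
     (assumed here to match the quoted patch's ANSI default). */
  if (ansi_max_char_size == 1)
    return ansi_charset;

  /* No charset given and the ANSI charset is multibyte (DBCS):
     fall back to UTF-8 rather than the DBCS. */
  return "UTF-8";
}
```

Under these assumptions, `default_charset (NULL, "CP1252", 1)` keeps `"CP1252"`, `default_charset (NULL, "CP932", 2)` yields `"UTF-8"`, and an explicit charset such as `"EUC-JP"` is returned unchanged whatever the ANSI charset is.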