X-Recipient: archive-cygwin@delorie.com
X-Spam-Check-By: sourceware.org
Message-ID: <COL101-W704EB769BCC6D654FF8220E6640@phx.gbl>
From: Jay <jay.krell@cornell.edu>
To: <brian@dessent.net>, <cygwin@cygwin.com>
Subject: RE: cygwin 1.7.0 and special filename chars
Date: Sun, 24 Aug 2008 19:07:38 +0000
In-Reply-To: <1219596238.28685.ezmlm@cygwin.com>
References: <1219596238.28685.ezmlm@cygwin.com>
Content-Type: text/plain; charset="iso-8859-1"
MIME-Version: 1.0
Mailing-List: contact cygwin-help@cygwin.com; run by ezmlm
Precedence: bulk
List-Id: <cygwin.cygwin.com>
List-Unsubscribe: <mailto:cygwin-unsubscribe-archive-cygwin=delorie.com@cygwin.com>
List-Subscribe: <mailto:cygwin-subscribe@cygwin.com>
List-Archive: <http://sourceware.org/ml/cygwin/>
List-Post: <mailto:cygwin@cygwin.com>
List-Help: <mailto:cygwin-help@cygwin.com>, <http://sourceware.org/ml/#faqs>
Sender: cygwin-owner@cygwin.com
Mail-Followup-To: cygwin@cygwin.com
Delivered-To: mailing list cygwin@cygwin.com
Content-Transfer-Encoding: 8bit
X-MIME-Autoconverted: from quoted-printable to 8bit by delorie.com id m7OJ8Zk4016646


 > Apparently my understanding of FAT is wrong then.  
 > Brian 

FAT stores Unicode on disk, if necessary, as part of the "long name" support.
That is, if a name happens to be 8.3, both in length and character set "etc.",
it occupies just one "normal" directory entry.
If a name is not 8.3, either in length or character set "etc.", it is stored as a "long name".

"etc." as in, at least, "dot location" -- multiple dots and leading dots are not "8.3".

C:\>dir /x
08/24/2008  12:02 PM              F66E2~1      .f
08/24/2008  12:02 PM              112E5D~1.1   1.1.1
08/24/2008  12:02 PM              FO3368~1     .fo
08/24/2008  12:02 PM              FOO~1        .foo

(notice how "short names" can be much longer than "long names").

In fact, the Win9x "installable file system" interfaces traffic in Unicode.

The referenced email describing 1.7 lists some special characters that are
now allowed, but it doesn't mention question mark.
Perhaps a good description would be LIKE: "all 8 bit characters except forward slash and nul"??
(or, whatever, all 8 bit characters except )

 - Jay

--
Unsubscribe info:      http://cygwin.com/ml/#unsubscribe-simple
Problem reports:       http://cygwin.com/problems.html
Documentation:         http://cygwin.com/docs.html
FAQ:                   http://cygwin.com/faq/


