Mail Archives: cygwin/2000/08/31/10:55:16

Mailing-List: contact cygwin-help AT sourceware DOT cygnus DOT com; run by ezmlm
List-Subscribe: <mailto:cygwin-subscribe AT sources DOT redhat DOT com>
List-Archive: <http://sources.redhat.com/ml/cygwin/>
List-Post: <mailto:cygwin AT sources DOT redhat DOT com>
List-Help: <mailto:cygwin-help AT sources DOT redhat DOT com>, <http://sources.redhat.com/ml/#faqs>
Sender: cygwin-owner AT sources DOT redhat DOT com
Delivered-To: mailing list cygwin AT sources DOT redhat DOT com
Message-ID: <c=DK%a=_%p=CD-Danmark%l=CDDKSERVER-000831145220Z-2699@cddkserver.cddk.dk>
From: Peter Ring <PRI AT cddk DOT dk>
To: "'cygwin AT sources DOT redhat DOT com'" <cygwin AT sources DOT redhat DOT com>
Subject: behaviour of textutil sort has changed
Date: Thu, 31 Aug 2000 16:52:20 +0200
X-Mailer: Microsoft Exchange Server Internet Mail Connector Version 4.0.996.62
MIME-Version: 1.0
X-MIME-Autoconverted: from quoted-printable to 8bit by delorie.com id KAA06022

The behaviour of the Cygwin port of sort (from textutils) seems to have
changed slightly. I can't say for sure when it happened; I'm trying to
find out whether it has to do with the pesky EOL issue.

For the record, I'm running on Windows NT 4.0. The installation was
updated quite recently, all mount points are binary, and CYGWIN is set
to 'tty binmode ntea ntsec'.

Previously, given an input file 'sortexample.in' with LF as EOL:

d5669267~1980_REM
c56aa142~1980_REM~001__Om_revisionsmeddelelser
00a06b52~1980_REM~001__Om_revisionsmeddelelser~Hoved
0001def3~1980_REM~001__Om_revisionsmeddelelser~001
711f9bb6~1980_REM~002__Fonde
00a06b52~1980_REM~002__Fonde~Hoved
0001def3~1980_REM~002__Fonde~001
0001def0~1980_REM~002__Fonde~002
0001def1~1980_REM~002__Fonde~003
0001def6~1980_REM~002__Fonde~004
0001def7~1980_REM~002__Fonde~005
4e4ce819~1979_SD-CIR
7aa92869~1979_SD-CIR~001__Trækgrundlaget
00a06b52~1979_SD-CIR~001__Trækgrundlaget~Hoved
0001def3~1979_SD-CIR~001__Trækgrundlaget~001

this command:

  sort -t~ -k2,2 -k3,3 -k4,4 -o sortexample.out sortexample.in

would produce the following output in sortexample.out:

4e4ce819~1979_SD-CIR
7aa92869~1979_SD-CIR~001__Trækgrundlaget
0001def3~1979_SD-CIR~001__Trækgrundlaget~001
00a06b52~1979_SD-CIR~001__Trækgrundlaget~Hoved
d5669267~1980_REM
c56aa142~1980_REM~001__Om_revisionsmeddelelser
0001def3~1980_REM~001__Om_revisionsmeddelelser~001
00a06b52~1980_REM~001__Om_revisionsmeddelelser~Hoved
711f9bb6~1980_REM~002__Fonde
0001def3~1980_REM~002__Fonde~001
0001def0~1980_REM~002__Fonde~002
0001def1~1980_REM~002__Fonde~003
0001def6~1980_REM~002__Fonde~004
0001def7~1980_REM~002__Fonde~005
00a06b52~1980_REM~002__Fonde~Hoved

BTW, I also get this behaviour if I run the example on a Linux box.

Now, on Cygwin, the same command produces this output instead:

0001def3~1979_SD-CIR~001__Trækgrundlaget~001
00a06b52~1979_SD-CIR~001__Trækgrundlaget~Hoved
7aa92869~1979_SD-CIR~001__Trækgrundlaget
4e4ce819~1979_SD-CIR
0001def3~1980_REM~001__Om_revisionsmeddelelser~001
00a06b52~1980_REM~001__Om_revisionsmeddelelser~Hoved
c56aa142~1980_REM~001__Om_revisionsmeddelelser
0001def3~1980_REM~002__Fonde~001
0001def0~1980_REM~002__Fonde~002
0001def1~1980_REM~002__Fonde~003
0001def6~1980_REM~002__Fonde~004
0001def7~1980_REM~002__Fonde~005
00a06b52~1980_REM~002__Fonde~Hoved
711f9bb6~1980_REM~002__Fonde
d5669267~1980_REM

It is as if something is now silently appended to the end of each line,
so that '1979_SD-CIRsomething' compares greater than '1979_SD-CIR'. I
used to get this latter ordering when the input had CR-LF as EOL (which,
BTW, is also what happens on the Linux box if there's a CR before the
LF).
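
To illustrate the suspected mechanism, here is a minimal Python sketch. It is not how sort is implemented; it only emulates the key selection of 'sort -t~ -k2,2 -k3,3 -k4,4' under the assumption of plain byte-order (C locale) comparison. The names sort_key, emulate_sort and SAMPLE are made up for this example. The point is that a CR left on each line becomes part of whichever field is last on that line, so a short line's key ('1980_REM' plus CR) compares greater than the same field on a longer line.

```python
# Sketch only: emulates `sort -t~ -k2,2 -k3,3 -k4,4` using plain
# code-point comparison. Hypothetical helper names; not sort's own code.

def sort_key(line):
    """Return fields 2-4 of a '~'-separated line, plus the line as tie-breaker."""
    fields = line.split("~")
    fields += [""] * (4 - len(fields))  # missing fields compare as empty keys
    return (fields[1], fields[2], fields[3], line)


def emulate_sort(lines):
    """Sort lines the way the command in the post selects its keys."""
    return sorted(lines, key=sort_key)


# Four lines taken from sortexample.in above.
SAMPLE = [
    "d5669267~1980_REM",
    "c56aa142~1980_REM~001__Om_revisionsmeddelelser",
    "00a06b52~1980_REM~001__Om_revisionsmeddelelser~Hoved",
    "0001def3~1980_REM~001__Om_revisionsmeddelelser~001",
]

if __name__ == "__main__":
    print("LF only:")
    for line in emulate_sort(SAMPLE):
        print("  " + line)

    print("With a CR left at the end of each line:")
    for line in emulate_sort([l + "\r" for l in SAMPLE]):
        print("  " + line.rstrip("\r"))
```

With LF-only input the short lines come first, as in the old output; with a trailing CR the order within the group is reversed, as in the new output. If this is what is happening, stripping the CR before sorting should restore the old ordering.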

Before I start hacking sort, I'd like to know if this is an intended
change.

Kind regards,
Peter Ring

