delorie.com/archives/browse.cgi   search  
Mail Archives: cygwin/2020/07/28/00:06:07

X-Recipient: archive-cygwin AT delorie DOT com
X-Original-To: cygwin AT cygwin DOT com
Delivered-To: cygwin AT cygwin DOT com
DMARC-Filter: OpenDMARC Filter v1.3.2 sourceware.org 0ECD53857C49
Authentication-Results: sourceware.org; dmarc=none (p=none dis=none)
header.from=SystematicSw.ab.ca
Authentication-Results: sourceware.org;
spf=none smtp.mailfrom=brian DOT inglis AT systematicsw DOT ab DOT ca
X-Authority-Analysis: v=2.3 cv=ecemg4MH c=1 sm=1 tr=0
a=kiZT5GMN3KAWqtYcXc+/4Q==:117 a=kiZT5GMN3KAWqtYcXc+/4Q==:17
a=IkcTkHD0fZMA:10 a=RZ24vCjvlsqmbDxLIRQA:9 a=7Zwj6sZBwVKJAoWSPKxL6X1jA+E=:19
a=QEXdDO2ut3YA:10
Subject: Re: Problem with output from gawk software in recent Cygwin
installation
To: cygwin AT cygwin DOT com
References: <CAC7Qdh1bPAejJZkV6GY96kcAVKNFDvGcTBpbSfXY6jC-aMGsiQ AT mail DOT gmail DOT com>
<1729e34a-245c-2e87-6604-bebf8e08d62b AT cs DOT umass DOT edu>
<000601d6643e$7153de00$53fb9a00$@rogers.com>
<3609a7fa-6024-2091-f2fb-b3052ab8f051 AT SystematicSw DOT ab DOT ca>
<CAC7Qdh3AEkbX849S+VyDgSm0_WYRaP5d0zT2QOvmWpd3t3PMkQ AT mail DOT gmail DOT com>
From: Brian Inglis <Brian DOT Inglis AT SystematicSw DOT ab DOT ca>
Autocrypt: addr=Brian DOT Inglis AT SystematicSw DOT ab DOT ca; prefer-encrypt=mutual;
keydata=
mDMEXopx8xYJKwYBBAHaRw8BAQdAnCK0qv/xwUCCZQoA9BHRYpstERrspfT0NkUWQVuoePa0
LkJyaWFuIEluZ2xpcyA8QnJpYW4uSW5nbGlzQFN5c3RlbWF0aWNTdy5hYi5jYT6IlgQTFggA
PhYhBMM5/lbU970GBS2bZB62lxu92I8YBQJeinHzAhsDBQkJZgGABQsJCAcCBhUKCQgLAgQW
AgMBAh4BAheAAAoJEB62lxu92I8Y0ioBAI8xrggNxziAVmr+Xm6nnyjoujMqWcq3oEhlYGAO
WacZAQDFtdDx2koSVSoOmfaOyRTbIWSf9/Cjai29060fsmdsDLg4BF6KcfMSCisGAQQBl1UB
BQEBB0Awv8kHI2PaEgViDqzbnoe8B9KMHoBZLS92HdC7ZPh8HQMBCAeIfgQYFggAJhYhBMM5
/lbU970GBS2bZB62lxu92I8YBQJeinHzAhsMBQkJZgGAAAoJEB62lxu92I8YZwUBAJw/74rF
IyaSsGI7ewCdCy88Lce/kdwX7zGwid+f8NZ3AQC/ezTFFi5obXnyMxZJN464nPXiggtT9gN5
RSyTY8X+AQ==
Organization: Systematic Software
Message-ID: <ab599eb1-cc52-2a91-2a95-66c929a86a05@SystematicSw.ab.ca>
Date: Mon, 27 Jul 2020 22:05:16 -0600
User-Agent: Mozilla/5.0 (Windows NT 10.0; WOW64; rv:68.0) Gecko/20100101
Thunderbird/68.10.0
MIME-Version: 1.0
In-Reply-To: <CAC7Qdh3AEkbX849S+VyDgSm0_WYRaP5d0zT2QOvmWpd3t3PMkQ@mail.gmail.com>
X-CMAE-Envelope: MS4wfLoo+QJbZM4IKFFV6XOPAJUfDbR4nHZN0GzVhyrX063bp0t2hmdiXe2MvWEI0xvxjGRGWL61jf8w3aeqX+/XCW3QvkhCZ316r8aM5W2k4S6OStMvzQwN
DNcuWNlWYYNRPISWL/cKUhooe1KSWB0d+IrPDSkP4otfsHU+PMBNp0SJ75w0/qNTqXLG8cc4Ct5Dkw==
X-Spam-Status: No, score=-1.0 required=5.0 tests=BAYES_00, KAM_DMARC_STATUS,
KAM_LAZY_DOMAIN_SECURITY, LOTS_OF_MONEY, RCVD_IN_DNSWL_LOW, SPF_HELO_NONE,
SPF_NONE, TXREP autolearn=no autolearn_force=no version=3.4.2
X-Spam-Checker-Version: SpamAssassin 3.4.2 (2018-09-13) on
server2.sourceware.org
X-BeenThere: cygwin AT cygwin DOT com
X-Mailman-Version: 2.1.29
List-Id: General Cygwin discussions and problem reports <cygwin.cygwin.com>
List-Unsubscribe: <http://cygwin.com/mailman/options/cygwin>,
<mailto:cygwin-request AT cygwin DOT com?subject=unsubscribe>
List-Archive: <https://cygwin.com/pipermail/cygwin/>
List-Post: <mailto:cygwin AT cygwin DOT com>
List-Help: <mailto:cygwin-request AT cygwin DOT com?subject=help>
List-Subscribe: <http://cygwin.com/mailman/listinfo/cygwin>,
<mailto:cygwin-request AT cygwin DOT com?subject=subscribe>
Reply-To: cygwin AT cygwin DOT com
Errors-To: cygwin-bounces AT cygwin DOT com
Sender: "Cygwin" <cygwin-bounces AT cygwin DOT com>

On 2020-07-27 15:58, Bryan VanSchouwen wrote:
> On Mon, Jul 27, 2020 at 4:20 PM Brian Inglis wrote:
>> On 2020-07-27 11:50, Michel LaBarre wrote:
>>> On July 27, 2020 12:52 PM, Eliot Moss wrote:
>>>> On 7/27/2020 11:47 AM, Bryan VanSchouwen wrote:
>>>>> I just tried executing an awk script using the most recent version
>>>>> of gawk, but the output did not turn out the way that it was supposed
>>>>> to.
>>>>> This script uses the following command to print the output data to
>>>>> the output file:
>>>>> print(cai[i], rpi[i], i) > 
>>>>> "Fit_Height_correln_plot_-_cPuMP_vs_2NH2-cPuMP.dat"
>>>>> and previously, this command always printed the values of the three 
>>>>> variables on a single line, separated by spaces; however, now the
>>>>> gawk software is automatically adding hard-returns between the
>>>>> values, resulting in the three values being printed on separate lines
>>>>> within the data file.
>>>>> What is going on here, and how do I permanently make it stop??

>>> Here's a wondering: Could it have to do with line endings?  If Windows 
>>> CRLF is getting in there, then the variables might get a CR in them, 
>>> which might do weird things.  This assumes those are string variables, 
>>> not numeric.

>> Better yet, how about an example using manifest constants in a one line
>> sample to eliminate impact of arrays or changes in input data as in: gawk
>> 'BEGIN {print(1,2,3)}' or gawk 'BEGIN {print(1,2,3) > "xxx.txt"}'>
> No problem with awk or gawk:
> $ for ((i = 0; i < 10; ++i))
>   do
>     printf "%d %d %d %d\n" $((i+1)) $((i+2)) $((i+3)) $((i+4))
>   done > test.txt
> $ awk '{print($1, $2, $3)}' test.txt
>     1 2 3
>     2 3 4
>     3 4 5
>     4 5 6
>     5 6 7
>     6 7 8
>     7 8 9
>     8 9 10
>     9 10 11
>     10 11 12
> So the issue appears to be with your command line, script, or input data 
> file: please show the command line used to execute the script, attach the 
> complete awk script, and input data file for diagnosis, or selections of the
> latter piped through or output using cat -A to show control characters.
> Here they are (attached). The script was executed with the following 
> command:> gawk -f peak_intensity_correln_plot_compile.awk
Input files have <CR><LF> \r\n <ctrl-M><ctrl-J> line terminators and those are
carried thru at the ends of the string fields:

$ gawk -f peak_intensity_correln_plot_compile.awk
$ file *cPuMP*.dat
2NH2-cPuMP_nh_-_pk_Fit_Height_data.dat:            ASCII text, with CRLF line
terminators
cPuMP_nh_-_pk_Fit_Height_data.dat:                 ASCII text, with CRLF line
terminators
Fit_Height_correln_plot_-_cPuMP_vs_2NH2-cPuMP.dat: ASCII text, with CR, LF line
terminators
$ cat -A Fit_Height_correln_plot_-_cPuMP_vs_2NH2-cPuMP.dat | head
1571697^M 1716833^M 224$
2672863^M 2894992^M 225$
2184902^M 9710015^M 226$
4393362^M 4095908^M 227$
3828609^M 4218978^M 229$
6285045^M 4008320^M 233$
3936959^M 4104667^M 234$
1698322^M 1942553^M 237$
4144791^M 4346435^M 238$
2546328^M 2804338^M 239$

You could change your input line terminators to "\r\n" e.g. option -vRS="\r\n",
insert '{ sub( /\r$/, ""); before each 'split(x, s, " ")', convert your input
fields from strings to numbers by adding zero i.e. cai[i] += 0; rpi[i] += 0; or
use belts, braces, and suspenders with all three, e.g.

$ gawk -vRS="\r\n" -f peak_intensity_correln_plot_compile.awk
$ file *cPuMP*.dat
2NH2-cPuMP_nh_-_pk_Fit_Height_data.dat:            ASCII text, with CRLF line
terminators
cPuMP_nh_-_pk_Fit_Height_data.dat:                 ASCII text, with CRLF line
terminators
Fit_Height_correln_plot_-_cPuMP_vs_2NH2-cPuMP.dat: ASCII text
$ cat -A Fit_Height_correln_plot_-_cPuMP_vs_2NH2-cPuMP.dat | head
1571697 1716833 224$
2672863 2894992 225$
2184902 9710015 226$
4393362 4095908 227$
3828609 4218978 229$
6285045 4008320 233$
3936959 4104667 234$
1698322 1942553 237$
4144791 4346435 238$
2546328 2804338 239$

-- 
Take care. Thanks, Brian Inglis, Calgary, Alberta, Canada

This email may be disturbing to some readers as it contains
too much technical detail. Reader discretion is advised.
[Data in IEC units and prefixes, physical quantities in SI.]
--
Problem reports:      https://cygwin.com/problems.html
FAQ:                  https://cygwin.com/faq/
Documentation:        https://cygwin.com/docs.html
Unsubscribe info:     https://cygwin.com/ml/#unsubscribe-simple

- Raw text -


  webmaster     delorie software   privacy  
  Copyright © 2019   by DJ Delorie     Updated Jul 2019