Mail Archives: cygwin/2020/07/28/13:08:57
X-Recipient: | archive-cygwin AT delorie DOT com
|
X-Original-To: | cygwin AT cygwin DOT com
|
Delivered-To: | cygwin AT cygwin DOT com
|
DMARC-Filter: | OpenDMARC Filter v1.3.2 sourceware.org D7599384C005
|
Authentication-Results: | sourceware.org; dmarc=none (p=none dis=none)
|
| header.from=SystematicSw.ab.ca
|
Authentication-Results: | sourceware.org;
|
| spf=none smtp.mailfrom=brian DOT inglis AT systematicsw DOT ab DOT ca
|
X-Authority-Analysis: | v=2.3 cv=ecemg4MH c=1 sm=1 tr=0
|
| a=kiZT5GMN3KAWqtYcXc+/4Q==:117 a=kiZT5GMN3KAWqtYcXc+/4Q==:17
|
| a=IkcTkHD0fZMA:10 a=w_pzkKWiAAAA:8 a=D-hjUeOzi2gWLipTs3oA:9
|
| a=7Zwj6sZBwVKJAoWSPKxL6X1jA+E=:19 a=QEXdDO2ut3YA:10 a=IcnSy03WLZUA:10
|
| a=sRI3_1zDfAgwuvI8zelB:22
|
Subject: | Re: Problem with output from gawk software in recent Cygwin
|
| installation
|
To: | cygwin AT cygwin DOT com
|
References: | <CAC7Qdh1bPAejJZkV6GY96kcAVKNFDvGcTBpbSfXY6jC-aMGsiQ AT mail DOT gmail DOT com>
|
| <1729e34a-245c-2e87-6604-bebf8e08d62b AT cs DOT umass DOT edu>
|
| <000601d6643e$7153de00$53fb9a00$@rogers.com>
|
| <3609a7fa-6024-2091-f2fb-b3052ab8f051 AT SystematicSw DOT ab DOT ca>
|
| <CAC7Qdh3AEkbX849S+VyDgSm0_WYRaP5d0zT2QOvmWpd3t3PMkQ AT mail DOT gmail DOT com>
|
| <ab599eb1-cc52-2a91-2a95-66c929a86a05 AT SystematicSw DOT ab DOT ca>
|
| <CAC7Qdh1kR-PuxFOVxeCeA8fZGCsv_2jz_6_JqKaLffhtdGB+Ag AT mail DOT gmail DOT com>
|
From: | Brian Inglis <Brian DOT Inglis AT SystematicSw DOT ab DOT ca>
|
Autocrypt: | addr=Brian DOT Inglis AT SystematicSw DOT ab DOT ca; prefer-encrypt=mutual;
|
| keydata=
|
| mDMEXopx8xYJKwYBBAHaRw8BAQdAnCK0qv/xwUCCZQoA9BHRYpstERrspfT0NkUWQVuoePa0
|
| LkJyaWFuIEluZ2xpcyA8QnJpYW4uSW5nbGlzQFN5c3RlbWF0aWNTdy5hYi5jYT6IlgQTFggA
|
| PhYhBMM5/lbU970GBS2bZB62lxu92I8YBQJeinHzAhsDBQkJZgGABQsJCAcCBhUKCQgLAgQW
|
| AgMBAh4BAheAAAoJEB62lxu92I8Y0ioBAI8xrggNxziAVmr+Xm6nnyjoujMqWcq3oEhlYGAO
|
| WacZAQDFtdDx2koSVSoOmfaOyRTbIWSf9/Cjai29060fsmdsDLg4BF6KcfMSCisGAQQBl1UB
|
| BQEBB0Awv8kHI2PaEgViDqzbnoe8B9KMHoBZLS92HdC7ZPh8HQMBCAeIfgQYFggAJhYhBMM5
|
| /lbU970GBS2bZB62lxu92I8YBQJeinHzAhsMBQkJZgGAAAoJEB62lxu92I8YZwUBAJw/74rF
|
| IyaSsGI7ewCdCy88Lce/kdwX7zGwid+f8NZ3AQC/ezTFFi5obXnyMxZJN464nPXiggtT9gN5
|
| RSyTY8X+AQ==
|
Organization: | Systematic Software
|
Message-ID: | <d39444d3-732f-68b2-f870-59c99fd62c32@SystematicSw.ab.ca>
|
Date: | Tue, 28 Jul 2020 11:08:07 -0600
|
User-Agent: | Mozilla/5.0 (Windows NT 10.0; WOW64; rv:68.0) Gecko/20100101
|
| Thunderbird/68.10.0
|
MIME-Version: | 1.0
|
In-Reply-To: | <CAC7Qdh1kR-PuxFOVxeCeA8fZGCsv_2jz_6_JqKaLffhtdGB+Ag@mail.gmail.com>
|
X-CMAE-Envelope: | MS4wfCV2oduwC9kPfyfoJsKqicEBid04+OgI30+UwOOMFS9LYRIlYCb57FbT+4yikR1rvdixiO2goj4abiX65nM6V7e9tk2nMhDqB3GxgKFVC93N9tPkLYdy
|
| ytKex68fq44iiF74MPVdxH07Qe8KypWzen3anwEpd1GQED/VKqz7+GyMVcCSTH7HJDnbOEYKPK4ePQ==
|
X-Spam-Status: | No, score=-8.0 required=5.0 tests=BAYES_00, KAM_DMARC_STATUS,
|
| KAM_LAZY_DOMAIN_SECURITY, RCVD_IN_DNSWL_LOW, SPF_HELO_NONE, SPF_NONE,
|
| TXREP autolearn=no autolearn_force=no version=3.4.2
|
X-Spam-Checker-Version: | SpamAssassin 3.4.2 (2018-09-13) on
|
| server2.sourceware.org
|
X-BeenThere: | cygwin AT cygwin DOT com
|
X-Mailman-Version: | 2.1.29
|
List-Id: | General Cygwin discussions and problem reports <cygwin.cygwin.com>
|
List-Unsubscribe: | <https://cygwin.com/mailman/options/cygwin>,
|
| <mailto:cygwin-request AT cygwin DOT com?subject=unsubscribe>
|
List-Archive: | <https://cygwin.com/pipermail/cygwin/>
|
List-Post: | <mailto:cygwin AT cygwin DOT com>
|
List-Help: | <mailto:cygwin-request AT cygwin DOT com?subject=help>
|
List-Subscribe: | <https://cygwin.com/mailman/listinfo/cygwin>,
|
| <mailto:cygwin-request AT cygwin DOT com?subject=subscribe>
|
Reply-To: | cygwin AT cygwin DOT com
|
Errors-To: | cygwin-bounces AT cygwin DOT com
|
Sender: | "Cygwin" <cygwin-bounces AT cygwin DOT com>
|
On 2020-07-28 09:32, Bryan VanSchouwen via Cygwin wrote:
> On Tue, Jul 28, 2020 at 12:06 AM Brian Inglis wrote:
>> On 2020-07-27 15:58, Bryan VanSchouwen wrote:
>>> On Mon, Jul 27, 2020 at 4:20 PM Brian Inglis wrote:
>>>> On 2020-07-27 11:50, Michel LaBarre wrote:
>>>>> On July 27, 2020 12:52 PM, Eliot Moss wrote:
>>>>>> On 7/27/2020 11:47 AM, Bryan VanSchouwen wrote:
>>>>>>> I just tried executing an awk script using the most recent version
>>>>>>> of gawk, but the output did not turn out the way that it was supposed
>>>>>>> to.
>>>>>>> This script uses the following command to print the output data to
>>>>>>> the output file:
>>>>>>> print(cai[i], rpi[i], i) >
>>>>>>> "Fit_Height_correln_plot_-_cPuMP_vs_2NH2-cPuMP.dat"
>>>>>>> and previously, this command always printed the values of the three
>>>>>>> variables on a single line, separated by spaces; however, now the
>>>>>>> gawk software is automatically adding hard-returns between the
>>>>>>> values, resulting in the three values being printed on separate lines
>>>>>>> within the data file.
>>>>>>> What is going on here, and how do I permanently make it stop??
>>
>>>>> Here's a wondering: Could it have to do with line endings? If Windows
>>>>> CRLF is getting in there, then the variables might get a CR in them,
>>>>> which might do weird things. This assumes those are string variables,
>>>>> not numeric.
>>
>>>> Better yet, how about an example using manifest constants in a one line
>>>> sample to eliminate impact of arrays or changes in input data as in:
>> gawk
>>>> 'BEGIN {print(1,2,3)}' or gawk 'BEGIN {print(1,2,3) > "xxx.txt"}'>
>>> No problem with awk or gawk:
>>> $ for ((i = 0; i < 10; ++i))
>>> do
>>> printf "%d %d %d %d\n" $((i+1)) $((i+2)) $((i+3)) $((i+4))
>>> done > test.txt
>>> $ awk '{print($1, $2, $3)}' test.txt
>>> 1 2 3
>>> 2 3 4
>>> 3 4 5
>>> 4 5 6
>>> 5 6 7
>>> 6 7 8
>>> 7 8 9
>>> 8 9 10
>>> 9 10 11
>>> 10 11 12
>>> So the issue appears to be with your command line, script, or input data
>>> file: please show the command line used to execute the script, attach
>> the
>>> complete awk script, and input data file for diagnosis, or selections of
>> the
>>> latter piped through or output using cat -A to show control characters.
>>> Here they are (attached). The script was executed with the following
>>> command:> gawk -f peak_intensity_correln_plot_compile.awk
>> Input files have <CR><LF> \r\n <ctrl-M><ctrl-J> line terminators and those
>> are
>> carried thru at the ends of the string fields:
>>
>> $ gawk -f peak_intensity_correln_plot_compile.awk
>> $ file *cPuMP*.dat
>> 2NH2-cPuMP_nh_-_pk_Fit_Height_data.dat: ASCII text, with CRLF
>> line
>> terminators
>> cPuMP_nh_-_pk_Fit_Height_data.dat: ASCII text, with CRLF
>> line
>> terminators
>> Fit_Height_correln_plot_-_cPuMP_vs_2NH2-cPuMP.dat: ASCII text, with CR, LF
>> line
>> terminators
>> $ cat -A Fit_Height_correln_plot_-_cPuMP_vs_2NH2-cPuMP.dat | head
>> 1571697^M 1716833^M 224$
>> 2672863^M 2894992^M 225$
>> 2184902^M 9710015^M 226$
>> 4393362^M 4095908^M 227$
>> 3828609^M 4218978^M 229$
>> 6285045^M 4008320^M 233$
>> 3936959^M 4104667^M 234$
>> 1698322^M 1942553^M 237$
>> 4144791^M 4346435^M 238$
>> 2546328^M 2804338^M 239$
>>
>> You could change your input line terminators to "\r\n" e.g. option
>> -vRS="\r\n",
>> insert '{ sub( /\r$/, ""); before each 'split(x, s, " ")', convert your
>> input
>> fields from strings to numbers by adding zero i.e. cai[i] += 0; rpi[i] +=
>> 0; or
>> use belts, braces, and suspenders with all three, e.g.
>>
>> $ gawk -vRS="\r\n" -f peak_intensity_correln_plot_compile.awk
>> $ file *cPuMP*.dat
>> 2NH2-cPuMP_nh_-_pk_Fit_Height_data.dat: ASCII text, with CRLF
>> line
>> terminators
>> cPuMP_nh_-_pk_Fit_Height_data.dat: ASCII text, with CRLF
>> line
>> terminators
>> Fit_Height_correln_plot_-_cPuMP_vs_2NH2-cPuMP.dat: ASCII text
>> $ cat -A Fit_Height_correln_plot_-_cPuMP_vs_2NH2-cPuMP.dat | head
>> 1571697 1716833 224$
>> 2672863 2894992 225$
>> 2184902 9710015 226$
>> 4393362 4095908 227$
>> 3828609 4218978 229$
>> 6285045 4008320 233$
>> 3936959 4104667 234$
>> 1698322 1942553 237$
>> 4144791 4346435 238$
>> 2546328 2804338 239$
> Just out of curiosity: Could this "<CR><LF>" issue be something new for
> Windows 10? I ask because I don't recall having this issue with my old
> Windows 7 computer.
If you had over 3 year old Cygwin packages on your Windows 7 system, as
changes for POSIX compatibility were made in the builds for test releases of
gawk, grep, sed coordinated and announced together in:
https://cygwin.com/legacy-ml/cygwin/2017-02/threads.html#00152
Perhaps the issue is in whatever generated/s the files, or whatever you had
installed and in your path on Windows 7. Msys and Mingw versions of gawk may
ignore extra <CR>s on input, and may possibly also be included with Git for
Windows, or other Windows Unix-like packages.
--
Take care. Thanks, Brian Inglis, Calgary, Alberta, Canada
This email may be disturbing to some readers as it contains
too much technical detail. Reader discretion is advised.
[Data in IEC units and prefixes, physical quantities in SI.]
--
Problem reports: https://cygwin.com/problems.html
FAQ: https://cygwin.com/faq/
Documentation: https://cygwin.com/docs.html
Unsubscribe info: https://cygwin.com/ml/#unsubscribe-simple
- Raw text -