| To: | cygwin AT cygwin DOT com |
| From: | Ralf <wiesweg AT tacos-gmbh DOT de> |
| Subject: | Re: length in gawk returns wrong value |
| Date: | Thu, 19 Jul 2012 11:27:01 +0000 (UTC) |
| Message-ID: | <loom.20120719T131247-62@post.gmane.org> |
| References: | <loom DOT 20120719T103849-659 AT post DOT gmane DOT org> <20120719092024 DOT GA31055 AT calimero DOT vinschen DOT de> |
Corinna Vinschen <corinna-cygwin <at> cygwin.com> writes:
>
> Uh oh. 1.7.9 is old. Please update.
>
> > 0000000 R 374 c k e n \r \n
> > 0000010
> > Length: 1
> >
> > What can I do to get the correct length in gawk without changing
> > ttt.txt?
>
> Dunno. This is not what I see. What did you have $LANG and $LC_CTYPE
> set to? Here's what I see:
>
> $ uname -a
> CYGWIN_NT-6.1 vmbert7 1.7.16(0.261/5/3) 2012-07-09 14:51 i686 Cygwin
>
> $ echo $LANG
> C.UTF-8
>
> $ echo "Rücken" > ttt.txt
> $ od -c ttt.txt
> 0000000 R 303 274 c k e n \n
> 0000010
>
> $ gawk '{print "Length: " length($0)}' ttt.txt
> Length: 6
>
> $ gawk --version | head -1
> GNU Awk 4.0.1
>
> Corinna
>
After updating, I added the following lines at the top of my script:
export LANG=C.UTF-8
echo LANG: $LANG
echo LC_CTYPE: $LC_TYPE
c:/unix/bin/gawk --version | head -1
And this is my output:
LANG: C.UTF-8
LC_CTYPE:
GNU Awk 4.0.1
CYGWIN_NT-6.0-WOW64 WIESWEG 1.7.15(0.260/5/3) 2012-05-09 10:25 i686 Cygwin
0000000 R 374 c k e n \r \n
0000010
Length: 5
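
Note on the two od dumps in this thread: the file here contains the single byte 374 (octal, 0xFC), which is "ü" in ISO-8859-1, while the file in the quoted reply contains the two-byte UTF-8 sequence 303 274 (0xC3 0xBC). The sketch below only illustrates that bytes-versus-characters difference. It assumes the file was saved as Latin-1 (or a compatible Windows code page), the function name record_lengths is made up for this example, and it does not model how gawk treats bytes that are invalid in the current locale.

```python
# Bytes exactly as shown by `od -c` in the thread.
latin1_file = b"R\xfccken\r\n"      # R 374 c k e n \r \n
utf8_file = b"R\xc3\xbccken\n"      # R 303 274 c k e n \n


def record_lengths(raw, encoding):
    """Return (byte_count, char_count) for the first line of `raw`.

    The trailing newline is dropped, like an awk record separator.
    char_count is None when the bytes are not valid in `encoding`.
    """
    record = raw.split(b"\n", 1)[0]
    try:
        chars = len(record.decode(encoding))
    except UnicodeDecodeError:
        chars = None
    return len(record), chars


if __name__ == "__main__":
    print(record_lengths(utf8_file, "utf-8"))      # UTF-8 data read as UTF-8
    print(record_lengths(latin1_file, "utf-8"))    # Latin-1 data read as UTF-8
    print(record_lengths(latin1_file, "latin-1"))  # Latin-1 data read as Latin-1
```

The lone 0xFC byte is not a valid UTF-8 sequence, so a tool that counts characters in a UTF-8 locale has to fall back to some implementation-specific handling for it. Also note that the \r from the DOS line ending is part of the record and is counted like any other character.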
Very strange!
But after adding
export LC_CTYPE=C
I got the correct result.
Thanks for your quick help!
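
Note on why LC_CTYPE=C takes effect even though LANG is still C.UTF-8: under the POSIX rules, the character-classification locale is taken from LC_ALL if it is set and non-empty, otherwise from LC_CTYPE, otherwise from LANG. A minimal sketch of that lookup order follows. The function name effective_lc_ctype is hypothetical, and the final fallback to "C" is an assumption, since the default for a fully unset environment is implementation-defined.

```python
def effective_lc_ctype(env):
    """Return the locale name that governs LC_CTYPE for the given environment.

    POSIX precedence: LC_ALL, then LC_CTYPE, then LANG.
    Empty values are treated as unset.
    """
    for name in ("LC_ALL", "LC_CTYPE", "LANG"):
        value = env.get(name)
        if value:
            return value
    return "C"  # assumed default; implementation-defined in practice


if __name__ == "__main__":
    # Only LANG set, as in the first run of the script.
    print(effective_lc_ctype({"LANG": "C.UTF-8"}))
    # LANG plus the LC_CTYPE override that fixed the result.
    print(effective_lc_ctype({"LANG": "C.UTF-8", "LC_CTYPE": "C"}))
```

To inspect the real environment, the same function can be called with os.environ.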