X-Recipient: archive-cygwin@delorie.com
X-SWARE-Spam-Status: No, hits=-1.9 required=5.0 	tests=AWL,BAYES_00,SARE_MSGID_LONG40,SPF_PASS
X-Spam-Check-By: sourceware.org
MIME-Version: 1.0
In-Reply-To: <380-2200912128193944786@cantv.net>
References: <380-2200912128193944786@cantv.net>
Date: Mon, 28 Dec 2009 22:37:57 +0000
Message-ID: <416096c60912281437o16aec4cct8b64b7518d9a9a1@mail.gmail.com>
Subject: Re: gcc4[1.7] printf treats differently a string constant and a  	character array
From: Andy Koppe <andy.koppe@gmail.com>
To: rodmedina@cantv.net, cygwin@cygwin.com
Content-Type: text/plain; charset=UTF-8
Content-Transfer-Encoding: quoted-printable
X-IsSubscribed: yes
Mailing-List: contact cygwin-help@cygwin.com; run by ezmlm
Precedence: bulk
List-Id: <cygwin.cygwin.com>
List-Unsubscribe: <mailto:cygwin-unsubscribe-archive-cygwin=delorie.com@cygwin.com>
List-Subscribe: <mailto:cygwin-subscribe@cygwin.com>
List-Archive: <http://sourceware.org/ml/cygwin/>
List-Post: <mailto:cygwin@cygwin.com>
List-Help: <mailto:cygwin-help@cygwin.com>, <http://sourceware.org/ml/#faqs>
Sender: cygwin-owner@cygwin.com
Mail-Followup-To: cygwin@cygwin.com
Delivered-To: mailing list cygwin@cygwin.com

2009/12/28 Rodrigo Medina:
> Hi,
> I am moving from cygwin-1.5 and gcc3.4 to cygwin1.7 and gcc4.
> Some simple programs of mine fail.
>
> I am using LC_ALL=3Des_VE.ISO-8859-15.
>
> I have reduced the problem to this example
>
> --------------
> #include <stdio.h>
> main()
> {
> static char* line1 =3D
> " This letter has an accent -->=C3=A1, this one has no accent -->a\n\n";
> static char* line2 =3D " ***** another line ******\n\n";
> static char* line3 =3D
> " These letters have an accent -->=C3=83=C2=A1, these ones have no accent=
 -->A!\n\n";
> static char* line4 =3D
> " This letter has an accent -->=C3=83, this one has no accent -->A\n\n";
> =C2=A0printf(" This letter has an accent -->=C3=A1, this one has no accent
> -->a\n\n");
> =C2=A0printf(line2);
> =C2=A0printf("%d %d %d\n\n",line1[29],line1[30],line1[31]);
> =C2=A0printf(line1);
> =C2=A0printf(line2);
> =C2=A0printf(" These letters have an accent -->=C3=83=C2=A1, these ones h=
ave no accent
> -->A!\n\n");
> =C2=A0printf(line2);
> =C2=A0printf("%d %d %d %d\n\n",line3[32],line3[33],line3[34],line3[35]);
> =C2=A0printf(line3);
> =C2=A0printf(line2);
> =C2=A0printf(" This letter has an accent -->=C3=83, this one has no accent
> -->A\n\n");
> =C2=A0printf(line2);
> =C2=A0printf("%d %d %d\n\n",line4[29],line4[30],line4[31]);
> =C2=A0printf(line4);
> =C2=A0printf(line2);
> =C2=A0printf(" ----- END ------");
> }----------------
>
> My output is:
>
> =C2=A0This letter has an accent -->=C3=A1, this one has no accent -->a
>
> =C2=A0***** another line ******
>
> 62 -31 44
>
> =C2=A0This letter has an accent --> ***** another line ******
>
> =C2=A0These letters have an accent -->=C3=83=C2=A1, these ones have no ac=
cent -->A!
>
> =C2=A0***** another line ******
>
> 62 -61 -95 44
>
> =C2=A0These letters have an accent -->=C3=83=C2=A1, these ones have no ac=
cent -->A!
>
> =C2=A0***** another line ******
>
> =C2=A0This letter has an accent -->=C3=83, this one has no accent -->A
>
> =C2=A0***** another line ******
>
> 62 -61 44
>
> =C2=A0This letter has an accent --> ***** another line ******
>
> =C2=A0----- END ------
>
> As you can see the output of printf(string_constant) is what
> I expected. The ouput of printf(char_array) is trucated at the non-ASCII
> character.

Reproduced. Looking at the compiler's assembly output, some of the
printf() calls are replaced by calls to puts(), and those do work
correctly, whereas the remaining printf() calls with accented
characters misbehave. So printf()'s handling of non-ASCII characters
needs a closer look.

Andy

--
Problem reports:       http://cygwin.com/problems.html
FAQ:                   http://cygwin.com/faq/
Documentation:         http://cygwin.com/docs.html
Unsubscribe info:      http://cygwin.com/ml/#unsubscribe-simple

