delorie.com/archives/browse.cgi   search  
Mail Archives: cygwin/2006/04/26/18:15:56

X-Spam-Check-By: sourceware.org
Message-ID: <444FF10F.7040204@scytek.de>
Date: Wed, 26 Apr 2006 18:15:43 -0400
From: Volker Quetschke <quetschke AT scytek DOT de>
User-Agent: Mozilla Thunderbird 1.0.7 (Windows/20050923)
MIME-Version: 1.0
To: cygwin AT cygwin DOT com
Subject: Re: Reproducible hang with unusual border conditions
References: <444FE899 DOT 4010102 AT scytek DOT de> <20060426214823 DOT GA22097 AT trixie DOT casa DOT cgf DOT cx>
In-Reply-To: <20060426214823.GA22097@trixie.casa.cgf.cx>
X-IsSubscribed: yes
Mailing-List: contact cygwin-help AT cygwin DOT com; run by ezmlm
List-Subscribe: <mailto:cygwin-subscribe AT cygwin DOT com>
List-Archive: <http://sourceware.org/ml/cygwin/>
List-Post: <mailto:cygwin AT cygwin DOT com>
List-Help: <mailto:cygwin-help AT cygwin DOT com>, <http://sourceware.org/ml/#faqs>
Sender: cygwin-owner AT cygwin DOT com
Mail-Followup-To: cygwin AT cygwin DOT com
Delivered-To: mailing list cygwin AT cygwin DOT com

--------------enig4D16CF9CBD6E0D48C0945684
Content-Type: text/plain; charset=ISO-8859-1; format=flowed
Content-Transfer-Encoding: 7bit



Christopher Faylor wrote:
> On Wed, Apr 26, 2006 at 05:39:37PM -0400, Volker Quetschke wrote:
> 
>>On my search for the infamous cygwin hang while building OOo,
>>see <http://sources.redhat.com/ml/cygwin/2005-12/msg00279.html>,
>>and <http://wiki.services.openoffice.org/wiki/Windows#Miscellaneous_info>
>>that I couldn't reproduce myself (WJFFM) I finally managed to get a
>>reproducible hang on my systems that might be related.
>>
>>As said, the original OOo build works just fine here, but there is
>>an alternative possibility to build OOo with 4nt instead of tcsh, and
> 
>>from that I managed to reproduce a small testcase. Unfortunately
> 
>>it requires 4nt :(  and some strict/strange border conditions:
>>
>>1. It only hangs from rxvt. (Invoked with shortlink from desktop to:
>>C:\cygwin\bin\rxvt.exe  -cr green -sl 1000 -tn linux -e /usr/bin/bash 
>>--login -i)
>>I guess the actual switches don't matter.  Startting the following
>>example from the bash that is started from the cygwin icon (cygwin.bat)
>>didn't show the problem so far.
>>
>>2. Only hangs for "real" background processes started with setsid. From rxvt
>>like this:
>>$ setsid ./hang.sh >& hang.log
>>
>>You need to unpack the following testcase,
>><http://www.openoffice.org/nonav/issues/showattachment.cgi/36076/w32-4nt-hang.zip>
>>it contains the following files:
>>-rwxr-xr-x 1 quetschke None 147456 Apr  3 20:19 dmake.exe
>>-rwxr-xr-x 1 quetschke None      3 Apr 26 10:34 hang.XXX$
>>-rwxr-xr-x 1 quetschke None 192635 Apr 26 16:45 hang.log
>>-rwxr-xr-x 1 quetschke None    285 Apr 26 13:30 hang.mk
>>-rwxr-xr-x 1 quetschke None    133 Apr 26 13:22 hang.sh
>>
>>The dmake.exe is a native w32 version, from a m164 OO build with W32-4nt.
>>
>>For me, after entering the above command and letting it run for a few 
>>minutes,
>>I get a 4nt process that is stuck at 99% and no new output reaches hang.log.
>>
>>It definitely looks like 4nt is hanging, *BUT*
>>$ ps
>>     PID    PPID    PGID     WINPID  TTY  UID    STIME COMMAND
>>    3972       1    3972       3972  con 1006 16:35:12 /usr/bin/rxvt
>>I    3996    3972    3996       4008    0 1006 16:35:12 /usr/bin/bash
>>    3804       1    3804        736    ? 1006 16:49:29 /usr/bin/bash
>>     828    3804    3804       3760    ? 1006 16:52:26
>>/cygdrive/d/w1/w32-4nt-hang/dmake
>>
>>if I kill the calling cygwin script, kill 3804 in this case, 4nt stops to
>>loop.
>>
>>As you can see in hang.mk, I tried to reproduce the hang with cmd.exe, but I
>>cannot get to hang that either.
>>
>>This is a very elusive problem :(
> 
> 
> So, if I'm understanding what you're saying correctly, you are seeing a
> hang in a non-cygwin process (4nt) which can't be duplicated in a
> program with similar functionality (cmd) so therefore, your conclusion
> is that there is a cygwin bug?
> 
> Please verify what the identity of the actual hung program by running
> task manager.  If it really is a non-cygwin program that is having problems
> then I'm not tremendously concerned.
The 99% come from the task manager, obviously it is a non-cygwin program
that hangs.

But it looks suspicious that killing the toplevel cygwin
bash script stops the looping 4nt process - Through another native
W32 proc (hang.sh -> dmake -> 4nt). And also that
$ ls /proc/*/fd
now hangs until (see original problem with OOo build)
the 4nt or the script are killed looks strange.

I'm not trying to fix third party programs here, I'm clutching
every straw to produce a reproducible hang so that this problem
can be solved for good.

   Volker

-- 
PGP/GPG key  (ID: 0x9F8A785D)  available  from  wwwkeys.de.pgp.net
key-fingerprint 550D F17E B082 A3E9 F913  9E53 3D35 C9BA 9F8A 785D

--------------enig4D16CF9CBD6E0D48C0945684
Content-Type: application/pgp-signature; name="signature.asc"
Content-Description: OpenPGP digital signature
Content-Disposition: attachment; filename="signature.asc"

-----BEGIN PGP SIGNATURE-----
Version: GnuPG v1.4.2.1 (MinGW)
Comment: Using GnuPG with Thunderbird - http://enigmail.mozdev.org

iD8DBQFET/ETPTXJup+KeF0RAqKqAJ9QvtVs88E9wA+uWsnMyKHvLTZXfACfURKH
C4lWhfNbOFurbeWhV2IF26I=
=Yajs
-----END PGP SIGNATURE-----

--------------enig4D16CF9CBD6E0D48C0945684--

- Raw text -


  webmaster     delorie software   privacy  
  Copyright © 2019   by DJ Delorie     Updated Jul 2019