delorie.com/archives/browse.cgi   search  
Mail Archives: cygwin/2009/07/27/22:53:08

X-Recipient: archive-cygwin AT delorie DOT com
X-SWARE-Spam-Status: No, hits=-2.3 required=5.0 tests=AWL,BAYES_00,J_CHICKENPOX_42,SPF_PASS
X-Spam-Check-By: sourceware.org
From: Haojun Bao <baohaojun AT gmail DOT com>
To: cygwin AT cygwin DOT com
Subject: Re: Emacs can't start-process more than 30~40 processes (Was: Re: Emacs w3m `w3m-toggle-inline-images' cause segfault)
References: <83iqhlbgoc DOT fsf AT gmail DOT com> <4A6727A8 DOT 2090905 AT cornell DOT edu> <83iqhkur2h DOT fsf AT gmail DOT com> <834osy98jo DOT fsf_-_ AT gmail DOT com>
Date: Tue, 28 Jul 2009 10:52:44 +0800
In-Reply-To: <834osy98jo.fsf_-_@gmail.com> (Haojun Bao's message of "Mon, 27 Jul 2009 10:14:19 +0800")
Message-ID: <83tz0xtt6c.fsf@gmail.com>
User-Agent: Gnus/5.13 (Gnus v5.13) Emacs/23.0.96 (cygwin)
MIME-Version: 1.0
X-IsSubscribed: yes
Mailing-List: contact cygwin-help AT cygwin DOT com; run by ezmlm
List-Id: <cygwin.cygwin.com>
List-Unsubscribe: <mailto:cygwin-unsubscribe-archive-cygwin=delorie DOT com AT cygwin DOT com>
List-Subscribe: <mailto:cygwin-subscribe AT cygwin DOT com>
List-Archive: <http://sourceware.org/ml/cygwin/>
List-Post: <mailto:cygwin AT cygwin DOT com>
List-Help: <mailto:cygwin-help AT cygwin DOT com>, <http://sourceware.org/ml/#faqs>
Sender: cygwin-owner AT cygwin DOT com
Mail-Followup-To: cygwin AT cygwin DOT com
Delivered-To: mailing list cygwin AT cygwin DOT com

Haojun Bao <baohaojun AT gmail DOT com> writes:

> I have reduced the test case in this mail 
>   http://cygwin.com/ml/cygwin/2009-07/msg00111.html
> to a simpler one:
>
>     $/bin/emacs --batch -q  --execute '(let ((num 0))
>       (while (< num 30)
>         (setq num (+ num 1))
>         (message "num is %d" num)
>         (start-process "hello" nil "/usr/bin/echo")))'
>     
> Emacs will coredump at the 30th process it tries to start on my XP.
>
> Now I think this should seem familiar to the experts. I tried to gdb it,
> the backtrace shows segfault is happening at the same place 
> (#0 0x610949d8 in fhandler_pipe::create () from /usr/bin/cygwin1.dll)
>
> You might need to tweak the (< num 30) to (< num 40) or bigger, also, to
> use gdb on it, you need write the lisp into a file and use `-l' to load
> the file:
>
> cat > ~/2.el <<End
> (let ((num 0))
>   (while (< num 40)
>     (setq num (+ num 1))
>     (message "num is %d" num)
>     (start-process "hello" nil "/usr/bin/echo")))
> End
>
> gdb --args ./emacs --batch -q  -l ~/2.el

I have debugged it again, and I think I have more clue. I have read the
how-cygheap-works.txt, and this might be a known problem.

It's because the cygheap space has been used up. With Procexp, I can see
cygwin1.dll is based 0x61000000, with size 0x300000 (3M). When segfault
is about to happen, cygheap_max is 0x6164e924, and the _csbrk is called
with a increase of 
    (gdb) p sbs
    $16 = 65544
This will increase cyghead to 0x6165e92c, but from Proxexp I can see
cygncurses-9.dll is based at 0x61650000.

Besides, this code snippet from pipe.cc fhandler_pipe::create didn't
check for NULL pointer, and directly caused the segfault.

      fhs[0] = (fhandler_pipe *) build_fh_dev (*piper_dev);
      fhs[1] = (fhandler_pipe *) build_fh_dev (*pipew_dev);

//bhj: we should check NULL here.

      mode |= mode & O_TEXT ?: O_BINARY;
      fhs[0]->init (r, FILE_CREATE_PIPE_INSTANCE | GENERIC_READ, mode);
      fhs[1]->init (w, FILE_CREATE_PIPE_INSTANCE | GENERIC_WRITE, mode);

My question is, is there anyway out of this? Can I just rebase
cygwin1.dll to the end of all other DLLs? 

--
Problem reports:       http://cygwin.com/problems.html
FAQ:                   http://cygwin.com/faq/
Documentation:         http://cygwin.com/docs.html
Unsubscribe info:      http://cygwin.com/ml/#unsubscribe-simple

- Raw text -


  webmaster     delorie software   privacy  
  Copyright © 2019   by DJ Delorie     Updated Jul 2019