Mail Archives: cygwin/2023/08/25/06:51:15
On Aug 25 17:48, Takashi Yano via Cygwin wrote:
> On Thu, 24 Aug 2023 10:59:33 +0200
> Corinna Vinschen wrote:
> > > I'm not sure why at all, however, the following patch seems to
> > > solve the issue.
> > >
> > > diff --git a/winsup/cygwin/select.cc b/winsup/cygwin/select.cc
> > > index 7b9473849..de5794c9f 100644
> > > --- a/winsup/cygwin/select.cc
> > > +++ b/winsup/cygwin/select.cc
> > > @@ -1790,7 +1790,7 @@ peek_socket (select_record *me, bool)
> > > if (events & FD_WRITE)
> > > {
> > > wfd_set w = { 1, { fh->get_socket () } };
> > > - TIMEVAL t = { 0 };
> > > + TIMEVAL t = { .tv_sec = 0, .tv_usec = 1 };
> > >
> > > if (_win32_select (0, NULL, &w, NULL, &t) == 0)
> > > events &= ~FD_WRITE;
> >
> > Yeah, this is weird. A TIMEVAL value of 0 indicates non-blocking,
> > so why should waiting a usec make that better? It also potentially
> > slows down Cygwin's select noticably if multiple sockets are part
> > of the descriptor set.
> >
> > Hmmm.
> >
> > Is it possible that _win32_select returns with SOCKET_ERROR for
> > some reason?
> >
> > Unfortunately I'm a bit swamped ATM, but rather than setting t to 1
> > usec, what if the check goes:
> >
> > if (_win32_select (0, NULL, &w, NULL, &t) != 1)
> >
> > ?
>
> This did not help. I looked into this deeper and noticed that:
> 1) _win32_select() sometimes returns 0.
> 2) If _win32_select() returns 0, WaitForMultipleObjects(..., INFINITE)
> is called in thread_socket().
> 3) WaitForMultipleObjects() sometimes does not return for FD_WRITE
> for unknown reason.
> This causes the stall.
So the situation is that the network event handling returned FD_WRITE,
because it always returns FD_WRITE as long as a non-blocking send()
function didn't explicitely fail due to buffer overrun.
However, _win32_select will notice that the buffer is full, so it
does not return 1, but 0. I e., the socket is not ready for writing.
Now you're saying that it's possible that the following WFMO will
never return?
That would mean that the FD_WRITE event won't be triggered again because
it already *had* been triggered and the only way to re-enable it is to
call one of the send() functions (see
https://learn.microsoft.com/en-us/windows/win32/api/winsock2/nf-winsock2-wsaeventselect)
I don't have an answer to this problem yet.
Can we use send(sock, "", 0) to reenable FD_WRITE, perhaps?
Corinna
--
Problem reports: https://cygwin.com/problems.html
FAQ: https://cygwin.com/faq/
Documentation: https://cygwin.com/docs.html
Unsubscribe info: https://cygwin.com/ml/#unsubscribe-simple
- Raw text -