Mailing-List: contact cygwin-help AT cygwin DOT com; run by ezmlm List-Subscribe: List-Archive: List-Post: List-Help: , Sender: cygwin-owner AT cygwin DOT com Mail-Followup-To: cygwin AT cygwin DOT com Delivered-To: mailing list cygwin AT cygwin DOT com Message-ID: <20041004140220.48753.qmail@web60308.mail.yahoo.com> Date: Mon, 4 Oct 2004 07:02:20 -0700 (PDT) From: Patrick Samson Subject: Re: Cygserver 100% CPU (was: References to both cygwin1.dll and msvcrt.dl To: cygwin AT cygwin DOT com In-Reply-To: <20041004093312.GC20965@cygbert.vinschen.de> MIME-Version: 1.0 Content-Type: multipart/mixed; boundary="0-543418162-1096898540=:48529" X-IsSubscribed: yes Note-from-DJ: This may be spam --0-543418162-1096898540=:48529 Content-Type: text/plain; charset=us-ascii Content-Id: Content-Disposition: inline --- Corinna Vinschen wrote: > On Oct 1 07:24, Patrick Samson wrote: > > > > --- Corinna Vinschen wrote: > > > > > On Sep 30 23:41, Patrick Samson wrote: > > > > Now, when it's wrong, I can see: > > > > good morning (error=4)! > > > > Error 4 is EINTR on the return of msleep(). > > > > Subsequently semop() returns with this EINTR. > > > > > > Are you set up to build cygwin? If so, could > you > > > please test the > > > following patch to cygserver and if it changes > > > anything for you? > > > > Same behaviour. > > As soon as there are some error=4, it will hang. > > On service stop, postgres may stop some of its > > backends, but not all of them, and stay in > > 'Stopping' state. > > I'm still hoping for a simple testcase... > I'm still working on it (the problem, not the testcase, as it is probably a race condition). I'm looking at traces from cygserver.log. I found something strange. We may not focus too much on error=4. For each EINTR the semop() is called again and all is back to sleep. Even before the first error=4, the last op=1 is supposed to wake up all sem[] of the semid. But I could see that: sem 1 is never mentioned anywhere (?, but why not) sem 0, 10 are not in sleeping state, OK sem 2,3,4,5,6,7,8,9,10,12,14,15 are waked up, and set back to sleep, OK sem 11 and 13 ARE MISSING ! Why aren't 11 and 13 waked up? Because there were not put in a good state of sleeping? Hum... Look at the attached extract. You will see that inside the semop() for sem[11], the trace output is suspended and during that time 3 other semop() arrived, and even 1 more just after. So 4 calls are waiting for mutex semid at the same time. Operations for sem 9, 14, 2, 15 seem to be OK as they are well bound between Locked/Unlocked. But NOT for 13, for which the Unlocked seems to appear after the end of sem 15. The first Unlocked should be the one for putting sem 11 to sleep, but many things may have arrived before to reach the WaitForMultipleObjects(). So I suspect that something is corrupted in regard with the Event to wait for. Note: when everything was stopped, sem 11 was able to return from sleeping with EINTR. No trace for sem 13. But reminder that some processes, but all, were able to stop correctly. _______________________________ Do you Yahoo!? Declare Yourself - Register online to vote today! http://vote.yahoo.com --0-543418162-1096898540=:48529 Content-Type: application/octet-stream; name="cygserver9.log" Content-Transfer-Encoding: base64 Content-Description: cygserver9.log Content-Disposition: attachment; filename="cygserver9.log" Y3lnc2VydmVyOiBzeXN2X3NlbS5jYywgbGluZSA5Njg6IGNhbGwgdG8gc2Vt b3AoNjU1MzcsIDB4MjJFNDQ0LCAxKQ0KDQpjeWdzZXJ2ZXI6IHN5c3Zfc2Vt LmNjLCBsaW5lIDk5NDogVHJ5IGxvY2tpbmcgbXV0ZXggc2VtaWQNCmN5Z3Nl cnZlcjogc3lzdl9zZW0uY2MsIGxpbmUgOTk0OiBMb2NrZWQgICAgICBtdXRl eCBzZW1pZA0KY3lnc2VydmVyOiBzeXN2X3NlbS5jYywgbGluZSAxMDQ5OiBz ZW1vcDogIHNlbWFwdHI9QTA1NkRGMCwgc2VtX2Jhc2U9QTA1NjUwQywgc2Vt cHRyPUEwNTY1OTAsIHNlbVsxMV09MCA6IG9wPS0xLCBmbGFnPXdhaXQNCg0K Y3lnc2VydmVyOiBzeXN2X3NlbS5jYywgbGluZSAxMDUzOiBzZW1vcDogIGNh bid0IGRvIGl0IG5vdw0KDQpjeWdzZXJ2ZXI6IHN5c3Zfc2VtLmNjLCBsaW5l IDEwODY6IHNlbW9wOiAgcm9sbGJhY2sgMCB0aGN5Z3NlcnZlcjogc3lzdl9z ZW0uY2MsIGxpbmUgOTY4OiBjYWxsIHRvIHNlbW9wKDY1NTM3LCAweDIyRTQ0 NCwgMSkKCmN5Z3NlcnZlcjogc3lzdl9zZW0uY2MsIGxpbmUgOTk0OiBUcnkg bG9ja2luZyBtdXRleCBzZW1pZApjeWdzZXJ2ZXI6IHN5c3Zfc2VtLmNjLCBs aW5lIDk2ODogY2FsbCB0byBzZW1vcCg2NTUzNywgMHgyMkU0NDQsIDEpCgpj eWdzZXJ2ZXI6IHN5c3Zfc2VtLmNjLCBsaW5lIDk5NDogVHJ5IGxvY2tpbmcg bXV0ZXggc2VtaWQKY3lnc2VydmVyOiBzeXN2X3NlbS5jYywgbGluZSA5Njg6 IGNhbGwgdG8gc2Vtb3AoNjU1MzcsIDB4MjJFNEQ0LCAxKQoKY3lnc2VydmVy OiBzeXN2X3NlbS5jYywgbGluZSA5OTQ6IFRyeSBsb2NraW5nIG11dGV4IHNl bWlkCgpvdWdoIC0xDQoNCmN5Z3NlcnZlcjogc3lzdl9zZW0uY2MsIGxpbmUg MTEwOTogc2Vtb3A6ICBnb29kIG5pZ2h0IQ0KDQpjeWdzZXJ2ZXI6IGJzZF9t dXRleC5jYywgbGluZSAxODk6IFVubG9ja2VkICAgIG11dGV4IHNlbWlkDQpj eWdzZXJ2ZXI6IHN5c3Zfc2VtLmNjLCBsaW5lIDk2ODogY2FsbCB0byBzZW1v cCg2NTUzNywgMHgyMkU0NDQsIDEpDQoNCmN5Z3NlcnZlcjogc3lzdl9zZW0u Y2MsIGxpbmUgOTk0OiBUcnkgbG9ja2luZyBtdXRleCBzZW1pZA0KY3lnc2Vy dmVyOiBzeXN2X3NlbS5jYywgbGluZSA5Njg6IGNhbGwgdG8gc2Vtb3AoNjU1 MzYsIDB4MjJFNEE0LCAxKQ0KDQpjeWdzZXJ2ZXI6IHN5c3Zfc2VtLmNjLCBs aW5lIDk5NDogVHJ5IGxvY2tpbmcgbXV0ZXggc2VtaWQNCmN5Z3NlcnZlcjog c3lzdl9zZW0uY2MsIGxpbmUgOTk0OiBMb2NrZWQgICAgICBtdXRleCBzZW1p ZA0KY3lnc2VydmVyOiBzeXN2X3NlbS5jYywgbGluZSAxMDQ5OiBzZW1vcDog IHNlbWFwdHI9QTA1NkRBOCwgc2VtX2Jhc2U9QTA1NjQ0MCwgc2VtcHRyPUEw NTY0QUMsIHNlbVs5XT0wIDogb3A9MSwgZmxhZz13YWl0DQoNCmN5Z3NlcnZl cjogc3lzdl9zZW0uY2MsIGxpbmUgMTIxNjogc2Vtb3A6ICBkb2luZyB3YWtl dXANCg0KY3lnc2VydmVyOiBzeXN2X3NlbS5jYywgbGluZSAxMjE4OiBzZW1v cDogIGJhY2sgZnJvbSB3YWtldXANCg0KY3lnc2VydmVyOiBzeXN2X3NlbS5j YywgbGluZSAxMjIwOiBzZW1vcDogIGRvbmUNCg0KY3lnc2VydmVyOiBzeXN2 X3NlbS5jYywgbGluZSAxMjIzOiBVbmxvY2tlZCAgICBtdXRleCBzZW1pZA0K Y3lnc2VydmVyOiBzeXN2X3NlbS5jYywgbGluZSA5OTQ6IExvY2tlZCAgICAg IG11dGV4IHNlbWlkDQpjeWdzZXJ2ZXI6IHN5c3Zfc2VtLmNjLCBsaW5lIDEw NDk6IHNlbW9wOiAgc2VtYXB0cj1BMDU2REYwLCBzZW1fYmFzZT1BMDU2NTBD LCBzZW1wdHI9QTA1NjVBOCwgc2VtWzEzXT0wIDogb3A9LTEsIGZsYWc9d2Fp dA0KDQpjeWdzZXJ2ZXI6IHN5c3Zfc2VtLmNjLCBsaW5lIDEwNTM6IHNlbW9w OiAgY2FuJ3QgZG8gaXQgbm93DQoNCmN5Z3NlcnZlcjogc3lzdl9zZW0uY2Ms IGxpbmUgMTA4Njogc2Vtb3A6ICByb2xsYmFjayAwIHRocm91Z2ggLTENCg0K Y3lnc2VydmVyOiBzeXN2X3NlbS5jYywgbGluZSAxMTA5OiBzZW1vcDogIGdv b2QgbmlnaHQhDQoNCmN5Z3NlcnZlcjogc3lzdl9zZW0uY2MsIGxpbmUgOTk0 OiBMb2NrZWQgICAgICBtdXRleCBzZW1pZA0KY3lnc2VydmVyOiBzeXN2X3Nl bS5jYywgbGluZSAxMDQ5OiBzZW1vcDogIHNlbWFwdHI9QTA1NkRGMCwgc2Vt X2Jhc2U9QTA1NjUwQywgc2VtcHRyPUEwNTY1QjQsIHNlbVsxNF09MCA6IG9w PS0xLCBmbGFnPXdhaXQNCg0KY3lnc2VydmVyOiBzeXN2X3NlbS5jYywgbGlu ZSAxMDUzOiBzZW1vcDogIGNhbid0IGRvIGl0IG5vdw0KDQpjeWdzZXJ2ZXI6 IHN5c3Zfc2VtLmNjLCBsaW5lIDEwODY6IHNlbW9wOiAgcm9sbGJhY2sgMCB0 aHJvdWdoIC0xDQoNCmN5Z3NlcnZlcjogc3lzdl9zZW0uY2MsIGxpbmUgMTEw OTogc2Vtb3A6ICBnb29kIG5pZ2h0IQ0KDQpjeWdzZXJ2ZXI6IGJzZF9tdXRl eC5jYywgbGluZSAxODk6IFVubG9ja2VkICAgIG11dGV4IHNlbWlkDQpjeWdz ZXJ2ZXI6IHN5c3Zfc2VtLmNjLCBsaW5lIDk5NDogTG9ja2VkICAgICAgbXV0 ZXggc2VtaWQNCmN5Z3NlcnZlcjogc3lzdl9zZW0uY2MsIGxpbmUgMTA0OTog c2Vtb3A6ICBzZW1hcHRyPUEwNTZERjAsIHNlbV9iYXNlPUEwNTY1MEMsIHNl bXB0cj1BMDU2NTI0LCBzZW1bMl09MCA6IG9wPTEsIGZsYWc9d2FpdA0KDQpj eWdzZXJ2ZXI6IHN5c3Zfc2VtLmNjLCBsaW5lIDEyMTY6IHNlbW9wOiAgZG9p bmcgd2FrZXVwDQoNCmN5Z3NlcnZlcjogc3lzdl9zZW0uY2MsIGxpbmUgMTIx ODogc2Vtb3A6ICBiYWNrIGZyb20gd2FrZXVwDQoNCmN5Z3NlcnZlcjogc3lz dl9zZW0uY2MsIGxpbmUgMTIyMDogc2Vtb3A6ICBkb25lDQoNCmN5Z3NlcnZl cjogc3lzdl9zZW0uY2MsIGxpbmUgMTIyMzogVW5sb2NrZWQgICAgbXV0ZXgg c2VtaWQNCmN5Z3NlcnZlcjogc3lzdl9zZW0uY2MsIGxpbmUgOTk0OiBMb2Nr ZWQgICAgICBtdXRleCBzZW1pZA0KY3lnc2VydmVyOiBzeXN2X3NlbS5jYywg bGluZSAxMDQ5OiBzZW1vcDogIHNlbWFwdHI9QTA1NkRGMCwgc2VtX2Jhc2U9 QTA1NjUwQywgc2VtcHRyPUEwNTY1QzAsIHNlbVsxNV09MCA6IG9wPS0xLCBm bGFnPXdhaXQNCg0KY3lnc2VydmVyOiBzeXN2X3NlbS5jYywgbGluZSAxMDUz OiBzZW1vcDogIGNhbid0IGRvIGl0IG5vdw0KDQpjeWdzZXJ2ZXI6IHN5c3Zf c2VtLmNjLCBsaW5lIDEwODY6IHNlbW9wOiAgcm9sbGJhY2sgMCB0aHJvdWdo IC0xDQoNCmN5Z3NlcnZlcjogc3lzdl9zZW0uY2MsIGxpbmUgMTEwOTogc2Vt b3A6ICBnb29kIG5pZ2h0IQ0KDQpjeWdzZXJ2ZXI6IGJzZF9tdXRleC5jYywg bGluZSAxODk6IFVubG9ja2VkICAgIG11dGV4IHNlbWlkDQpjeWdzZXJ2ZXI6 IGJzZF9tdXRleC5jYywgbGluZSAxODk6IFVubG9ja2VkICAgIG11dGV4IHNl bWlkDQpjeWdzZXJ2ZXI6IGJzZF9tdXRleC5jYywgbGluZSAyMzI6IFRyeSBs b2NraW5nIG11dGV4IHNlbWlkDQpjeWdzZXJ2ZXI6IGJzZF9tdXRleC5jYywg bGluZSAyMzI6IExvY2tlZCAgICAgIG11dGV4IHNlbWlkDQpjeWdzZXJ2ZXI6 IHN5c3Zfc2VtLmNjLCBsaW5lIDExMTI6IHNlbW9wOiAgZ29vZCBtb3JuaW5n IChlcnJvcj0wKSENCg0KY3lnc2VydmVyOiBzeXN2X3NlbS5jYywgbGluZSAx MTQ0OiBzZW1vcDogIGdvb2QgbW9ybmluZyENCg0KY3lnc2VydmVyOiBzeXN2 X3NlbS5jYywgbGluZSAxMDQ5OiBzZW1vcDogIHNlbWFwdHI9QTA1NkRBOCwg c2VtX2Jhc2U9QTA1NjQ0MCwgc2VtcHRyPUEwNTY0QUMsIHNlbVs5XT0xIDog b3A9LTEsIGZsYWc9d2FpdA0KDQpjeWdzZXJ2ZXI6IHN5c3Zfc2VtLmNjLCBs aW5lIDEyMjA6IHNlbW9wOiAgZG9uZQ0KDQpjeWdzZXJ2ZXI6IHN5c3Zfc2Vt LmNjLCBsaW5lIDEyMjM6IFVubG9ja2VkICAgIG11dGV4IHNlbWlkDQo= --0-543418162-1096898540=:48529 Content-Type: text/plain; charset=us-ascii -- Unsubscribe info: http://cygwin.com/ml/#unsubscribe-simple Problem reports: http://cygwin.com/problems.html Documentation: http://cygwin.com/docs.html FAQ: http://cygwin.com/faq/ --0-543418162-1096898540=:48529--