Plague of (ntpdate) zombies |
| log in |
Questions and Answers : Unix/Linux : Plague of (ntpdate) zombies
| Author | Message |
|---|---|
|
Having received my sensor it's been happily running but I have now discovered I have 106 ntpdate processes in zombie state on my machine (Fedora 10). I already run ntpd as a matter of course and it's well synchronized; are the two interacting in some bad way ? | |
| ID: 1006 · Rating: 0 · rate:
| |
|
hmm, I'll take a look, it seems your computer never time syncs with our server: | |
| ID: 1007 · Rating: 0 · rate:
| |
|
Mine has been doing that too ever since, exactly since, QCN. Am running the latest Jaunty 64 alpha but it was doing it before. When I reboot, they are all gone. Then they return, more and more, longer I stay on without rebooting. | |
| ID: 1250 · Rating: 0 · rate:
| |
hmm, I'll take a look, it seems your computer never time syncs with our server: This is caused by the parent process not calling wait() to acknowledge the termination of the child process. ntpdate is being phased out, to be replaced by ntpd -q. However, I have determined that if ntpd is running or sleeping that the invoking of ntpd -q doesn't execute. I am still researching, but I would like to be able to have the QCN process ask my normally installed ntpd for the info it needs. That way, I would have only one copy running. I would need to know the time server you prefer. read the ntp-doc that you can install in Linux to find all the answers. Edit: nptdate has some conflicts with ntpd and does not set the clock when both are running. | |
| ID: 1401 · Rating: 0 · rate:
| |
|
thanks Fred (Rudy Toody) - I think as you noted my cmd line is being truncated so I will look into a fix for that. I just run ntpdate to get a "local" offset from our servers, so it isn't trying to really update your clock etc (just gives me a number offset I can report to adjust event times etc). | |
| ID: 1418 · Rating: 0 · rate:
| |
|
I can confirm this. I have ntpd running as a matter of course. Attempts to run ntpdate to a local stratum 2 server produces: | |
| ID: 1419 · Rating: 0 · rate:
| |
|
can you try ntpdate with all the options I use, either the one I distribute via BOINC or your "local" ntpdate, i.e. | |
| ID: 1420 · Rating: 0 · rate:
| |
|
Just to prove me wrong both of those work fine: | |
| ID: 1432 · Rating: 0 · rate:
| |
|
great, that means somehow my linux command-line is getting truncated as Rudy Toody thought. because the -q option shouldn't conflict with anybody's ntpd (if running) etc; I just "grep" the offset to get a better estimate of the time. | |
| ID: 1435 · Rating: 0 · rate:
| |
|
I have aborted and reset and still get the truncations. | |
| ID: 1444 · Rating: 0 · rate:
| |
|
well it seems to be doing the time sync but leaving zombies; I think it's because of the customized popen pipe stuff I'm doing which isn't cleaning up after itself nicely, so I'll try a waitpid() and see if that fixes it. | |
| ID: 1445 · Rating: 0 · rate:
| |
|
I detached and re-attached and I still get no time-sync. The truncation still exists. It's like I haven't gotten the new version. | |
| ID: 1446 · Rating: 0 · rate:
| |
|
odd, I just put up a new version (4.80) in which I doubled the size of the cmd line (which should be much bigger than required), see if that works? | |
| ID: 1456 · Rating: 0 · rate:
| |
|
Nope still getting zombies every 15 mins. This is with 4.80. I quit and restarted BOINC but this didn't alter things. | |
| ID: 1457 · Rating: 0 · rate:
| |
|
that's odd, I get no zombies & time sync OK with 4.80 on my Ubuntu & Debian tests (run for about an hour). I'm afraid I won't have too much time to work on Linux until I move back to Oxford in the next month; where I'll have more Linux boxes to play with. If the zombies get to be hogging up your machine you may want to suspend until I figure out what's happening (but I'm using waitpid so that should clear up any zombies, and it seems to be time syncing OK so I don't see where it would have crashed or anything). | |
| ID: 1461 · Rating: 0 · rate:
| |
|
I have no zombies at all! It doesn't appear that the ntpdate process is ever invoked (I haven't seen anything pop up on the system monitor.) The error file shows a non-sync every three minutes. | |
| ID: 1465 · Rating: 0 · rate:
| |
Questions and Answers :
Unix/Linux :
Plague of (ntpdate) zombies