Re: resource leak

Robert Watson Thu, 02 Oct 2008 05:08:52 -0700


On Wed, 1 Oct 2008, Stephen Clark wrote:

A big part of problem is this seems to take about 100 days of uptime tooccur. We have some inhouse test boxes but have never seen the problem,probably because non of them have been up more than about 45 days. The unitsin the field, of which there is about 300, are headless and none arephysically close.
When the boxes are rebooted there are no error messages in any of the logfiles, only the absence of information that would normally be logged by newprocesses that would be spawned. We are getting ready to install a patchthat will try to gather more information.
I thought about writing an app the would try to fork a child periodicallyand record in a log file if there was an error. But EAGAIN is nonspecific asto the real reason the fork failed. I was looking for some way toperiodically log the resources that would cause the fork failure.


The narrowness of the UNIX errno space is, at times, fairly unhelpful.

As far as I'm aware, the two main causes of EAGAIN out of fork() are anexhaustion of maxprocs or an exhaustion of per-user process limits. Thissuggests one or more run-away applications or services, or a gradual leak ofprocesses from a service (perhaps a failure to GC dead children, or a gradualincrease but never decrease of worker processes?).


Robert N M Watson
Computer Laboratory
University of Cambridge

procstat -k looks like it would have been a good candidate but unfortunatelywe

are running 6.1.

Thanks for the response.
Steve

--

"They that give up essential liberty to obtain temporary safety,
deserve neither liberty nor safety."  (Ben Franklin)

"The course of history shows that as a government grows, liberty
decreases."  (Thomas Jefferson)

_______________________________________________
[email protected] mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to "[EMAIL PROTECTED]"

Re: resource leak

Reply via email to