Re: [Puppet Users] Puppet agent hangs after running a few hours, defunct sh process

2012-01-10 Thread Jo Rhett
The comments in the redhat bug indicated that this breakage came from upstream, 
as did the fix.  So it's entirely possible that this bug appeared in some 
Debian kernels, but I don't know which.

On Jan 10, 2012, at 8:41 AM, jcbollinger wrote:
> On Jan 9, 11:40 am, Jo Rhett  wrote:
>> On Jan 7, 2012, at 7:40 PM, Andreas N wrote:
>> 
>>> That doesn't tell me anything other than that the puppet agent is blocking 
>>> on select() with a timeout of two seconds.
>> 
>> Sounds like #10418.  Check your kernel version.
>>  https://projects.puppetlabs.com/issues/10418
> 
> It sounds similar, but 10418 is specific to a particular RedHat /
> CentOS kernel, and the OP is observing his problem on Ubuntu.  My
> awareness of that issue is one of the reasons I advised the OP to look
> at kernel versions, however.
> 
> 
> John
> 
> -- 
> You received this message because you are subscribed to the Google Groups 
> "Puppet Users" group.
> To post to this group, send email to puppet-users@googlegroups.com.
> To unsubscribe from this group, send email to 
> puppet-users+unsubscr...@googlegroups.com.
> For more options, visit this group at 
> http://groups.google.com/group/puppet-users?hl=en.
> 

-- 
Jo Rhett
Net Consonance : consonant endings by net philanthropy, open source and other 
randomness

-- 
You received this message because you are subscribed to the Google Groups 
"Puppet Users" group.
To post to this group, send email to puppet-users@googlegroups.com.
To unsubscribe from this group, send email to 
puppet-users+unsubscr...@googlegroups.com.
For more options, visit this group at 
http://groups.google.com/group/puppet-users?hl=en.



[Puppet Users] Puppet agent hangs after running a few hours, defunct sh process

2012-01-05 Thread Andreas N
Hi, 

On a node running Puppet 2.7.9 from apt.puppetlabs.com, using Ubuntu 
10.04.3 the agent hangs after a few hours of operation. I have to kill -9 
it, nothing else helps. Obviously, this is unfortunate.

Looking at ps -ef I see this:

root  4842  4594  0 Jan04 pts/000:00:55 /usr/bin/ruby1.8 
/usr/bin/puppet agent --verbose --no-daemonize --debug
root  9803  4842  0 Jan04 pts/000:00:00 [sh] 

It seems a defunct sh process is responsible. This has happened before on 
that node so I started the agent with the command line arguments you see 
above. Unfortunately the produced debug logs don't look any different from 
the debug logs on a node where I haven't observed this behavior. The logs 
from the last run can be found here, nonetheless: http://pastie.org/3128200

The problem seems to happen regularly on that particular node but I looked 
around other nodes we have running and it seems to happen on a few others 
as well. These nodes don't have anything in common (not even the puppet 
master) but do have a few common modules applied. Could this be caused by 
one of those modules? How would I go about debugging? Or does anyone 
already know what's going on here?

Thanks, 

Andreas

-- 
You received this message because you are subscribed to the Google Groups 
"Puppet Users" group.
To view this discussion on the web visit 
https://groups.google.com/d/msg/puppet-users/-/z6W5nxo-DqAJ.
To post to this group, send email to puppet-users@googlegroups.com.
To unsubscribe from this group, send email to 
puppet-users+unsubscr...@googlegroups.com.
For more options, visit this group at 
http://groups.google.com/group/puppet-users?hl=en.