hey Erik I tried increasing the file descriptor when I had the same
issue last year and that did not help what so ever and adding parallel 5
to zencommand.conf didn't help either.

I'm not sure if you remember but I had the same issue, where I was
running many nagios commands on over 300 devices.


On Mon, 2007-02-05 at 21:56 -0500, Erik A. Dahl wrote:
> Soren,
> 
> You are running out of file descriptors!  some googling got me this...
> 
> http://bcr2.uwaterloo.ca/~brecht/servers/openfiles.html
> 
> The root cause of your issue though maybe that you are using  
> check_ping... Zenoss has ping monitoring built-in no need to use  
> check_ping.
> 
> -EAD
> 
> On Feb 5, 2007, at 2:56 PM, Soren Williams wrote:
> 
> > I am having a hard time keeping zencommand running well. I can  
> > start it
> > by using the command zencommand run -c -D but it creates thousands of
> > sessions in my firewall, bogging everything down.
> >
> > I am running Zenoss 1.1.0 on an FC3 box.
> >
> > Here are the respective errors I'm seeing.
> >
> > Zencommand run -v10 gives the following error before exiting
> >
> > DEBUG:zen.Events:EventClassInst=Stop
> > DEBUG:zen.Events:insert into status set
> > firstTime=1170700284.489,severity=2,component='ze
> > ncommand',agent='zencommand',summary='stopped',dedupid='localhost.loca 
> > ld
> > omain|zencommand|
> > /App/Stop||2| 
> > stopped',manager='localhost.localdomain',eventKey='',device
> > ='localhost.local
> > domain',eventClass='/App/ 
> > Stop',lastTime=1170700284.489,message='stopped'
> > ,evid='0a0a2d8634
> > ed1047cffffffc' on duplicate key update
> > summary='stopped',count=count+1,lastTime=11707002
> > 84.489: --> 1
> > DEBUG:zen.Events:DELETE FROM status WHERE clearid IS NOT NULL: --> 0
> >
> >
> > This is what I see on the Zenoss server when running zencommand run -c
> > -D
> >
> > [EMAIL PROTECTED] ZenRRD]$ ps aux | grep ping
> > zenoss    9060  0.0  0.0     0    0 ?        Z    11:40   0:00
> > [check_ping] <defunct>
> > zenoss    9067  0.0  0.0  2672  516 ?        S    11:40   0:00 /bin/ 
> > ping
> > -n -U -w 10 -c 5 xxx.xxx.xxx.xxx
> > zenoss    9103  0.0  0.1  7028 1028 ?        S    11:40   0:00
> > /usr/local/zenoss/libexec/check_ping -H xx.xxx.xxx.xxx -w 300,20% -c
> > 500,50% -p 2
> >
> >
> > This is what I see in the zencommand.log eventually after running
> > zencommand run -c -D
> >
> > zencommand.log
> > 2007-02-04 22:36:35 ERROR zen.zencommand: [Errno 24] Too many open  
> > files
> > Traceback (most recent call last):
> >   File "/usr/local/zenoss/Products/ZenRRD/zencommand.py", line 426, in
> > processSchedule
> >     c.start(self.pool).addBoth(self.finished)
> >   File "/usr/local/zenoss/Products/ZenRRD/zencommand.py", line 302, in
> > start
> >     d = pr.start(self)
> >   File "/usr/local/zenoss/Products/ZenRRD/zencommand.py", line 101, in
> > start
> >     reactor.spawnProcess(self, shell, self.cmdline, env=None)
> >   File "usr/local/zenoss/lib/python/twisted/internet/posixbase.py",  
> > line
> > 282, in spawnProcess
> >     processProtocol, uid, gid, childFDs)
> >   File "usr/local/zenoss/lib/python/twisted/internet/process.py", line
> > 359, in __init__
> >     raise ex
> > OSError: [Errno 24] Too many open files
> >
> > I am using the Nagios check_ping command on about 47 devices. The  
> > entire
> > command is 'check_ping -H {dev\manageip} -w 300,20% -c 500,50% -p 2'
> >
> > I also get errors about Nagios being deprecated when running  
> > zencommand.
> >
> >
> > Soren Williams
> > _______________________________________________
> > zenoss-users mailing list
> > [email protected]
> > http://lists.zenoss.org/mailman/listinfo/zenoss-users
> 
> _______________________________________________
> zenoss-users mailing list
> [email protected]
> http://lists.zenoss.org/mailman/listinfo/zenoss-users
_______________________________________________
zenoss-users mailing list
[email protected]
http://lists.zenoss.org/mailman/listinfo/zenoss-users

Reply via email to