hey Erik I tried increasing the file descriptor when I had the same issue last year and that did not help what so ever and adding parallel 5 to zencommand.conf didn't help either.
I'm not sure if you remember but I had the same issue, where I was running many nagios commands on over 300 devices. On Mon, 2007-02-05 at 21:56 -0500, Erik A. Dahl wrote: > Soren, > > You are running out of file descriptors! some googling got me this... > > http://bcr2.uwaterloo.ca/~brecht/servers/openfiles.html > > The root cause of your issue though maybe that you are using > check_ping... Zenoss has ping monitoring built-in no need to use > check_ping. > > -EAD > > On Feb 5, 2007, at 2:56 PM, Soren Williams wrote: > > > I am having a hard time keeping zencommand running well. I can > > start it > > by using the command zencommand run -c -D but it creates thousands of > > sessions in my firewall, bogging everything down. > > > > I am running Zenoss 1.1.0 on an FC3 box. > > > > Here are the respective errors I'm seeing. > > > > Zencommand run -v10 gives the following error before exiting > > > > DEBUG:zen.Events:EventClassInst=Stop > > DEBUG:zen.Events:insert into status set > > firstTime=1170700284.489,severity=2,component='ze > > ncommand',agent='zencommand',summary='stopped',dedupid='localhost.loca > > ld > > omain|zencommand| > > /App/Stop||2| > > stopped',manager='localhost.localdomain',eventKey='',device > > ='localhost.local > > domain',eventClass='/App/ > > Stop',lastTime=1170700284.489,message='stopped' > > ,evid='0a0a2d8634 > > ed1047cffffffc' on duplicate key update > > summary='stopped',count=count+1,lastTime=11707002 > > 84.489: --> 1 > > DEBUG:zen.Events:DELETE FROM status WHERE clearid IS NOT NULL: --> 0 > > > > > > This is what I see on the Zenoss server when running zencommand run -c > > -D > > > > [EMAIL PROTECTED] ZenRRD]$ ps aux | grep ping > > zenoss 9060 0.0 0.0 0 0 ? Z 11:40 0:00 > > [check_ping] <defunct> > > zenoss 9067 0.0 0.0 2672 516 ? S 11:40 0:00 /bin/ > > ping > > -n -U -w 10 -c 5 xxx.xxx.xxx.xxx > > zenoss 9103 0.0 0.1 7028 1028 ? S 11:40 0:00 > > /usr/local/zenoss/libexec/check_ping -H xx.xxx.xxx.xxx -w 300,20% -c > > 500,50% -p 2 > > > > > > This is what I see in the zencommand.log eventually after running > > zencommand run -c -D > > > > zencommand.log > > 2007-02-04 22:36:35 ERROR zen.zencommand: [Errno 24] Too many open > > files > > Traceback (most recent call last): > > File "/usr/local/zenoss/Products/ZenRRD/zencommand.py", line 426, in > > processSchedule > > c.start(self.pool).addBoth(self.finished) > > File "/usr/local/zenoss/Products/ZenRRD/zencommand.py", line 302, in > > start > > d = pr.start(self) > > File "/usr/local/zenoss/Products/ZenRRD/zencommand.py", line 101, in > > start > > reactor.spawnProcess(self, shell, self.cmdline, env=None) > > File "usr/local/zenoss/lib/python/twisted/internet/posixbase.py", > > line > > 282, in spawnProcess > > processProtocol, uid, gid, childFDs) > > File "usr/local/zenoss/lib/python/twisted/internet/process.py", line > > 359, in __init__ > > raise ex > > OSError: [Errno 24] Too many open files > > > > I am using the Nagios check_ping command on about 47 devices. The > > entire > > command is 'check_ping -H {dev\manageip} -w 300,20% -c 500,50% -p 2' > > > > I also get errors about Nagios being deprecated when running > > zencommand. > > > > > > Soren Williams > > _______________________________________________ > > zenoss-users mailing list > > [email protected] > > http://lists.zenoss.org/mailman/listinfo/zenoss-users > > _______________________________________________ > zenoss-users mailing list > [email protected] > http://lists.zenoss.org/mailman/listinfo/zenoss-users _______________________________________________ zenoss-users mailing list [email protected] http://lists.zenoss.org/mailman/listinfo/zenoss-users
