Oops... sorry for my last content-free reply.

For some reason, zenperfsnmp isn't getting through all the devices. This will cause heartbeat failures, and the log messages below.

Can you do this for me:

   $ ps auxww | grep zenperfsnmp

Note the process id

$ strace -e trace=network -s 2000 -p [process id from above] 2>file-to-send-to-eric

Interrupt it after a minute.

zip the file and send it to me. This will let me see what packets are going out and coming back.

This will include things like IP addresses and community strings, so send it off list. I'm specifically looking for sendto/recvfrom on the IP addresses that are not collecting. We should see several attempts to talk to those devices in one minute. I will check the integrity of the packets and verify that we are decoding them properly.
-Eric

Todd Michael Hebert wrote:
These are in a rotated-out zenperfsnmp.log:

2006-08-11 16:23:02 WARNING zen.zenperfsnmp: Deleting old RRD file: /usr/local/zenoss/perf/B1/ifOutOctets.rrd

There's one of these, it would appear, for every single rrd.

I'm also getting these errors, and lots of them:

2006-08-14 18:59:29 WARNING zen.zenperfsnmp: Devices status is not clearing. Restarting. 2006-08-14 19:08:17 WARNING zen.zenperfsnmp: Devices status is not clearing. Restarting.

Right now I have 2-3 devices with events that just won't clear as well..but everything else seems to be monitoring normally.

I'm getting very frustrated, and I really don't want to take the whole Zenoss install down & start over. I'm monitoring 152 devices at this point... and the system has really been great for diagnosing network problems and knowing when anything goes down.

Todd M. Hebert

_______________________________________________
zenoss-users mailing list
[email protected]
http://lists.zenoss.org/mailman/listinfo/zenoss-users


_______________________________________________
zenoss-users mailing list
[email protected]
http://lists.zenoss.org/mailman/listinfo/zenoss-users

Reply via email to