Hello,
I have run into a problem where the SNMP agent is reported as down on
some servers, with the message:
2008-10-27 12:50:40 ERROR zen.zenperfsnmp: Failed to collect on web01
(twisted.python.failure.Failure: [Failure instance: Traceback (failure
with no frames): twisted.internet.error.TimeoutError: User timeout
caused connection failure.
Nothing helps:
- restarting zenperfsnmp from the web GUI or the command line
- restarting all of the Zenoss daemons
- setting MAX_OIDS_PER_REQUEST in zenperfsnmp.py, or the per-device
zMaxOIDPerRequest, to 20 or even lower
- raising zSnmpTimeout to 15 or even 20 seconds
- changing the monitors' SNMP timeouts in the dmd/manage interface
- deleting and recreating the servers' devices in Zenoss
- removing the cache files in zenoss/var
The strangest thing is that everything works from the command line of
the Zenoss host, e.g.
snmpwalk
or
zenperfsnmp run -v -d <device>
and modeling works just fine for failing devices.
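
For reference, the command-line check could be scripted roughly as below, to see how long a walk really takes on a failing device compared with zSnmpTimeout. This is only a sketch under assumptions not stated above: SNMP v2c, a community string of "public", the net-snmp snmpwalk binary on the PATH, and a Python 3 interpreter separate from the Python 2.4 bundled with Zenoss. The function names are made up for illustration.

```python
import subprocess
import time

# Numeric OID of the standard MIB-2 "system" subtree; a small, cheap walk.
SYSTEM_OID = "1.3.6.1.2.1.1"


def build_snmpwalk_cmd(host, community="public", timeout_s=15, retries=0,
                       oid=SYSTEM_OID):
    """Return the argv list for one net-snmp snmpwalk call.

    The community string and SNMP version are assumptions; adjust them
    to match the device configuration.
    """
    return ["snmpwalk", "-v", "2c", "-c", community,
            "-t", str(timeout_s), "-r", str(retries), host, oid]


def time_snmpwalk(host, **kwargs):
    """Run snmpwalk once and return (exit_code, elapsed_seconds).

    Raises FileNotFoundError if snmpwalk is not installed.
    """
    cmd = build_snmpwalk_cmd(host, **kwargs)
    start = time.time()
    proc = subprocess.run(cmd, stdout=subprocess.DEVNULL,
                          stderr=subprocess.DEVNULL)
    return proc.returncode, time.time() - start
```

Calling time_snmpwalk("web01") returns the exit code and the elapsed time in seconds; a non-zero exit code, or an elapsed time close to the timeout, would suggest the device itself is slow to answer rather than zenperfsnmp.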
We rely on SNMP to watch running processes, network interfaces and many
other things.
Please help!
Thanks in advance,
Andriy
Installation details:
Zenoss         Zenoss 2.1.3
OS             Linux (i686) 2.6.24 (Linux falcon 2.6.24-16-server #1 SMP Thu Apr 10 13:58:00 UTC 2008 i686)
Zope           Zope 2.8.8
Python         Python 2.4.5
Database       MySQL 5.0.51 (Ver 5.0.51)
RRD            RRDtool 1.2.23
Twisted        Twisted 2.5.0
SNMP           PySNMP 3.4.3
Twisted SNMP   TwistedSNMP 0.3.13
NetSnmp        NetSnmp 5.4.1