Greetings,

I use MRTG 2.17.4 to monitor a system with several thousand nodes. It's been running perfectly for nearly year. Yesterday, the graphs suddenly stopped displaying information for every single node. Dumping the rrd files shows that every entry since 1700z contains NaN, as shown here:

<!-- 2016-03-24 16:35:00 GMT / 1458837300 --> <row><v>1.1346932622e+05</v><v>4.4329144411e+05</v></row> <!-- 2016-03-24 16:40:00 GMT / 1458837600 --> <row><v>1.1090133815e+05</v><v>4.4215029817e+05</v></row> <!-- 2016-03-24 16:45:00 GMT / 1458837900 --> <row><v>1.1164798165e+05</v><v>4.2520479138e+05</v></row> <!-- 2016-03-24 16:50:00 GMT / 1458838200 --> <row><v>1.1138191129e+05</v><v>4.3855997644e+05</v></row> <!-- 2016-03-24 16:55:00 GMT / 1458838500 --> <row><v>1.1547433278e+05</v><v>4.5473071311e+05</v></row> <!-- 2016-03-24 17:00:00 GMT / 1458838800 --> <row><v>NaN</v><v>NaN</v></row> <!-- 2016-03-24 17:05:00 GMT / 1458839100 --> <row><v>NaN</v><v>NaN</v></row> <!-- 2016-03-24 17:10:00 GMT / 1458839400 --> <row><v>NaN</v><v>NaN</v></row> <!-- 2016-03-24 17:15:00 GMT / 1458839700 --> <row><v>NaN</v><v>NaN</v></row> <!-- 2016-03-24 17:20:00 GMT / 1458840000 --> <row><v>NaN</v><v>NaN</v></row> <!-- 2016-03-24 17:25:00 GMT / 1458840300 --> <row><v>NaN</v><v>NaN</v></row> <!-- 2016-03-24 17:30:00 GMT / 1458840600 --> <row><v>NaN</v><v>NaN</v></row> <!-- 2016-03-24 17:35:00 GMT / 1458840900 --> <row><v>NaN</v><v>NaN</v></row> <!-- 2016-03-24 17:40:00 GMT / 1458841200 --> <row><v>NaN</v><v>NaN</v></row> <!-- 2016-03-24 17:45:00 GMT / 1458841500 --> <row><v>NaN</v><v>NaN</v></row> <!-- 2016-03-24 17:50:00 GMT / 1458841800 --> <row><v>NaN</v><v>NaN</v></row>


I am able to use snmpget by hand:

$ snmpget -cmycommstr -v2c myhost ifDescr.4 ifInOctets.4 ifDescr.4 ifOutOctets.4
IF-MIB::ifDescr.4 = STRING: eth0
IF-MIB::ifInOctets.4 = Counter32: 981294193
IF-MIB::ifDescr.4 = STRING: eth0
IF-MIB::ifOutOctets.4 = Counter32: 1271239781


Enabling all debugging flags for mrtg produced this, which shows some obvious errors:

2016-03-25 15:16:17 -- --fork: start clearing confcache on first entry for target mycommstr@myhost_
2016-03-25 15:16:17 -- --coca: clear confcache mycommstr@myhost_
2016-03-25 15:16:17 -- --fork: finished clearing confcache
2016-03-25 15:16:17 -- --coca: store in confcache mycommstr@myhost_ Descr eth0 --> 4 2016-03-25 15:16:17 -- --coca: store in confcache mycommstr@myhost_ Descr eth1 --> 5 2016-03-25 15:16:17 -- --coca: store in confcache mycommstr@myhost_ Descr eth2 --> 2 2016-03-25 15:16:17 -- --coca: store in confcache mycommstr@myhost_ Descr eth3 --> 3 2016-03-25 15:16:17 -- --coca: store in confcache mycommstr@myhost_ Descr lo --> 1 2016-03-25 15:16:17 -- --coca: store in confcache mycommstr@myhost_ Eth --> 1 2016-03-25 15:16:17 -- --coca: store in confcache mycommstr@myhost_ Eth 00-15-17-9f-93-26 --> 2 2016-03-25 15:16:17 -- --coca: store in confcache mycommstr@myhost_ Eth 00-15-17-9f-93-27 --> 3 2016-03-25 15:16:17 -- --coca: store in confcache mycommstr@myhost_ Eth 00-22-19-98-9e-4f --> 4 2016-03-25 15:16:17 -- --coca: store in confcache mycommstr@myhost_ Eth 00-22-19-98-9e-51 --> 5 2016-03-25 15:16:17 -- --coca: store in confcache mycommstr@myhost_ Ip 10.0.6.1 --> 5 2016-03-25 15:16:17 -- --coca: store in confcache mycommstr@myhost_ Ip 127.0.0.1 --> 1 2016-03-25 15:16:17 -- --coca: store in confcache mycommstr@myhost_ Ip 165.92.30.101 --> 4 2016-03-25 15:16:17 -- --coca: store in confcache mycommstr@myhost_ Ip 165.92.30.51 --> 4 2016-03-25 15:16:17 -- --coca: store in confcache mycommstr@myhost_ Name eth0 --> 4 2016-03-25 15:16:17 -- --coca: store in confcache mycommstr@myhost_ Name eth1 --> 5 2016-03-25 15:16:17 -- --coca: store in confcache mycommstr@myhost_ Name eth2 --> 2 2016-03-25 15:16:17 -- --coca: store in confcache mycommstr@myhost_ Name eth3 --> 3 2016-03-25 15:16:17 -- --coca: store in confcache mycommstr@myhost_ Name lo --> 1 2016-03-25 15:16:17 -- --coca: store in confcache mycommstr@myhost_ Type 24 --> 1 2016-03-25 15:16:17 -- --coca: store in confcache mycommstr@myhost_ Type 6 --> Dup
.
.
.
2016-03-25 15:32:36 -- --base: Check for Thresholds
2016-03-25 15:32:36 -- --base: Act on Router/Target myhost_eth0
2016-03-25 15:32:36 -- 2016-03-25 15:32:35: ERROR: Target[myhost_eth0][_IN_] ' $target->[77]{$mode} ' did not eval into defined data 2016-03-25 15:32:36 -- 2016-03-25 15:32:35: ERROR: Target[myhost_eth0][_OUT_] ' $target->[77]{$mode} ' did not eval into defined data 2016-03-25 15:32:36 -- --base: Get Current values: in:undef, out:undef, up:undef, name:undef, time:1458918977
2016-03-25 15:32:36 -- --base: Create Graphics
2016-03-25 15:32:36 -- --base: start RRDtool section
2016-03-25 15:32:36 -- --base: maxi:125000000, maxo:125000000
2016-03-25 15:32:36 -- --log: RRDs::tune(/netmon/mrtg/rrd/myhost_eth0.rrd -a ds0:125000000 -a ds1:125000000 -d ds0:COUNTER -d ds1:COUNTER) 2016-03-25 15:32:36 -- --log: RRDs::update(/netmon/mrtg/rrd/myhost_eth0.rrd, '1458918977:U:U')
2016-03-25 15:32:36 -- --log:  got: ???/???

This appears to be the case with every data point. Does anyone know what might be causing this and how to correct it?

Regards,
Brian Rapp


_______________________________________________
mrtg mailing list
[email protected]
https://lists.oetiker.ch/cgi-bin/listinfo/mrtg

Reply via email to