Zenoss 2.3.3 on SLES 10 (but I doubt that makes much difference!)

If I want to monitor for specific processes on a box, typically the box 
supports SNMP so if I put it in a class like /Device/Server/Linux it will 
automatically have MODELING data collector "plugins" applied, including 
zenoss.snmp.HRSWRunMap (use table drop-down menu for device and select More -> 
"Collector plugins").  If you turn up debug on zenmodeler you can see this 
information being collected using the SNMP Host Resources MIB (just run 
zenmodeler run -v 10 -d <your device name> ).  

If you go to the device's OS page, you can see configured Processes and 
clicking on the Process Name entry takes you (hopefully) to graphs of cpu, 
memory and count for that process.  From the graphs page, if you go to the 
template tab it show that the standard /Devices/OSProcess template is used to 
collect this data.  However the Data Sources for OSProcess are rather strange - 
the type is SNMP but there is no MIB OID to supply the data!  I expected this 
to be getting data from the Host Resources hrSWRunPerf table??  What actually 
populates these graphs ('cos there IS data there and it looks reasonable)?

Next step.  Suppose you can't use SNMP to get process info from a box.  You can 
assign it to /Devices/Server/Cmd and it gets a bunch of MODELING data collector 
"plugins", including zenoss.cmd.linux.process.  Now if you run zenmodeler in 
debug you can see that simple operating system commands are run over ssh 
(provided your ssh authorization is all setup).  You can see a line like:


> 
> DEBUG:zen.SshClient:command ps axho comm,args
> 

followed by a whole bunch of lines which is obviously output from the above ps 
command (note that there are no Nagios plugins involved here - it's just the ps 
command run over ssh).  However, if processes die, they never get reported in 
Zenoss  - status always stays green (and I don't think it is the "display bug 
reported in ticket 3780 and I have restarted zenprocess....).  I have turned up 
debugging in zenprocess and can see reports on other processes on other devices 
that are performing process checks with SNMP but there appears to be no 
reference whatsoever to the process being checked on my ssh device.

Next problem, if I drill into the process name to see graphs of cpu, memory and 
count for this process, the cpu and mem have no graph at all between the < and 
> bars.  The count graph exists but has "nan" values - not available.  Not 
surprising really as it is still using the single /Devices/OSProcess template 
described above which "suggests" it using SNMP.

So,
1) Is this process monitoring by ssh commands working as designed and, 
basically, it doesn't work (even though it does get the data)
2) Has anyone got this to work?
3) Even with standard SNMP process modelling using the HRSWRunMap data 
collector, what is actually populating the OSProcess performance data template?
4) Is there any chance of getting the performance data if you are using the 
zenoss.cmd.linux.process collector?

Cheers,
Jane




-------------------- m2f --------------------

Read this topic online here:
http://forums.zenoss.com/viewtopic.php?p=33416#33416

-------------------- m2f --------------------



_______________________________________________
zenoss-users mailing list
[email protected]
http://lists.zenoss.org/mailman/listinfo/zenoss-users

Reply via email to