I would like to have a watchdog for nxlog, as I've found a few occasions
recently where nxlog (on RHEL5, built from source using the provided spec
file) was no longer able to send messages to (or have its sent messages
received by) a receiving nxlog instance (on RHEL6).

I'm not sure why this is (there was nothing logged at INFO level), but I
do know that restarting nxlog on the receiving side (or pointing the sender
to a different receiver) was sufficient to restore service.

I would much prefer to have some sort of 'liveness' test (not merely
ensuring that there is a process called 'nxlog') that could be used as a
test to restart a likely failed nxlog instance.

Ideally, I'd like to be able to capture a core dump etc. when this happens
for further root-cause analysis.

Considering there is never any silent time of the day, I would be happy
with a test that was based on the number of events processed in [some small
number] of minutes.
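
As a rough sketch of that throughput idea (not anything nxlog provides
itself), something like the following could be run from cron on the
receiver. It assumes Python 3 and assumes the receiving nxlog writes to a
single file; the path and the window length are placeholders made up for
illustration. Growth of the file is used as a stand-in for "events
processed".

```python
import os
import sys
import time

# Hypothetical values: adjust to wherever the receiving nxlog writes.
OUTPUT_FILE = "/var/log/nxlog/received.log"
WINDOW_SECONDS = 120


def bytes_written(path):
    """Return the current size of the file, or 0 if it cannot be read."""
    try:
        return os.stat(path).st_size
    except OSError:
        return 0


def growth(previous_size, current_size):
    """Bytes added between two samples.

    If the file shrank, assume it was rotated or truncated and count
    whatever has been written to the new file so far.
    """
    if current_size < previous_size:
        return current_size
    return current_size - previous_size


def looks_alive(previous_size, current_size, min_growth=1):
    """True if at least min_growth bytes arrived between the two samples."""
    return growth(previous_size, current_size) >= min_growth


if __name__ == "__main__":
    before = bytes_written(OUTPUT_FILE)
    time.sleep(WINDOW_SECONDS)
    after = bytes_written(OUTPUT_FILE)
    if looks_alive(before, after):
        sys.exit(0)
    # Non-zero exit lets a wrapper decide what to do (dump core, restart).
    print("no new events in %d s" % WINDOW_SECONDS, file=sys.stderr)
    sys.exit(1)
```

The script only reports; the restart (and any core dump taken beforehand)
is left to whatever calls it. A rotation that lands exactly at the second
sample could cause a false alarm, so requiring two failures in a row before
restarting would probably be wise.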

Or I suppose I could just pass in a particular log message, and then check
to see that it has come through...
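
A minimal sketch of that canary approach, again assuming Python 3. It
assumes the sending nxlog picks up local syslog messages and that the
receiver's output file (hypothetical path below) is readable from where
the check runs; with two separate hosts the "send" and "check" halves
would have to be run separately. All names here are made up for
illustration.

```python
import os
import sys
import time
import uuid

# Hypothetical path: wherever the receiving nxlog writes its output.
OUTPUT_FILE = "/var/log/nxlog/received.log"


def make_marker(prefix="nxlog-canary"):
    """Build a unique token so an old canary is never mistaken for a new one."""
    return "%s-%s" % (prefix, uuid.uuid4().hex)


def marker_seen(path, marker, tail_bytes=1000000):
    """True if the marker appears in the last tail_bytes of the file."""
    try:
        with open(path, "rb") as f:
            f.seek(0, os.SEEK_END)
            size = f.tell()
            f.seek(max(0, size - tail_bytes))
            return marker.encode("utf-8") in f.read()
    except OSError:
        return False


def wait_for_marker(path, marker, timeout=60.0, interval=2.0):
    """Poll the file until the marker shows up or the timeout expires."""
    deadline = time.monotonic() + timeout
    while True:
        if marker_seen(path, marker):
            return True
        if time.monotonic() >= deadline:
            return False
        time.sleep(interval)


if __name__ == "__main__":
    import syslog  # Unix only; assumes nxlog collects local syslog

    marker = make_marker()
    syslog.syslog(syslog.LOG_INFO, marker)
    if wait_for_marker(OUTPUT_FILE, marker):
        sys.exit(0)
    print("canary %s not seen" % marker, file=sys.stderr)
    sys.exit(1)
```

Compared with counting events, this tests the whole path end to end, at
the cost of putting synthetic messages into the logs.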


Has anyone done anything similar and would like to share what they have
done?

Cheers,
Cameron

-- 
Cameron Kerr <cameron.kerr...@gmail.com>
See my blog at http://distracted-it.blogspot.co.nz/  (previously
http://humbledown.org/)
Skype me on cameron.kerr.nz
_______________________________________________
nxlog-ce-users mailing list
nxlog-ce-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nxlog-ce-users
