You need to stop monit, then start it again ("monit -v" is sufficient). The 
attached output is just configuration dump + waked up running monit daemon. You 
will find the output in related syslog file - once the problem will occur again 
(monit thinks the process is running while it is stopped), please check/send 
the log.

Regards,
Martin


> On 23 Dec 2014, at 15:51, Russell Simpkins <[email protected]> wrote:
> 
> Adding host allow 'localhost'
> Skipping redundant host 'localhost'
> Skipping redundant host 'localhost'
> Adding credentials for user 'admin'
> Adding PAM group 'monit'
> Adding PAM group 'users'
> Runtime constants:
>  Control file       = /etc/monitrc
>  Log file           = syslog
>  Pid file           = /var/run/monit.pid
>  Id file            = /root/.monit.id <http://monit.id/>
>  State file         = /root/.monit.state
>  Debug              = True
>  Log                = True
>  Use syslog         = True
>  Is Daemon          = True
>  Use process engine = True
>  Poll time          = 10 seconds with start delay 240 seconds
>  Expect buffer      = 256 bytes
>  Mail from          = (not defined)
>  Mail subject       = (not defined)
>  Mail message       = (not defined)
>  Start monit httpd  = True
>  httpd bind address = localhost
>  httpd portnumber   = 2812
>  httpd signature    = True
>  Use ssl encryption = False
>  httpd auth. style  = Basic Authentication and Host/Net allow list
> 
> The service list contains the following entries:
> 
> Process Name          = recentnews-feed
>  Pid file             = /var/run/recentnews-feed.pid
>  Monitoring mode      = active
>  Start program        = '/sbin/service recentnews-feed start' timeout 30 
> second(s)
>  Stop program         = '/sbin/service recentnews-feed stop' timeout 30 
> second(s)
>  Existence            = if does not exist then restart
>  Pid                  = if changed then alert
>  PPid                 = if changed then alert
>  Timeout              = If restarted 50 times within 50 cycle(s) then 
> unmonitor
> 
> Process Name          = nscd
>  Pid file             = /var/run/nscd/nscd.pid
>  Monitoring mode      = active
>  Start program        = '/etc/init.d/nscd start' timeout 30 second(s)
>  Stop program         = '/etc/init.d/nscd stop' timeout 30 second(s)
>  Existence            = if does not exist then restart
>  Pid                  = if changed then alert
>  PPid                 = if changed then alert
>  Timeout              = If restarted 5 times within 5 cycle(s) then unmonitor
> 
> Process Name          = nrpe
>  Pid file             = /var/run/nrpe.pid
>  Monitoring mode      = active
>  Start program        = '/etc/init.d/nrpe start' timeout 30 second(s)
>  Stop program         = '/etc/init.d/nrpe stop' timeout 30 second(s)
>  Existence            = if does not exist then restart
>  Pid                  = if changed then alert
>  PPid                 = if changed then alert
>  Timeout              = If restarted 5 times within 5 cycle(s) then unmonitor
> 
> Process Name          = httpd
>  Pid file             = /var/run/httpd/httpd.pid
>  Monitoring mode      = active
>  Start program        = '/etc/init.d/httpd start' timeout 30 second(s)
>  Stop program         = '/usr/bin/killall -9 httpd' timeout 30 second(s)
>  Existence            = if does not exist then restart
>  Pid                  = if changed then alert
>  PPid                 = if changed then alert
>  Timeout              = If restarted 5 times within 5 cycle(s) then unmonitor
> 
> Process Name          = emissary-master
>  Pid file             = /var/run/emissary.pid
>  Monitoring mode      = active
>  Start program        = '/etc/init.d/emissary start' timeout 30 second(s)
>  Stop program         = '/usr/bin/pkill -9 -f (emissary-master|emop_node)' 
> timeout 30 second(s)
>  Existence            = if does not exist then restart
>  Pid                  = if changed then alert
>  PPid                 = if changed then alert
>  Timeout              = If restarted 5 times within 5 cycle(s) then unmonitor
> 
> File Name             = emissary-pidfile
>  Path                 = /var/run/emissary.pid
>  Monitoring mode      = active
>  Start program        = '/etc/init.d/emissary start' timeout 30 second(s)
>  Stop program         = '/usr/bin/pkill -9 -f (emissary-master|emop_node)' 
> timeout 30 second(s)
>  Existence            = if does not exist then restart
>  Timestamp            = if greater than 3600 second(s) then restart
> 
> Process Name          = du-glass-broker-feed
>  Pid file             = /var/run/DU_GlassBroker.php.pid
>  Monitoring mode      = active
>  Start program        = '/sbin/service du-glass-broker-feed start' timeout 30 
> second(s)
>  Stop program         = '/sbin/service du-glass-broker-feed stop' timeout 30 
> second(s)
>  Existence            = if does not exist then restart
>  Pid                  = if changed then alert
>  PPid                 = if changed then alert
>  Timeout              = If restarted 5 times within 5 cycle(s) then unmonitor
> 
> System Name           = du-proc-00001.du-proc.data-universe-production
>  Monitoring mode      = active
> 
> -------------------------------------------------------------------------------
> Monit daemon with PID 12702 awakened
> 
> On Tue, Dec 23, 2014 at 9:42 AM, Martin Pala <[email protected] 
> <mailto:[email protected]>> wrote:
> Then it should notice the process died quickly.
> 
> Please run Monit in debug mode and send output:
> 
>       monit -vI
> 
> Regards,
> Martin
> 
> 
>> On 23 Dec 2014, at 15:33, Russell Simpkins <[email protected] 
>> <mailto:[email protected]>> wrote:
>> 
>> Hi,
>> 
>> I have the daemon set to 10 seconds
>> 
>> set daemon  10
>>    with start delay 240
>> 
>> 
>> 
>> On Tue, Dec 23, 2014 at 8:57 AM, Martin Pala <[email protected] 
>> <mailto:[email protected]>> wrote:
>> Hi,
>> 
>> what's the poll cycle settings? ("set daemon <seconds>" statement)
>> 
>> Monit performs the checks and then sleep for given number of seconds. If you 
>> poll cycle is long, Monit will not notice the process died until next cycle.
>> 
>> Regards,
>> Martin
>> 
>> 
>>> On 23 Dec 2014, at 14:45, Russell Simpkins <[email protected] 
>>> <mailto:[email protected]>> wrote:
>>> 
>>> Hi,
>>> 
>>> I have a check on a process via a pid file that monit reported as up, when 
>>> the pid was dead and I was wondering if there were any good tips for 
>>> figuring out why. We're running monit 5.9. When I run a status, I can see 
>>> my process listed as running and monitored:
>>> 
>>>   Process 'recentnews-feed'
>>>     status                            Running
>>>     monitoring status                 Monitored
>>>     pid                               9680
>>>     parent pid                        1
>>>     uid                               5005
>>>     effective uid                     5005
>>>     gid                               5006
>>>     uptime                            1d 3h 20m 
>>>     children                          0
>>>     memory kilobytes                  1805064
>>>     memory kilobytes total            1805064
>>>     memory percent                    25.7%
>>>     memory percent total              25.7%
>>>     cpu percent                       0.1%
>>>     cpu percent total                 0.1%
>>>     data collected                    Fri, 19 Dec 2014 04:14:04
>>> 
>>> When I check to see if the pid is actually running, it's not there:
>>> 
>>>   $ ps -ef | grep -i 9680
>>>   root     24520 24029  0 06:39 pts/0    00:00:00 grep -i 9680
>>> 
>>> 
>>> My monit config:
>>> check process recentnews-feed with pidfile /var/run/recentnews-feed.pid     
>>>  start program = "/sbin/service recentnews-feed start"
>>>   stop program = "/sbin/service recentnews-feed stop"
>>>   if 50 restarts within 50 cycles then timeout
>>> 
>>> Again, just curious if this is a known issue in 5.9 or how to figure out 
>>> why monit thought the pid was up when it was not.
>>> 
>>> Thanks,
>>> 
>>> Russ
>>> 
>>> --
>>> To unsubscribe:
>>> https://lists.nongnu.org/mailman/listinfo/monit-general 
>>> <https://lists.nongnu.org/mailman/listinfo/monit-general>
>> 
>> --
>> To unsubscribe:
>> https://lists.nongnu.org/mailman/listinfo/monit-general 
>> <https://lists.nongnu.org/mailman/listinfo/monit-general>
>> 
>> --
>> To unsubscribe:
>> https://lists.nongnu.org/mailman/listinfo/monit-general 
>> <https://lists.nongnu.org/mailman/listinfo/monit-general>
> 
> --
> To unsubscribe:
> https://lists.nongnu.org/mailman/listinfo/monit-general 
> <https://lists.nongnu.org/mailman/listinfo/monit-general>
> 
> --
> To unsubscribe:
> https://lists.nongnu.org/mailman/listinfo/monit-general

--
To unsubscribe:
https://lists.nongnu.org/mailman/listinfo/monit-general

Reply via email to