Hi,

Monit will disable the process monitoring on excessive restart failures due to 
"if 5 restart within 5 cycles then timeout" statement in your configuration 
(the "timeout" action an alias for "unmonitor" and we switched the 
documentation in the past to "unmonitor" as it is more clear: 
https://mmonit.com/monit/documentation/monit.html#SERVICE-RESTART-LIMIT)

The monitoring state is persistent - the "timed out" service has usually some 
hard error which requires manual intervention (timeout statement prevents 
endless restart loop). When the problem is resolved, the monitoring needs to be 
enabled manually.

If you want to drop the state for example after reboot, place the statefile to 
tmpfs filesystem (you can use "set statefile <path>" statement to customize 
state file placement).

Regards,
Martin



> On 16 Dec 2015, at 14:20, Stas Oskin <[email protected]> wrote:
> 
> Hi,
> 
> After some more digging, it occurred to me that it might that monit just 
> stops monitoring the process after it unable to restart it.
> 
> So on monit 4.x it appears this state was cleared when just restarting monit, 
> while on 5.x it seems you need actually to mark the check as active manually 
> via the monit command.
> 
> Is this correct?
> 
> On Fri, Nov 20, 2015 at 5:37 PM, Martin Pala <[email protected] 
> <mailto:[email protected]>> wrote:
> Hi,
> 
> please run monit in debug mode and send output:
> 
>       monit -vI
> 
> Regards,
> Martin
> 
> 
> 
>> On 19 Nov 2015, at 20:39, Stas Oskin <[email protected] 
>> <mailto:[email protected]>> wrote:
>> 
>> Hi,
>> 
>> The monit log shows only the general start-up information.
>> 
>> There is no messages about the processes going offline, it's like monit does 
>> not use the pid file to find the process anymore.
>> 
>> When I use HTTP port probing though it works just fine. Any idea what could 
>> it be?
>> 
>> Regards.
>> 
>> On Sun, Nov 15, 2015 at 9:16 PM, Martin Pala <[email protected] 
>> <mailto:[email protected]>> wrote:
>> Hi,
>> 
>> please can you provide more details about the problem? (error messages 
>> and/or monit log).
>> 
>> Note that monit 5.9 includes fix for program execution for CentOS6/RHEL6, we 
>> recommend upgrade to latest monit version (5.15), you can get it here: 
>> https://mmonit.com/monit/#download <https://mmonit.com/monit/#download>. You 
>> can build rpm directly from the source code release: rpmbuild -tb 
>> monit-5.15.tar.gz. I think RHEL uses custom configuration file, official 
>> monit looks for /etc/monitrc, so you may need to rename the configuration 
>> file or create a link after upgrade.
>> 
>> Regards,
>> Martin
>> 
>> 
>> > On 14 Nov 2015, at 16:51, Stas Oskin <[email protected] 
>> > <mailto:[email protected]>> wrote:
>> >
>> > Hi,
>> >
>> > Monit has reliably served us through the years, and we are very happy of 
>> > it.
>> >
>> > Unfortunately during scheduled migration to CentOS 6 due CentOS 5 EOL, we 
>> > discovered it stopped monitoring the services pid files. HTTP monitoring 
>> > works fine.
>> >
>> > The CentOS 6 version is:
>> > monit-5.1.1-4.el6.x86_64
>> >
>> > CentOS 5 version is:
>> > monit-4.10.1-8.el5
>> >
>> > An example config that not working anymore (but accepted by monit when 
>> > starting):
>> > check process XXXX with pidfile /XXXX/pid/XXXXX.pid
>> >     start program  "/etc/init.d/xxxxx restart"
>> >     stop program  "/etc/init.d/xxxxxx stop"
>> >     if mem usage > 85% then restart
>> >     if 5 restarts within 5 cycles then timeout
>> >
>> > I guess something changed in configuration jump from 4 to 5, will 
>> > appreciate any advice.
>> >
>> > Thanks!
>> > --
>> > To unsubscribe:
>> > https://lists.nongnu.org/mailman/listinfo/monit-general 
>> > <https://lists.nongnu.org/mailman/listinfo/monit-general>
>> 
>> 
>> --
>> To unsubscribe:
>> https://lists.nongnu.org/mailman/listinfo/monit-general 
>> <https://lists.nongnu.org/mailman/listinfo/monit-general>
>> 
>> --
>> To unsubscribe:
>> https://lists.nongnu.org/mailman/listinfo/monit-general 
>> <https://lists.nongnu.org/mailman/listinfo/monit-general>
> 
> --
> To unsubscribe:
> https://lists.nongnu.org/mailman/listinfo/monit-general 
> <https://lists.nongnu.org/mailman/listinfo/monit-general>
> 
> --
> To unsubscribe:
> https://lists.nongnu.org/mailman/listinfo/monit-general

--
To unsubscribe:
https://lists.nongnu.org/mailman/listinfo/monit-general

Reply via email to