OK, done.

I'm not sure whether attachments are allowed on this list, and there isn't
much to show anyway, so I'm pasting everything inline.

Here are the last few lines of the log after I restarted Monit in debug mode:

[UTC May 13 06:09:05] info     : Starting Monit 5.13 daemon with http interface at [*]:2812
[UTC May 13 06:09:05] info     : Monit start delay set -- pause for 5s
[UTC May 13 06:09:10] info     : Starting Monit HTTP server at [*]:2812
[UTC May 13 06:09:10] info     : Monit HTTP server started
[UTC May 13 06:09:10] info     : 'crumble.localdomain' Monit started
[UTC May 13 06:09:10] info     : M/Monit heartbeat started
[UTC May 13 06:09:10] error    : 'opsworks-agent-master-running' process is not running
[UTC May 13 06:09:10] error    : 'opsworks-agent' process is not running
[UTC May 13 06:09:10] info     : 'opsworks-agent' trying to restart
[UTC May 13 06:09:10] info     : 'opsworks-agent' start: /usr/bin/env
[UTC May 13 06:09:42] error    : 'opsworks-agent-master-running' process is not running
[UTC May 13 06:09:42] info     : 'opsworks-agent-master-running' trying to restart
[UTC May 13 06:09:42] info     : 'opsworks-agent' start: /usr/bin/env
[UTC May 13 06:10:14] error    : 'opsworks-agent-master-running' process is not running
[UTC May 13 06:10:14] info     : 'opsworks-agent-master-running' trying to restart
[UTC May 13 06:10:14] info     : 'opsworks-agent' start: /usr/bin/env
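
Since the log only shows that the start command was launched, not whether it
succeeded, one way to check the command on its own is to run it by hand in a
stripped-down environment. The following is a minimal sketch, not a confirmed
diagnosis. It assumes, as I understand the Monit manual, that Monit runs
programs with a sparse PATH of /bin:/usr/bin:/sbin:/usr/sbin; the helper name
run_like_monit is hypothetical and is not part of Monit.

```shell
#!/bin/sh
# Hypothetical helper (not a Monit command): run a command with an empty
# environment apart from a sparse PATH. The PATH value is an assumption
# taken from the Monit manual, not from the logs in this thread.
run_like_monit() {
    env -i PATH=/bin:/usr/bin:/sbin:/usr/sbin "$@"
}

# Example, left commented out so nothing is started by accident. The command
# is the "Start program" line from the debug service list:
# run_like_monit /usr/bin/env service opsworks-agent start
# echo "exit status: $?"
```

If the command fails or hangs when run this way but works from a normal login
shell, the environment is a likely suspect; if it works both ways, the cause
is probably elsewhere.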


Starting Monit in debug mode printed this:

Starting monit: Adding credentials for user 'admin'
Runtime constants:
 Control file       = /etc/monit/monitrc
 Log file           = /var/log/monit.log
 Pid file           = /var/run/monit.pid
 Id file            = /var/lib/monit.id
 State file         = /var/run/monit.state
 Debug              = True
 Log                = True
 Use syslog         = False
 Is Daemon          = True
 Use process engine = True
 Poll time          = 30 seconds with start delay 5 seconds
 Expect buffer      = 256 bytes
 Event queue        = base directory /var/monit with 100 slots
 M/Monit(s)         = http://[FILTERED_IP]:80/collector with timeout 5 seconds using credentials
 Mail from          = [email protected]
 Mail subject       = $SERVICE $EVENT at $DATE
 Mail message       = Monit $ACTION $SERVI..(truncated)
 Start monit httpd  = True
 httpd bind address = Any/All
 httpd portnumber   = 2812
 httpd ssl          = Disabled
 httpd signature    = Enabled
 httpd auth. style  = Basic Authentication

The service list contains the following entries:

Process Name          = opsworks-agent-master-running
 Group                = opsworks
 Match                = opsworks-agent: master
 Monitoring mode      = active
 Existence            = if does not exist for 2 cycles then restart

Process Name          = opsworks-agent
 Group                = opsworks
 Pid file             = /var/lib/aws/opsworks/pid/opsworks-agent.pid
 Monitoring mode      = active
 Start program        = '/usr/bin/env service opsworks-agent start' timeout 30 second(s)
 Stop program         = '/usr/bin/env service opsworks-agent stop' timeout 30 second(s)
 Existence            = if does not exist then restart
 Depends on Service   = opsworks-agent-keep-alive-daemons-log
 Depends on Service   = opsworks-agent-process-command-daemons-log
 Depends on Service   = opsworks-agent-statistic-daemons-log
 Depends on Service   = opsworks-agent-master-running

File Name             = opsworks-agent-statistic-daemons-log
 Group                = opsworks
 Path                 = /var/log/aws/opsworks/opsworks-agent.statistics.log
 Monitoring mode      = active
 Existence            = if does not exist for 3 cycles then restart
 Timestamp            = if greater than 120 second(s) for 3 cycles then restart

File Name             = opsworks-agent-process-command-daemons-log
 Group                = opsworks
 Path                 = /var/log/aws/opsworks/opsworks-agent.process_command.log
 Monitoring mode      = active
 Existence            = if does not exist for 3 cycles then restart
 Timestamp            = if greater than 120 second(s) for 3 cycles then restart

File Name             = opsworks-agent-keep-alive-daemons-log
 Group                = opsworks
 Path                 = /var/log/aws/opsworks/opsworks-agent.keep_alive.log
 Monitoring mode      = active
 Existence            = if does not exist for 3 cycles then restart
 Timestamp            = if greater than 120 second(s) for 3 cycles then restart

System Name           = crumble.localdomain
 Monitoring mode      = active

-------------------------------------------------------------------------------
Monit daemon with PID 26769 awakened


On Wed, May 13, 2015 at 11:37 AM Martin Pala <[email protected]> wrote:

> Please make sure monit logging is enabled (the “set logfile” statement) +
> run Monit in debug mode (-v option), try to reproduce the problem and send
> logs.
>
> Regards,
> Martin
>
>
> > On 13 May 2015, at 07:15, Shrinath M <[email protected]> wrote:
> >
> > I am using AWS Opsworks, and AWS uses an old version of monit (5.3.2) to
> > monitor their agent. Obviously, when their opsworks-agent dies, monit
> > restarts it.
> > Recently, I wanted to monitor a few processes of my own and needed a
> > newer version of monit for its explicit "restart" command support. I
> > upgraded monit to 5.13.
> > Now, monit does not restart opsworks agent if it dies!
> >
> > I tried looking through the monit changelog to see if something changed
> > between versions, but could not find entries for all versions beyond 5.7.
> > Can someone please take a look at the opsworks config below and see what
> > might be breaking?
> >
> > opsworks-config follows -
> > check process opsworks-agent with pidfile "/var/lib/aws/opsworks/pid/opsworks-agent.pid"
> >   start program = "/etc/init.d/opsworks-agent start"
> >   stop program = "/etc/init.d/opsworks-agent stop"
> >   depends on opsworks-agent-master-running
> >   depends on opsworks-agent-statistic-daemons-log
> >   depends on opsworks-agent-process-command-daemons-log
> >   depends on opsworks-agent-keep-alive-daemons-log
> >   group opsworks
> >
> > check process opsworks-agent-master-running matching "opsworks-agent:\smaster"
> >   if not exist for 2 cycles then restart
> >   group opsworks
> >
> > # check run of statistic daemon
> > check file opsworks-agent-statistic-daemons-log with path "/var/log/aws/opsworks/opsworks-agent.statistics.log"
> >   if timestamp > 2 minutes for 3 cycles then restart
> >   if does not exist for 3 cycles then restart
> >   group opsworks
> >
> > # check run of process command daemon
> > check file opsworks-agent-process-command-daemons-log with path "/var/log/aws/opsworks/opsworks-agent.process_command.log"
> >   if timestamp > 2 minutes for 3 cycles then restart
> >   if does not exist for 3 cycles then restart
> >   group opsworks
> >
> > # check run of keep alive daemon
> > check file opsworks-agent-keep-alive-daemons-log with path "/var/log/aws/opsworks/opsworks-agent.keep_alive.log"
> >   if timestamp > 2 minutes for 3 cycles then restart
> >   if does not exist for 3 cycles then restart
> >   group opsworks
> >
> > - end of file
> >
> > Monit logs say restart done, but opsworks doesn't run. If I downgrade to
> > 5.3.2, it does magically run!
> > --
> > To unsubscribe:
> > https://lists.nongnu.org/mailman/listinfo/monit-general
