Ah! That change in original snippet is due to me trying random things to
fix it. Shots in the dark :(

I tried changing the start command to use /usr/sbin/service, though, the
opsworks-agent master process starts, but is killed within 2 seconds.
So my first thought was that the agent itself is buggy, but if I start
manually, "service opsworks-agent restart", it runs just fine.
Also, there is no such problem from monit version 5.3.2.
Weird.

Just more observations/info -
The opsworks-agent when run manually throws some output to screen and says
"started child 1/3", "started child 2/3", "started child 3/3" and "started
master <pid> with 3 children".
This also goes to their logs.
But when started through monit, only the first 3 statements are there in
the log, but not the last one saying "started master <pid> with 3 children"


On Wed, May 13, 2015 at 10:45 AM Shrinath M <[email protected]> wrote:

> I am using AWS Opsworks and AWS uses an old version of monit (5.3.2) to
> monitor their agent. Obviously, when their opsworks-agent dies, monit
> restarts it.
> Recently, I wanted to monitor few processes of my own and required newer
> versions of monit to use the explicit "restart" command support. I upgraded
> monit to 5.13.
> Now, monit does not restart opsworks agent if it dies!
>
> I tried looking for changelog of monit to see if something was changed
> between versions, but could not find them for all versions beyond 5.7.
> Can someone please take a look at opsworks config below and see what might
> be breaking?
>
> opsworks-config follows -
> check process opsworks-agent with pidfile
> "/var/lib/aws/opsworks/pid/opsworks-agent.pid"
>   start program = "/etc/init.d/opsworks-agent start"
>   stop program = "/etc/init.d/opsworks-agent stop"
>   depends on opsworks-agent-master-running
>   depends on opsworks-agent-statistic-daemons-log
>   depends on opsworks-agent-process-command-daemons-log
>   depends on opsworks-agent-keep-alive-daemons-log
>   group opsworks
>
> check process opsworks-agent-master-running matching
> "opsworks-agent:\smaster"
>   if not exist for 2 cycles then restart
>   group opsworks
>
> # check run of statistic daemon
> check file opsworks-agent-statistic-daemons-log with path
> "/var/log/aws/opsworks/opsworks-agent.statistics.log"
>   if timestamp > 2 minutes for 3 cycles then restart
>   if does not exist for 3 cycles then restart
>   group opsworks
>
> # check run of process command daemon
> check file opsworks-agent-process-command-daemons-log with path
> "/var/log/aws/opsworks/opsworks-agent.process_command.log"
>   if timestamp > 2 minutes for 3 cycles then restart
>   if does not exist for 3 cycles then restart
>   group opsworks
>
> # check run of keep alive deamon
> check file opsworks-agent-keep-alive-daemons-log with path
> "/var/log/aws/opsworks/opsworks-agent.keep_alive.log"
>   if timestamp > 2 minutes for 3 cycles then restart
>   if does not exist for 3 cycles then restart
>   group opsworks
>
> - end of file
>
> Monit logs say restart done, but opsworks doesn't run. If I downgrade to
> 5.3.2, it does magically run!
>
--
To unsubscribe:
https://lists.nongnu.org/mailman/listinfo/monit-general

Reply via email to