Alexey Goncharuk created IGNITE-6587:
----------------------------------------

             Summary: Ignite watchdog service
                 Key: IGNITE-6587
                 URL: https://issues.apache.org/jira/browse/IGNITE-6587
             Project: Ignite
          Issue Type: Improvement
          Components: general
    Affects Versions: 2.2
            Reporter: Alexey Goncharuk
             Fix For: 2.4


We need to come up with a 'watchdog service' to monitor for Ignite node local 
health and kill the process under some critical conditions.
For example, if one of the mission-critical Ignite threads die, the Ignite node 
must be stopped.
At the first glance, the list of critical threads is:
All TCP discovery threads
All communication NIO threads (acceptor and workers)
Exchange worker
Striped pool threads
Timeout Worker
Checkpointer 
WAL archiver

The mechanism should support pluggable components so that self-check can be 
extended via plugins.



--
This message was sent by Atlassian JIRA
(v6.4.14#64029)

Reply via email to