Ray Chiang created YARN-5078:
--------------------------------

             Summary: [Umbrella] NodeManager health checker improvements
                 Key: YARN-5078
                 URL: https://issues.apache.org/jira/browse/YARN-5078
             Project: Hadoop YARN
          Issue Type: Bug
          Components: nodemanager
            Reporter: Ray Chiang
            Assignee: Ray Chiang


There have been a bunch of NodeManager health checker improvement requests in 
the past.

Right now, I expect that initially there just need to be a bunch of base 
functionality added.  The most obvious parts are:

- Finding appropriate measurements of health
- Storing measurements as metrics.  This should allow easy comparison of good 
nodes and bad nodes.  This should eventually lead to threshold 
blacklisting/whitelisting.
- Adding metrics to the NodeManager UI

After this basic functionality is added, we can start consider some enhanced 
form of NodeManager health status conditions.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

Reply via email to