Ray Chiang created YARN-5078:
--------------------------------
Summary: [Umbrella] NodeManager health checker improvements
Key: YARN-5078
URL: https://issues.apache.org/jira/browse/YARN-5078
Project: Hadoop YARN
Issue Type: Bug
Components: nodemanager
Reporter: Ray Chiang
Assignee: Ray Chiang
There have been a bunch of NodeManager health checker improvement requests in
the past.
Right now, I expect that initially there just need to be a bunch of base
functionality added. The most obvious parts are:
- Finding appropriate measurements of health
- Storing measurements as metrics. This should allow easy comparison of good
nodes and bad nodes. This should eventually lead to threshold
blacklisting/whitelisting.
- Adding metrics to the NodeManager UI
After this basic functionality is added, we can start consider some enhanced
form of NodeManager health status conditions.
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]