[jira] [Commented] (AMBARI-9458) HDFS, YARN, and HBase Slave Health Alert Definitions

Hadoop QA (JIRA) Tue, 03 Feb 2015 10:05:03 -0800

    [ 
https://issues.apache.org/jira/browse/AMBARI-9458?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14303684#comment-14303684
 ]


Hadoop QA commented on AMBARI-9458:
-----------------------------------

{color:red}-1 overall{color}.  Here are the results of testing the latest 
attachment 
  http://issues.apache.org/jira/secure/attachment/12696228/AMBARI-9458.patch
  against trunk revision .

    {color:red}-1 patch{color}.  Top-level trunk compilation may be broken.

Console output: 
https://builds.apache.org/job/Ambari-trunk-test-patch/1565//console

This message is automatically generated.

> HDFS, YARN, and HBase Slave Health Alert Definitions
> ----------------------------------------------------
>
>                 Key: AMBARI-9458
>                 URL: https://issues.apache.org/jira/browse/AMBARI-9458
>             Project: Ambari
>          Issue Type: Task
>          Components: ambari-server
>    Affects Versions: 2.0.0
>            Reporter: Yurii Shylov
>            Assignee: Yurii Shylov
>             Fix For: 2.0.0
>
>         Attachments: AMBARI-9458.patch
>
>
> When a slave component, such as a DataNode, encounters some catastrophic 
> problem like a heap allocation error, and no longer can perform its work, the 
> NameNode marks this DataNode as being unhealthy.
> The current alert definitions only check for the DataNode process being 
> alive, which is still technically is. We need to add new alert definitions 
> for:
> - HDFS/DataNode (runs on NameNode, query is to NameNode JMX)
> - YARN/NodeManager (runs on ResourceManager, query is to ResourceManager JMX)
> - HBase/RegionServer (runs on HBase Master, queries HBase Master JMX)
> Which will check for slaves that are in some sort of bad state. Depending on 
> the JMX structures that need to be queried, these can either be METRIC or 
> SCRIPT style alert definitions.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

[jira] [Commented] (AMBARI-9458) HDFS, YARN, and HBase Slave Health Alert Definitions

Reply via email to