[ 
https://issues.apache.org/jira/browse/HADOOP-4594?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12645971#action_12645971
 ] 

Chris Douglas commented on HADOOP-4594:
---------------------------------------

Sorry, my previous comments were cursory and this deserves more attention. I 
ran this past someone who has worked with both Nagios and Hadoop, and my 
understanding is that, while a good base, the patch as-is is not generally 
applicable. Specifically, the feedback was that the configuration contains 
several hard-coded values (hostnames, ports, environment variables) and that a 
combination of jps and grep is not a sufficient check of node health.
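
As a rough illustration of that feedback (this is not taken from the attached 
patch): a check could accept the host and port as arguments instead of 
hard-coding them, and probe the daemon's port rather than grepping jps output. 
The Python sketch below assumes hypothetical names (check_port, main) and 
arbitrary thresholds; only the exit-code convention (0 OK, 1 WARNING, 
2 CRITICAL, 3 UNKNOWN) is the standard one for Nagios plugins.

```python
import socket
import time

# Standard Nagios plugin return codes.
OK, WARNING, CRITICAL, UNKNOWN = 0, 1, 2, 3


def check_port(host, port, warn_secs=1.0, timeout=5.0):
    """Try a TCP connect to host:port and return (status, message).

    Hypothetical helper: warn_secs and timeout are illustrative defaults,
    not values from the patch.
    """
    start = time.monotonic()
    try:
        # A successful connect shows something is listening, which a
        # process listing alone cannot tell us.
        with socket.create_connection((host, port), timeout=timeout):
            elapsed = time.monotonic() - start
    except OSError as exc:
        return CRITICAL, f"CRITICAL - {host}:{port} unreachable ({exc})"
    if elapsed > warn_secs:
        return WARNING, f"WARNING - {host}:{port} answered in {elapsed:.2f}s"
    return OK, f"OK - {host}:{port} answered in {elapsed:.2f}s"


def main(argv):
    """Parse HOST and PORT from argv, print one status line, return the code."""
    if len(argv) != 3:
        print("UNKNOWN - usage: check_port.py HOST PORT")
        return UNKNOWN
    try:
        port = int(argv[2])
    except ValueError:
        print(f"UNKNOWN - port must be an integer, got {argv[2]!r}")
        return UNKNOWN
    status, message = check_port(argv[1], port)
    print(message)
    return status
```

Run as a script, sys.exit(main(sys.argv)) would hand the status back to Nagios 
or NRPE. A TCP connect is still a shallow probe; it only shows where host and 
port parameters would go and how a result maps to a plugin status.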

What I can speak to directly: configuration files, even templates for common 
tools, are not usually checked into contrib. On the other hand, you're right: 
there isn't a good place for this right now. As far as monitoring goes, 
several efforts are currently in progress (Chukwa, metrics, and HADOOP-3628, 
specifically its {{ping()}} functionality) that would be compelling points of 
integration.

> Monitoring Scripts for Nagios
> -----------------------------
>
>                 Key: HADOOP-4594
>                 URL: https://issues.apache.org/jira/browse/HADOOP-4594
>             Project: Hadoop Core
>          Issue Type: Wish
>            Reporter: Edward Capriolo
>            Priority: Minor
>         Attachments: HADOOP-4594.patch
>
>
> I would like to create a set of local via NRPE and remote check scripts that 
> can be shipped with the hadoop distribution and used to monitor Hadoop. I 
> already have completed the NRPE scripts. The second set of scripts would use 
> wget to read the output of the hadoop web interfaces. Do these already exist?
> I guess these would fall under a new contrib project.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.
