GH-118: JT/NN backoff if response time over threshold
-----------------------------------------------------

                 Key: OOZIE-144
                 URL: https://issues.apache.org/jira/browse/OOZIE-144
             Project: Oozie
          Issue Type: Bug
            Reporter: Hadoop QA


If the JT/NN and overloaded Oozie should back-off temporary.

This can be done in the HadoopAccessorService.

Because JT/NN does not provide and API to find out the current health this has 
to be determined using API calls that do a known/fixed amount of work. For 
example for JT asking for the queue names, for NN asking for the contents of 
the root directory.

A tool that queries this values should be run against the cluster to find the 
normal values an values under stress. This would help to determine the 
threshold value for Oozie.

Oozie, before using a  JT/NN handle (JobClient/FileSystem) will test the  
response time, if the response time is above the threshold Oozie will backoff 
for # seconds and will not attempt any call to the cluster.

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira

        

Reply via email to