GH-118: JT/NN backoff if response time over threshold
-----------------------------------------------------
Key: OOZIE-144
URL: https://issues.apache.org/jira/browse/OOZIE-144
Project: Oozie
Issue Type: Bug
Reporter: Hadoop QA
If the JT/NN and overloaded Oozie should back-off temporary.
This can be done in the HadoopAccessorService.
Because JT/NN does not provide and API to find out the current health this has
to be determined using API calls that do a known/fixed amount of work. For
example for JT asking for the queue names, for NN asking for the contents of
the root directory.
A tool that queries this values should be run against the cluster to find the
normal values an values under stress. This would help to determine the
threshold value for Oozie.
Oozie, before using a JT/NN handle (JobClient/FileSystem) will test the
response time, if the response time is above the threshold Oozie will backoff
for # seconds and will not attempt any call to the cluster.
--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira