[ 
https://issues.apache.org/jira/browse/HADOOP-2847?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12569751#action_12569751
 ] 

Runping Qi commented on HADOOP-2847:
------------------------------------


a flip side of the problem is that the hod may miss small jobs running on the 
cluster if the jobs complete in less than 2 minutes.
HOD should use Hadoop API to get the complete time of the last job if no job is 
currently running.
And use that time to determine the cluster idle time.


> [HOD] Idle cluster cleanup does not work if the JobTracker becomes 
> unresponsive to RPC calls
> --------------------------------------------------------------------------------------------
>
>                 Key: HADOOP-2847
>                 URL: https://issues.apache.org/jira/browse/HADOOP-2847
>             Project: Hadoop Core
>          Issue Type: Bug
>          Components: contrib/hod
>    Affects Versions: 0.16.0
>            Reporter: Hemanth Yamijala
>            Assignee: Hemanth Yamijala
>            Priority: Blocker
>             Fix For: 0.16.1
>
>
> In some erroneous conditions, the Hadoop JobTracker becomes unresponsive to 
> RPC calls (for e.g. if a misconfiguration causes the JobTracker to run out of 
> memory). In such cases, a cluster allocated by HOD no longer runs any jobs 
> and is wastefully holding up nodes. The usual idle cluster cleaner should 
> deallocate the cluster ideally, but it does not.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.

Reply via email to