[ https://issues.apache.org/jira/browse/YARN-999?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16765630#comment-16765630 ]
Junping Du commented on YARN-999: --------------------------------- bq. I tried to go through the code to see where the overcommit timeout was used but I didn't get anywhere useful. Does anybody know if this is actually implemented? No. YARN-2489 is supposed to work on it but haven't done yet. bq. As this has been 6 years, I'd take over this if nobody is on it. My bad. My priority keep changing... Please feel free to take it. I will help on review. > In case of long running tasks, reduce node resource should balloon out > resource quickly by calling preemption API and suspending running task. > ----------------------------------------------------------------------------------------------------------------------------------------------- > > Key: YARN-999 > URL: https://issues.apache.org/jira/browse/YARN-999 > Project: Hadoop YARN > Issue Type: Sub-task > Components: graceful, nodemanager, scheduler > Reporter: Junping Du > Assignee: Junping Du > Priority: Major > > In current design and implementation, when we decrease resource on node to > less than resource consumption of current running tasks, tasks can still be > running until the end. But just no new task get assigned on this node > (because AvailableResource < 0) until some tasks are finished and > AvailableResource > 0 again. This is good for most cases but in case of long > running task, it could be too slow for resource setting to actually work so > preemption could be hired here. -- This message was sent by Atlassian JIRA (v7.6.3#76005) --------------------------------------------------------------------- To unsubscribe, e-mail: yarn-issues-unsubscr...@hadoop.apache.org For additional commands, e-mail: yarn-issues-h...@hadoop.apache.org