[ 
https://issues.apache.org/jira/browse/MAPREDUCE-4328?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13439804#comment-13439804
 ] 

Arun C Murthy commented on MAPREDUCE-4328:
------------------------------------------

Ah, I thought I responded to ATM, my bad.

As I described in the jira description, the primary use-case is to allow the 
JobTracker to be resilient to NN failures (hardware or software).

I did think long and hard about doing this in YARN, but with HDFS-HA this 
use-case is pretty much non-existent. Furthermore, YARN isn't tied to HDFS as 
MR1 is, and since it's distributed across several AMs there is no single 
point of control like the JT in MR1. Thus, I think there isn't enough value 
in porting it as-is, conceptually (not code-wise).

In many ways this is similar to MAPREDUCE-3837, i.e. no straight-backport.

Having said that, I plan to make sure we pay attention to this when we get 
around to fixing RM Restart. This is something I definitely plan to do later 
this year, at which point we'll ensure there is no 'feature regression'.

Makes sense?

----

Eli's point about draining queues is a good one; I've opened MAPREDUCE-4575 and 
YARN-38 to track that. That feature is something we can map straight across, 
conceptually, between MR1 and YARN.
                
> Add the option to quiesce the JobTracker
> ----------------------------------------
>
>                 Key: MAPREDUCE-4328
>                 URL: https://issues.apache.org/jira/browse/MAPREDUCE-4328
>             Project: Hadoop Map/Reduce
>          Issue Type: Improvement
>          Components: mrv1
>    Affects Versions: 1.0.3
>            Reporter: Arun C Murthy
>            Assignee: Arun C Murthy
>             Fix For: 1.2.0
>
>         Attachments: MAPREDUCE-4328.patch, MAPREDUCE-4328.patch, 
> TestJobTrackerQuiescence.java
>
>
> In several failure scenarios it would be very handy to have an option to 
> quiesce the JobTracker.
> Recently, we saw a case where the NameNode had to be rebooted at a customer 
> due to a random hardware failure - in such a case it would have been nice to 
> not lose jobs by quiescing the JobTracker.


        
