[ 
https://issues.apache.org/jira/browse/HADOOP-3245?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12615158#action_12615158
 ] 

dhruba borthakur commented on HADOOP-3245:
------------------------------------------

+1 to Owen's comment.

A pressing need of our cluster is to not interrupt running jobs if the 
jobtracker has to be restarted. This means  that job states have to be 
persisted in the form of a transaction log. This requirement is all the more 
beneficial to sites that have long-running job trackers (instead of HOD).

However, isn't it better to be able to store state in HDFS? It is true that 
HDFS stores its transaction log in local files, but with the current focus on 
improving HDFS read/write latencies, HDFS itself is considering whether to 
store one copy of the transaction log in HDFS blocks (instead of NFS). In fact, 
if the JobTracker stores information in a org.apache.hadoop.fs.FileSystem, then 
a typical customer install could plug in various forms of storage to support 
the JobTracker transaction log.



> Provide ability to persist running jobs (extend HADOOP-1876)
> ------------------------------------------------------------
>
>                 Key: HADOOP-3245
>                 URL: https://issues.apache.org/jira/browse/HADOOP-3245
>             Project: Hadoop Core
>          Issue Type: New Feature
>          Components: mapred
>            Reporter: Devaraj Das
>            Assignee: Amar Kamat
>         Attachments: HADOOP-3245-v2.5.patch, HADOOP-3245-v2.6.5.patch, 
> HADOOP-3245-v2.6.9.patch, HADOOP-3245-v4.1.patch
>
>
> This could probably extend the work done in HADOOP-1876. This feature can be 
> applied for things like jobs being able to survive jobtracker restarts.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.

Reply via email to