[
https://issues.apache.org/jira/browse/HADOOP-3245?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12620175#action_12620175
]
Amar Kamat commented on HADOOP-3245:
------------------------------------
bq. 1) This approach/patch works fine with history on local fs. With history on
HDFS, the history file becomes visible but not available (i.e file-size = 0).
The file becomes available only on close(). Sync() documentation indicates that
the file-data availability is not guaranteed.
The recently uploaded patch makes the block size of the history-file
configurable. The parameter to change it is
{{mapred.jobtracker.job.history.block.size}} (in bytes). For testing it, I
started a small job on a single node cluster with history-block-size as {{1k}}.
It seems to work fine and the history file is now visible and available. With
respect to the {{zero-size-file}}, I think we are ok to lose some small amount
of information.
> Provide ability to persist running jobs (extend HADOOP-1876)
> ------------------------------------------------------------
>
> Key: HADOOP-3245
> URL: https://issues.apache.org/jira/browse/HADOOP-3245
> Project: Hadoop Core
> Issue Type: New Feature
> Components: mapred
> Reporter: Devaraj Das
> Assignee: Amar Kamat
> Attachments: HADOOP-3245-v2.5.patch, HADOOP-3245-v2.6.5.patch,
> HADOOP-3245-v2.6.9.patch, HADOOP-3245-v4.1.patch, HADOOP-3245-v5.13.patch
>
>
> This could probably extend the work done in HADOOP-1876. This feature can be
> applied for things like jobs being able to survive jobtracker restarts.
--
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.