[ 
https://issues.apache.org/jira/browse/HADOOP-5083?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12668746#action_12668746
 ] 

Amar Kamat commented on HADOOP-5083:
------------------------------------

bq. 1. If a job is completed and retired, and then the JT as well as the 
History Server restarts. Can a user get to the logs of a job that was completed 
earlier?
As of now the History server simply provides a web interface for the job 
history files on the history-fs. It simply reads the history file, parses it 
and allows users to analyze it. JobTracker restart will make sure that 
- the jobs that were marked completed will remain untouched
- the jobs that were running/pending will be completed. This also includes 
maintaining the history files and making sure that in the end there is only one 
history files for a completed job

bq.  Does the History Server keep some sort of an persistent index into the 
completed/failed jobs?
Nope. It doesnt require to keep any. All the files are maintained in a 
job-history folder. 



> Optionally a separate daemon should serve JobHistory
> ----------------------------------------------------
>
>                 Key: HADOOP-5083
>                 URL: https://issues.apache.org/jira/browse/HADOOP-5083
>             Project: Hadoop Core
>          Issue Type: Improvement
>          Components: mapred
>            Reporter: Arun C Murthy
>            Assignee: Amar Kamat
>         Attachments: HADOOP-5083-v1.2.patch, HADOOP-5083-v1.9.patch
>
>
> Currently the JobTracker serves the JobHistory to end-users off files 
> local-disk/hdfs. While running very large clusters with a large user-base 
> might result in lots of traffic for job-history which needlessly taxes the 
> JobTracker. The proposal is to have an optional daemon which handles serving 
> of job-history requests.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.

Reply via email to