[jira] Commented: (MAPREDUCE-1213) TaskTrackers restart is very slow because it deletes distributed cache directory synchronously

Todd Lipcon (JIRA) Thu, 10 Dec 2009 17:52:42 -0800

    [ 
https://issues.apache.org/jira/browse/MAPREDUCE-1213?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12789073#action_12789073
 ]


Todd Lipcon commented on MAPREDUCE-1213:
----------------------------------------

bq. That's the plan but I suppose I cannot add that to common and at the same 
time use it in mapreduce?

Right. I think you should open a new JIRA in common and link this JIRA as 
Blocked by it. Once that JIRA is committed, you can use this JIRA to adopt it 
in mapreduce, and file another JIRA to switch HDFS over to it. Sound good?

> TaskTrackers restart is very slow because it deletes distributed cache 
> directory synchronously
> ----------------------------------------------------------------------------------------------
>
>                 Key: MAPREDUCE-1213
>                 URL: https://issues.apache.org/jira/browse/MAPREDUCE-1213
>             Project: Hadoop Map/Reduce
>          Issue Type: Bug
>    Affects Versions: 0.20.1
>            Reporter: dhruba borthakur
>            Assignee: Zheng Shao
>         Attachments: MAPREDUCE-1213.1.patch, MAPREDUCE-1213.2.patch
>
>
> We are seeing that when we restart a tasktracker, it tries to recursively 
> delete all the files in the distributed cache. It invokes 
> FileUtil.fullyDelete(), which is very slow. This means that the 
> TaskTracker cannot join the cluster for an extended period of time (up to 2 
> hours for us). The problem is acute if the number of files in a distributed 
> cache is a few thousand.
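
One common way to avoid blocking startup on a large recursive delete is to rename the directory out of the way (cheap on the same filesystem) and delete the renamed copy on a background thread. The sketch below only illustrates that general idea. It is not the code from the attached patches, and the class and method names (AsyncDirCleaner, moveAndDeleteAsync, shutdownAndWait) are hypothetical. It uses plain java.nio rather than Hadoop's FileUtil.

```java
import java.io.IOException;
import java.nio.file.FileVisitResult;
import java.nio.file.Files;
import java.nio.file.Path;
import java.nio.file.SimpleFileVisitor;
import java.nio.file.attribute.BasicFileAttributes;
import java.util.concurrent.ExecutorService;
import java.util.concurrent.Executors;
import java.util.concurrent.TimeUnit;

/** Hypothetical helper: rename a directory, then delete it in the background. */
public class AsyncDirCleaner {
    // A single daemon thread, so pending deletes never keep the JVM alive.
    private final ExecutorService pool = Executors.newSingleThreadExecutor(r -> {
        Thread t = new Thread(r, "async-dir-cleaner");
        t.setDaemon(true);
        return t;
    });

    /**
     * Renames dir to a unique sibling and schedules the slow recursive delete.
     * Returns as soon as the rename is done; the caller can recreate dir at once.
     * Assumes dir and its sibling are on the same filesystem.
     */
    public Path moveAndDeleteAsync(Path dir) throws IOException {
        Path trash = dir.resolveSibling(
                dir.getFileName() + ".deleting." + System.nanoTime());
        Files.move(dir, trash);
        pool.execute(() -> {
            try {
                deleteRecursively(trash);
            } catch (IOException e) {
                System.err.println("Failed to delete " + trash + ": " + e);
            }
        });
        return trash;
    }

    /** Deletes files first, then each directory once it is empty. */
    static void deleteRecursively(Path root) throws IOException {
        Files.walkFileTree(root, new SimpleFileVisitor<Path>() {
            @Override
            public FileVisitResult visitFile(Path file, BasicFileAttributes attrs)
                    throws IOException {
                Files.delete(file);
                return FileVisitResult.CONTINUE;
            }

            @Override
            public FileVisitResult postVisitDirectory(Path d, IOException e)
                    throws IOException {
                if (e != null) {
                    throw e;
                }
                Files.delete(d);
                return FileVisitResult.CONTINUE;
            }
        });
    }

    /** Stops accepting work and waits for queued deletes to finish. */
    public boolean shutdownAndWait(long timeout, TimeUnit unit)
            throws InterruptedException {
        pool.shutdown();
        return pool.awaitTermination(timeout, unit);
    }
}
```

With something like this, a restarting daemon would call moveAndDeleteAsync on the old cache directory, recreate an empty one, and proceed to join the cluster while the old contents are removed in the background. A real implementation would also need to clean up leftover ".deleting" directories from a previous crash.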

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.
