[ 
https://issues.apache.org/jira/browse/HADOOP-5286?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12677150#action_12677150
 ] 

Owen O'Malley commented on HADOOP-5286:
---------------------------------------

I would propose that we make a pool of JobInit threads, so that a single slow 
data node can't block all of the progress. If the pool had 5 threads, that 
would be more than enough to prevent slow blocks from completely blocking the 
JobTracker.

> DFS client blocked for a long time reading blocks of a file on the JobTracker
> -----------------------------------------------------------------------------
>
>                 Key: HADOOP-5286
>                 URL: https://issues.apache.org/jira/browse/HADOOP-5286
>             Project: Hadoop Core
>          Issue Type: Bug
>          Components: dfs
>    Affects Versions: 0.20.0
>            Reporter: Hemanth Yamijala
>            Assignee: Raghu Angadi
>            Priority: Blocker
>             Fix For: 0.20.0
>
>         Attachments: jt-log-for-blocked-reads.txt
>
>
> On a large cluster, we've observed that DFS client was blocked on reading a 
> block of a file for almost 1 and half hours. The file was being read by the 
> JobTracker of the cluster, and was a split file of a job. On the NameNode 
> logs, we observed that the block had a message as follows:
> Inconsistent size for block blk_2044238107768440002_840946 reported from 
> <ip>:<port> current size is 195072 reported size is 1318567
> Details follow.
>  

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.

Reply via email to