[ 
https://issues.apache.org/jira/browse/HADOOP-2755?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12564511#action_12564511
 ] 

Christian Kunz commented on HADOOP-2755:
----------------------------------------

Are you sure about this?

Assuming that all replication requests are logged as:

org.apache.hadoop.dfs.StateChange: BLOCK* NameSystem.pendingTransfer: ask <node> to replicate blk_id ...

then there were only about 1000 such requests in the last 10 hours, but most of 
them timed out with a log message:

org.apache.hadoop.fs.FSNamesystem: PendingReplicationMonitor timed out block blk_id

And the namenode is not really busy, which is not what you would expect 
during a period of heavy replication.
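
For anyone wanting to reproduce the comparison above, here is a minimal sketch that tallies the two message types in a namenode log. The two patterns come from the messages quoted in this comment; the layout of the rest of each log line, the function name count_replication_events, and the file name namenode.log are assumptions for illustration only.

```python
import re
from collections import Counter

# Patterns built from the two messages quoted in the comment. Anything
# else on the log line (timestamp, level) is assumed and simply ignored.
REQUEST = re.compile(r"NameSystem\.pendingTransfer: ask \S+ to replicate (blk_\S+)")
TIMEOUT = re.compile(r"PendingReplicationMonitor timed out block (blk_\S+)")


def count_replication_events(lines):
    """Count replication requests and replication timeouts in log lines."""
    counts = Counter()
    for line in lines:
        if REQUEST.search(line):
            counts["requested"] += 1
        elif TIMEOUT.search(line):
            counts["timed_out"] += 1
    return counts


if __name__ == "__main__":
    # "namenode.log" is a hypothetical path; substitute the real log file.
    with open("namenode.log", encoding="utf-8", errors="replace") as log:
        totals = count_replication_events(log)
    print("requested:", totals["requested"], "timed out:", totals["timed_out"])
```

A request count that stays low over many hours while most of those requests time out would match the observation above.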

> dfs fsck extremely slow, dfs ls times out
> -----------------------------------------
>
>                 Key: HADOOP-2755
>                 URL: https://issues.apache.org/jira/browse/HADOOP-2755
>             Project: Hadoop Core
>          Issue Type: Bug
>    Affects Versions: 0.16.0
>         Environment: Jan 28 nightly build
> With patches 2095, 2119, and 2723
>            Reporter: Christian Kunz
>            Assignee: dhruba borthakur
>            Priority: Blocker
>             Fix For: 0.16.0
>
>
> I upgraded a Hadoop installation to the Jan 28 nightly build.
> DFS contains 2.4+ M files.
> Upgrade finished but not finalized.
> Before finalizing I wanted to run fsck on the DFS. It hardly progressed after 
> 6 hours (not finished yet). With the '-files' option turned on, it lists 
> about 300 entries in 10 minutes.
> And when I tried to list a subdirectory with 100,000 files, it repeatedly 
> (about 20 attempts) timed out.
> Changing timeout from 1 to 10 minutes did not help.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.
