[ 
https://issues.apache.org/jira/browse/HDFS-611?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Zheng Shao updated HDFS-611:
----------------------------

    Attachment: HDFS-611.trunk.v5.patch


bq. It would look really good if you could move the deleteAsync() method 
together with the static class ReplicaFileDeleteClass inside AsyncDiskService. 
I understand your motivation that you want to keep AsyncDiskService independent 
of data-node connotations if the goal is to reuse it in MR, but this could be 
done by simple factoring our the common base class if such usage in MR will 
ever materialize.

For the MR usage, I am think about moving the AsyncDiskService class directly 
to common. Aggregation seems better than inheritance here, just like ThreadPool 
(I guess we don't extend ThreadPoolExecutor for different types of Tasks).

It seems to me that AsyncDiskService should not need to have knowledge of 
whatever Task is requested by the caller. In particular, the decDfsUsage call 
in the ReplicaFileDeletionTask is closely related to FSDataSet, and should be 
maintain inside FSDataSet. What do you think?

bq. A nit: could you convert the comment for AsyncDiskService to JavaDoc 
comment by just adding one more star.

Added the missing "*" for javadoc.


> Heartbeats times from Datanodes increase when there are plenty of blocks to 
> delete
> ----------------------------------------------------------------------------------
>
>                 Key: HDFS-611
>                 URL: https://issues.apache.org/jira/browse/HDFS-611
>             Project: Hadoop HDFS
>          Issue Type: Bug
>          Components: data-node
>    Affects Versions: 0.20.1, 0.21.0, 0.22.0
>            Reporter: dhruba borthakur
>            Assignee: Zheng Shao
>             Fix For: 0.20.2, 0.21.0, 0.22.0
>
>         Attachments: HDFS-611.branch-19.patch, HDFS-611.branch-19.v2.patch, 
> HDFS-611.branch-20.patch, HDFS-611.branch-20.v2.patch, HDFS-611.trunk.patch, 
> HDFS-611.trunk.v2.patch, HDFS-611.trunk.v3.patch, HDFS-611.trunk.v4.patch, 
> HDFS-611.trunk.v5.patch
>
>
> I am seeing that when we delete a large directory that has plenty of blocks, 
> the heartbeat times from datanodes increase significantly from the normal 
> value of 3 seconds to as large as 50 seconds or so. The heartbeat thread in 
> the Datanode deletes a bunch of blocks sequentially, this causes the 
> heartbeat times to increase.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.

Reply via email to