[
https://issues.apache.org/jira/browse/HDFS-555?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Hairong Kuang reassigned HDFS-555:
----------------------------------
Assignee: Ravi Phulari (was: Hairong Kuang)
> Also, the patches are submitted by Ravi, but the JIRA is assigned to Hairong.
> I'm not sure if this is incorrect or not - just pointing out.
I submitted an initial patch to the original jira. That's why my name appeared
as the assignee. Thanks Ravi for working on this. I am assigning this jira to
you.
> A few improvements to DataNodeCluster - HADOOP-5556
> ----------------------------------------------------
>
> Key: HDFS-555
> URL: https://issues.apache.org/jira/browse/HDFS-555
> Project: Hadoop HDFS
> Issue Type: Improvement
> Components: test
> Affects Versions: 0.21.0
> Reporter: Ravi Phulari
> Assignee: Ravi Phulari
> Priority: Blocker
> Fix For: 0.21.0
>
> Attachments: HDFS-555-0.20.patch, HDFS-555-v1.patch,
> HDFS-555.0.20-test-patch.log, HDFS-555.patch
>
>
> Opening jira to address HDFS code changes made in HADOOP-5556.
> DataNodeCluster is a great tool to simulate a large scale DFS cluster using a
> small set of machines. A few suggestions to improve this tool:
> 1. DataNodeCluster uses MiniDFSCluster#startDataNode to start multiple
> instances of DataNode on one machine. MiniDFSCluster sets DataNode's address
> to be 127.0.0.1. We should allow to set its address to 0.0.0.0 so DataNodes
> in different machines could communicate.
> 2. Currently the size of the blocks injected to DataNode and created in
> CreatedEditsLog is hardcoded as 10. It would be more convenient if this could
> be configurable. Also we need to make sure that both use the same block size.
> 3. If the replication factor of blocks is larger than 1, currently a
> DataNode in DataNodeCluster will be injected blocks multiple times and
> therefore it sends block reports to NameNode multiple times. Initial block
> reports contain only a portion of its blocks and therefore may cause
> unnecessary block replications. It would be cleaner if only one block report
> with all its blocks is sent.
--
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.