On 01.07.2013 10:19, Francis.Hu wrote:
Hi, All
I am installing a cluster with Hadoop 2.0.5-alpha. I have one namenode
and two datanodes. The dfs.replication is set as 2 in hdfs-site.xml.
After all configuration work is done, I started all nodes. Then I
saved a file into HDFS through java client. nOW I can access hdfs web
page: x.x.x.x:50070,and also see the file is already listed in the
hdfs list.
My question is: *The replication column in HDFS web page is showing as
3, not 2. Does anyone know What the problem is?*
**
---Actual setting of hdfs-site.xml
<property>
<name>dfs.replication</name>
<value>2</value>
</property>
After that, I typed dfsamdin command to check the file:
hdfs fsck /test3/
The result of above command:
/test3/hello005.txt: Under replicated
BP-609310498-192.168.219.129-1372323727200:blk_-1069303317294683372_1006.
Target Replicas is 3 but found 2 replica(s).
Status: HEALTHY
Total size: 35 B
Total dirs: 1
Total files: 1
Total blocks (validated): 1 (avg. block size 35 B)
Minimally replicated blocks: 1 (100.0 %)
Over-replicated blocks: 0 (0.0 %)
Under-replicated blocks: 1 (100.0 %)
Mis-replicated blocks: 0 (0.0 %)
Default replication factor: 2
Average block replication: 2.0
Corrupt blocks: 0
Missing replicas: 1 (33.333332 %)
Number of data-nodes: 3
Number of racks: 1
FSCK ended at Sat Jun 29 16:51:37 CST 2013 in 6 milliseconds
Thanks,
Francis Hu
If I'm not mistaking "dfs.replication" parameter in config sets only
default replication factor, which can be overrided when putting file to
hdfs.