[
https://issues.apache.org/jira/browse/HDFS-4562?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13595625#comment-13595625
]
Uma Maheswara Rao G commented on HDFS-4562:
-------------------------------------------
Can you also check whether the DN registrations are expiring for some reason
and the DNs are then rejoining the cluster? [You should see log lines like
'Adding a new node'.] By the time the DNs rejoin, has the NN already initiated
replication for those blocks?
As Suresh said, we can check whether any node is responding very slowly, so
that the NN replicates those blocks before that node reports.
{quote}
There should not be an excess replica getting created. From the above log, if
the replica corresponding to x.y.z.67 has been in pending replicas.
I see that pendingReplications does not get updated
{quote}
That check considers pendingReplications only (i.e. numEffectiveReplicas).
So we should not remove it if it is in pending replications. If anything
beyond that is present in neededReplications again, we are removing it, I think.
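To make the point about numEffectiveReplicas concrete, here is a minimal sketch
of the kind of check being discussed. It is not the 1.1.1 code: the class, the
method names, and the string block IDs are all hypothetical, and it only assumes
that effective replicas are counted as live replicas plus pending replications.

```java
import java.util.HashMap;
import java.util.Map;

// Hypothetical stand-in for the NN's replication bookkeeping; not Hadoop code.
class ReplicationCheckSketch {
    // Block ID -> number of replications scheduled but not yet confirmed.
    private final Map<String, Integer> pendingReplications = new HashMap<>();

    // Record that 'count' more replications were scheduled for this block.
    void addPending(String blockId, int count) {
        pendingReplications.merge(blockId, count, Integer::sum);
    }

    // Assumption: effective replicas = live replicas + pending replications.
    int numEffectiveReplicas(String blockId, int liveReplicas) {
        return liveReplicas + pendingReplications.getOrDefault(blockId, 0);
    }

    // A block needs more replication work only if the effective count
    // is still below the required replication factor.
    boolean needsReplication(String blockId, int liveReplicas, int required) {
        return numEffectiveReplicas(blockId, liveReplicas) < required;
    }
}
```

Under these assumptions, a block with 2 live replicas and 1 pending replication
at replication factor 3 is not scheduled again. If the pending entry is missing,
the same block is scheduled once more, which is one way an excess replica could
appear.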
{quote}
I see that pendingReplications does not get updated during the creation of the
file. Correct me if I'm wrong.
{quote}
I have not looked at the 1.1.1 code just now, but we should be updating
neededReplications while finalizing the block, no? We will not update it while
creating.
> Many excess replicas getting created
> ------------------------------------
>
> Key: HDFS-4562
> URL: https://issues.apache.org/jira/browse/HDFS-4562
> Project: Hadoop HDFS
> Issue Type: Bug
> Affects Versions: 1.1.1
> Reporter: Amareshwari Sriramadasu
> Fix For: 1.2.0
>
>
> We are seeing too many excess replicas getting created in our cluster. The
> number of excess replicas in a day is coming out to be more than 1 lakh
> (100,000).
--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira