Utility to enforce current replication policy on already existing blocks
------------------------------------------------------------------------

                 Key: HADOOP-3211
                 URL: https://issues.apache.org/jira/browse/HADOOP-3211
             Project: Hadoop Core
          Issue Type: New Feature
            Reporter: Robert Chansler


If the replica placement policy changes, or if the cluster topology changes, 
existing block replicas will not be deliberately moved to implement the new 
policy or to reflect the new topology. The re-replicator and the re-balancer 
always implement the current policy, but a healthy block may never attract the 
attention of either of those facilities. 

In practice, blocks have been found with all of their replicas within a single 
rack (as allowed by the previous _random rack_ placement policy). The {{fsck}} 
facility reports such cases.
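
As a rough illustration of the condition described above, the following sketch 
flags blocks whose replicas all sit on one rack. It assumes the block-to-rack 
mapping has already been obtained by some other means; it does not parse 
{{fsck}} output or call any Hadoop API. The class name, method name, block ids 
and rack paths are all hypothetical.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.HashSet;
import java.util.LinkedHashMap;
import java.util.List;
import java.util.Map;

public class SingleRackBlocks {

    /**
     * Returns the ids of blocks that have at least two replicas, all of
     * them on the same rack. The input maps a (hypothetical) block id to
     * the rack of each of its replicas, one list entry per replica.
     */
    static List<String> findSingleRackBlocks(Map<String, List<String>> racksByBlock) {
        List<String> suspects = new ArrayList<>();
        for (Map.Entry<String, List<String>> entry : racksByBlock.entrySet()) {
            List<String> racks = entry.getValue();
            // A single replica is trivially on one rack; only multi-replica
            // blocks confined to one rack are of interest here.
            if (racks.size() > 1 && new HashSet<>(racks).size() == 1) {
                suspects.add(entry.getKey());
            }
        }
        return suspects;
    }

    public static void main(String[] args) {
        // Made-up sample data, for illustration only.
        Map<String, List<String>> racksByBlock = new LinkedHashMap<>();
        racksByBlock.put("blk_1", Arrays.asList("/rack1", "/rack1", "/rack1"));
        racksByBlock.put("blk_2", Arrays.asList("/rack1", "/rack2", "/rack2"));
        System.out.println(findSingleRackBlocks(racksByBlock)); // prints [blk_1]
    }
}
```

A real utility would also need to decide which replica to move and where, which 
this sketch leaves out.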

The re-placer utility could follow the model of the re-balancer, or even be 
part of it.

An alternative implementation would scan the {{fsck}} report for curious 
blocks and rename a replica block file on the data node so as to hide it. The 
block would then appear under-replicated, and the re-replicator would be 
expected to create a new, properly placed replica. (This is a fix-up that any 
administrator could implement.)
