[ 
https://issues.apache.org/jira/browse/HDFS-15?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12747115#action_12747115
 ] 

Jitendra Nath Pandey commented on HDFS-15:
------------------------------------------

In this patch, the blocks will stay in the list, but will not be scheduled for 
replication because no new rack would be found to allocate sufficient racks.
  Suggestion from Hairong: 
               If user doesn't specify a topology script for rack 
determination, we can ignore the check for enough racks.
  We can implement it by checking for config variable SCRIPT_FILENAME_KEY in 
blockHasEnoughRacks function. If this config key returns null 
blockHasEnoughRacks will return true, which will effectively eliminate the 
check for enough racks.
 



> All replicas of a block end up on only 1 rack
> ---------------------------------------------
>
>                 Key: HDFS-15
>                 URL: https://issues.apache.org/jira/browse/HDFS-15
>             Project: Hadoop HDFS
>          Issue Type: Bug
>            Reporter: Hairong Kuang
>            Assignee: Jitendra Nath Pandey
>            Priority: Critical
>         Attachments: HDFS-15.patch, HDFS-15.patch.2
>
>
> HDFS replicas placement strategy guarantees that the replicas of a block 
> exist on at least two racks when its replication factor is greater than one. 
> But fsck still reports that the replicas of some blocks  end up on one rack.
> The cause of the problem is that decommission and corruption handling only 
> check the block's replication factor but not the rack requirement. When an 
> over-replicated block loses a replica due to decomission, corruption, or 
> heartbeat lost, namenode does not take any action to guarantee that remaining 
> replicas are on different racks.
>  

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.

Reply via email to