[GitHub] [ozone] sodonnel commented on a diff in pull request #4014: HDDS-7560. Placement Policy Interface changes to handle Overreplication

GitBox Fri, 06 Jan 2023 14:17:16 -0800


sodonnel commented on code in PR #4014:
URL: https://github.com/apache/ozone/pull/4014#discussion_r1063846569



##########
hadoop-hdds/server-scm/src/main/java/org/apache/hadoop/hdds/scm/SCMCommonPlacementPolicy.java:
##########
@@ -503,4 +508,100 @@ public Set<ContainerReplica> 
replicasToCopyToFixMisreplication(
   protected Node getPlacementGroup(DatanodeDetails dn) {
     return nodeManager.getClusterNetworkTopologyMap().getAncestor(dn, 1);
   }
+
+  /**
+   * Given a set of replicas, expectedCount for Each replica,
+   * number of unique replica indexes. Replicas to be deleted for fixing over
+   * replication is computed.
+   * The algorithm starts with creating a replicaIdMap which contains the
+   * replicas grouped by replica Index. A placementGroup Map is created which
+   * groups replicas based on their rack & the replicas within the rack
+   * are further grouped based on the replica Index.
+   * A placement Group Count Map is created which keeps
+   * track of the count of replicas in each rack.
+   * We iterate through overreplicated replica indexes sorted in descending
+   * order based on their current replication factor in a descending factor.
+   * For each replica Index the replica is removed from the rack which contains
+   * the most replicas, in order to achieve this the racks are put
+   * into priority queue & are based on the number of replicas they have.
+   * The replica is removed from the rack with maximum replicas & the replica
+   * to be removed is also removed from the maps created above &
+   * the count for rack is reduced.
+   * The set of replicas computed are then returned by the function.
+   * @param replicas: Map of replicas with value signifying if
+   *                  replica can be copied
+   * @param expectedCountPerUniqueReplica
+   * @return Set of replicas to be removed are computed.
+   */
+  @Override
+  public Set<ContainerReplica> replicasToRemoveToFixOverreplication(
+          Set<ContainerReplica> replicas, int expectedCountPerUniqueReplica) {
+    Map<Integer, Set<ContainerReplica>> replicaIdMap = new HashMap<>();
+    Map<Node, Map<Integer, Set<ContainerReplica>>> placementGroupReplicaIdMap
+            = new HashMap<>();
+    Map<Node, Integer> placementGroupCntMap = new HashMap<>();
+    for (ContainerReplica replica:replicas) {
+      Integer replicaId = replica.getReplicaIndex();
+      Node placementGroup = getPlacementGroup(replica.getDatanodeDetails());
+      if (!replicaIdMap.containsKey(replicaId)) {
+        replicaIdMap.put(replicaId, Sets.newHashSet());
+      }
+      if (!placementGroupReplicaIdMap.containsKey(placementGroup)) {
+        placementGroupReplicaIdMap.put(placementGroup, Maps.newHashMap());
+      }
+      placementGroupCntMap.compute(placementGroup,
+              (group, cnt) -> (cnt == null ? 0 : cnt) + 1);
+      replicaIdMap.get(replicaId).add(replica);
+      Map<Integer, Set<ContainerReplica>> placementGroupReplicaIDMap =
+              placementGroupReplicaIdMap.get(placementGroup);
+      placementGroupReplicaIDMap.compute(replicaId,
+              (rid, placementGroupReplicas) -> {
+                if (placementGroupReplicas == null) {
+                  placementGroupReplicas = Sets.newHashSet();
+                }
+                placementGroupReplicas.add(replica);
+                return placementGroupReplicas;
+              });
+    }
+
+    Set<ContainerReplica> replicasToRemove = new HashSet<>();
+    List<Integer> sortedRIDList = replicaIdMap.keySet().stream()
+            .sorted((o1, o2) -> Integer.compare(replicaIdMap.get(o2).size(),

Review Comment:
   I guess these sets are small, so its doesn't matter much, but should we not 
filter before sorting, so we have less to sort?
   
   Also, why does this list need to be sorted? Any index which has more than 
the expected number of replicas needs to have the excess removed, so does it 
matter if we process the most over replicated index or in a random order?



-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: [email protected]

For queries about this service, please contact Infrastructure at:
[email protected]


---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

[GitHub] [ozone] sodonnel commented on a diff in pull request #4014: HDDS-7560. Placement Policy Interface changes to handle Overreplication

Reply via email to