Elek, Marton created HDDS-199:
---------------------------------

             Summary: Implement ReplicationManager to replicate ClosedContainer
                 Key: HDDS-199
                 URL: https://issues.apache.org/jira/browse/HDDS-199
             Project: Hadoop Distributed Data Store
          Issue Type: Improvement
          Components: SCM
            Reporter: Elek, Marton
             Fix For: 0.2.1


HDDS/Ozone supports Open and Closed containers. In case of specific conditions 
(container is full, node is failed) the container will be closed and will be 
replicated in a different way. The replication of Open containers are handled 
with Ratis and PipelineManger.

The ReplicationManager should handle the replication of the ClosedContainers. 
The replication information will be sent as an event 
(UnderReplicated/OverReplicated). 

The Replication manager will collect all of the events in a priority queue (to 
replicate first the containers where more replica is missing) calculate the 
destination datanode (first with a very simple algorithm, later with 
calculating scatter-width) and send the Copy/Delete container to the datanode 
(CommandQueue).

A CopyCommandWatcher/DeleteCommandWatcher are also included to retry the 
copy/delete in case of failure. This is an in-memory structure (based on 
HDDS-195) which can requeue the underreplicated/overreplicated events to the 
prioirity queue unless the confirmation of the copy/delete command is arrived.



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)

---------------------------------------------------------------------
To unsubscribe, e-mail: hdfs-dev-unsubscr...@hadoop.apache.org
For additional commands, e-mail: hdfs-dev-h...@hadoop.apache.org

Reply via email to