Marton Elek created HDDS-4459:
---------------------------------

             Summary: Improve closed container replication
                 Key: HDDS-4459
                 URL: https://issues.apache.org/jira/browse/HDDS-4459
             Project: Hadoop Distributed Data Store
          Issue Type: Improvement
            Reporter: Marton Elek
            Assignee: Marton Elek


Closed containers are immutable, therefore their replication is different from
the Ratis-based replication of open containers.

Closed containers are downloaded from other datanodes over a dedicated gRPC
channel.

While the first implementation works as intended, it leaves a lot of room for
improvement:

 1. The container replica is compressed with gzip. A more efficient compression
codec like zstd might be better (reported by Wei-Chiu)

 2. The current downloader uses only one thread (reported by Wei-Chiu)

 3. Temporary space is configurable, but by default uses /tmp. This means we 
are bottlenecked by the root disk, even though we have 48 disks on each DN. 
There should be an option to store the temp file under the destination disk 
instead (reported by Wei-Chiu)

 4. The buffer size is hard-coded (1MB) and may not be ideal

 5. There are no metrics which clearly show the performance of the replication
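
Items 3, 4 and 5 can be illustrated together with a minimal sketch. This is not the existing Ozone downloader: the class, method, and field names below are hypothetical, and it uses only the JDK. It assumes the temporary file lives in a "tmp" directory on the destination volume (so the final step is a rename on the same file system rather than a copy from the root disk), takes the buffer size as a parameter instead of a constant, and keeps a few counters from which throughput could be derived.

```java
import java.io.IOException;
import java.io.InputStream;
import java.io.OutputStream;
import java.nio.file.Files;
import java.nio.file.Path;
import java.nio.file.StandardCopyOption;
import java.util.concurrent.atomic.AtomicLong;

/** Hypothetical sketch; none of these names come from the Ozone code base. */
final class ContainerDownloadSketch {

  /** Simple counters; bytesCopied / nanosSpent gives an average throughput. */
  static final class Metrics {
    final AtomicLong bytesCopied = new AtomicLong();
    final AtomicLong nanosSpent = new AtomicLong();
    final AtomicLong containersDone = new AtomicLong();
  }

  /**
   * Streams a container archive to a temp file on the destination volume,
   * then renames it into place.
   */
  static Path download(InputStream source, Path destinationVolume,
      String fileName, int bufferSize, Metrics metrics) throws IOException {
    if (bufferSize <= 0) {
      throw new IllegalArgumentException("bufferSize must be positive");
    }
    long start = System.nanoTime();
    // Assumption: a "tmp" directory under the destination volume is acceptable.
    Path tmpDir = Files.createDirectories(destinationVolume.resolve("tmp"));
    Path tmpFile = Files.createTempFile(tmpDir, "container-", ".part");
    try {
      byte[] buffer = new byte[bufferSize]; // configurable, not hard-coded
      try (OutputStream out = Files.newOutputStream(tmpFile)) {
        int n;
        while ((n = source.read(buffer)) != -1) {
          out.write(buffer, 0, n);
          metrics.bytesCopied.addAndGet(n);
        }
      }
      Path target = destinationVolume.resolve(fileName);
      // Source and target share a volume, so this does not cross disks.
      Files.move(tmpFile, target, StandardCopyOption.REPLACE_EXISTING);
      metrics.containersDone.incrementAndGet();
      return target;
    } finally {
      // No-op after a successful move; cleans up the partial file on failure.
      Files.deleteIfExists(tmpFile);
      metrics.nanosSpent.addAndGet(System.nanoTime() - start);
    }
  }
}
```

Because the counters are atomic and each call works on its own temp file, the same method could be submitted to a fixed-size thread pool to address item 2; whether that actually helps depends on how many source datanodes and destination disks are involved, which would need to be measured.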

 



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

