Xinyu Tan created RATIS-1787:
--------------------------------

             Summary: Don't  generate md5 file for each file when a file is 
received during InstallSnapshot to reduce the pressure on the file system
                 Key: RATIS-1787
                 URL: https://issues.apache.org/jira/browse/RATIS-1787
             Project: Ratis
          Issue Type: Improvement
            Reporter: Xinyu Tan


The IoTDB community has experienced a number of performance issues with MD5 
calculations when using the Snapshot feature.

Originally, MD5 calculation is to prevent errors in file transmission. However, 
in the previous implementation, when we generate a Snapshot, we need to load 
all files to calculate the snapshot, even though we do not need to send 
snapshot at this time, which causes a lot of resource preemption in the 
background. [~William Song] in RATIS-1597 has combined client-side MD5 
calculations with streaming, thus avoiding a lot of background IO and computing 
tasks. However, at the Snapshot receiver, one MD5 file is still currently 
stored for each file. At present, the snapshot file level of IoTDB may be tens 
of thousands. These tens of thousands of small md5 files cause a lot of 
pressure on the file system. Is it possible that we do not store an md5 file 
for each file when receiving?



--
This message was sent by Atlassian Jira
(v8.20.10#820010)

Reply via email to