[ 
https://issues.apache.org/jira/browse/RATIS-1787?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17691685#comment-17691685
 ] 

Xinyu Tan commented on RATIS-1787:
----------------------------------

[~szetszwo] Thank you for your continued support of the IoTDB community. What 
are your thoughts on this issue? If our requirement is not a common one, could 
we add a configuration option so that performance-sensitive projects do not 
have to store these md5 files?

> Don't generate md5 file for each file when a file is received during 
> InstallSnapshot to reduce the pressure on the file system
> -------------------------------------------------------------------------------------------------------------------------------
>
>                 Key: RATIS-1787
>                 URL: https://issues.apache.org/jira/browse/RATIS-1787
>             Project: Ratis
>          Issue Type: Improvement
>            Reporter: Xinyu Tan
>            Priority: Major
>
> The IoTDB community has run into a number of performance issues with MD5 
> calculation when using the Snapshot feature.
> The MD5 calculation was originally introduced to detect errors in file 
> transmission. However, in the previous implementation, generating a Snapshot 
> required loading every file to calculate its MD5, even when the snapshot did 
> not need to be sent at that time, which caused heavy resource contention in 
> the background. In RATIS-1597, [~William Song] combined the client-side MD5 
> calculation with streaming, which avoids much of that background IO and 
> computation. However, the Snapshot receiver still stores one MD5 file for 
> each file it receives. At present, a single IoTDB snapshot may contain tens 
> of thousands of files, and the same number of small md5 files puts a lot of 
> pressure on the file system. Would it be possible not to store an md5 file 
> for each file on the receiving side?
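
To make the request concrete, here is a minimal sketch of the receiver-side 
idea: hash the bytes while they are being written, compare against the digest 
supplied by the sender, and keep nothing on disk except the data file itself. 
The class and method names (StreamingMd5Check, receive, receiveAndVerify) are 
hypothetical and are not part of the Ratis API. The sketch only assumes that 
the sender's digest arrives together with the file data.

```java
import java.io.IOException;
import java.io.InputStream;
import java.io.OutputStream;
import java.nio.file.Files;
import java.nio.file.Path;
import java.security.DigestInputStream;
import java.security.MessageDigest;
import java.security.NoSuchAlgorithmException;

// Hypothetical helper for illustration only; not taken from Ratis.
public final class StreamingMd5Check {
  private StreamingMd5Check() {}

  /** Copies the stream to target while hashing it; returns the MD5 of the bytes read. */
  static byte[] receive(InputStream in, Path target)
      throws IOException, NoSuchAlgorithmException {
    MessageDigest md5 = MessageDigest.getInstance("MD5");
    try (DigestInputStream hashed = new DigestInputStream(in, md5);
         OutputStream out = Files.newOutputStream(target)) {
      byte[] buf = new byte[8192];
      int n;
      // Every read updates the digest, so no second pass over the file is needed.
      while ((n = hashed.read(buf)) != -1) {
        out.write(buf, 0, n);
      }
    }
    return md5.digest();
  }

  /**
   * Receives a file and checks it against the sender's digest in memory.
   * No sidecar .md5 file is written; a corrupted file is deleted.
   */
  static boolean receiveAndVerify(InputStream in, Path target, byte[] expectedMd5)
      throws IOException, NoSuchAlgorithmException {
    byte[] actual = receive(in, target);
    boolean ok = MessageDigest.isEqual(actual, expectedMd5);
    if (!ok) {
      Files.deleteIfExists(target);
    }
    return ok;
  }
}
```

With this approach the directory holding a received snapshot would contain 
only the data files, so tens of thousands of snapshot files would no longer 
imply the same number of extra small files. The trade-off is that the digest 
is not available later without re-reading the file.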



--
This message was sent by Atlassian Jira
(v8.20.10#820010)
