ChenSammi commented on PR #6331:
URL: https://github.com/apache/ozone/pull/6331#issuecomment-1982866848

   > @duongkame @kerneltime thoughts? Making the smaller size the default 
across the board seems a little aggresive.
   
   Currently there is large gap between Ozone's sequential read performance and 
random read performance with 1MB byte.per.checksum value, which only favor 
applications who heavily rely on sequential read.  But for other applications, 
such as Hbase, impala, spark with Parquet files, they will suffer the bad 
random read performance. 
   
   I think we need a default value of this property to get a balanced 
sequential read and random read performance. With 16KB as default 
byte.per.checksum, the execution time for sequential read dropped from ~50s to 
~60s, while random read execution time improved from ~100s to ~60s(See the 
tables in MR description). If the applications has combined sequential read and 
random read, the overall performance will be get improved.  
   
   And it not only benefit HBASE, but will also benefit most of other 
applications too.  


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: [email protected]

For queries about this service, please contact Infrastructure at:
[email protected]


---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

Reply via email to