Re: [PR] [HUDI-7164] Add start time query API in CompletionTimeQueryView [hudi]

via GitHub Thu, 30 Nov 2023 02:16:25 -0800


danny0405 commented on code in PR #10218:
URL: https://github.com/apache/hudi/pull/10218#discussion_r1410453788



##########
hudi-spark-datasource/hudi-spark/src/test/scala/org/apache/spark/sql/execution/benchmark/LSMTimelineReadBenchmark.scala:
##########
@@ -42,8 +42,9 @@ object LSMTimelineReadBenchmark extends HoodieBenchmarkBase {
    * Apple M2
    * pref load archived instants:              Best Time(ms)   Avg Time(ms)   
Stdev(ms)    Rate(M/s)   Per Row(ns)   Relative
    * 
------------------------------------------------------------------------------------------------------------------------
-   * read shim instants                                   18             32    
      15          0.1       17914.8       1.0X
-   * read instants with commit metadata                   19             25    
       5          0.1       19403.1       0.9X
+   * read slim instants                                  494            521    
      27          0.5        1899.6       1.0X
+   * read instants with commit metadata                 2544           2625    
     116          0.1        9785.9       0.2X
+   * read start time                                     156            177    
      26          1.7         601.1       3.2X

Review Comment:
   The archived timeline instants loading firstly buffer all the instants in 
memory then sort all the instants by start time, that sort takes time.



-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: [email protected]

For queries about this service, please contact Infrastructure at:
[email protected]

Re: [PR] [HUDI-7164] Add start time query API in CompletionTimeQueryView [hudi]

Reply via email to