danny0405 commented on code in PR #10218:
URL: https://github.com/apache/hudi/pull/10218#discussion_r1410453788
##########
hudi-spark-datasource/hudi-spark/src/test/scala/org/apache/spark/sql/execution/benchmark/LSMTimelineReadBenchmark.scala:
##########
@@ -42,8 +42,9 @@ object LSMTimelineReadBenchmark extends HoodieBenchmarkBase {
* Apple M2
* pref load archived instants: Best Time(ms) Avg Time(ms)
Stdev(ms) Rate(M/s) Per Row(ns) Relative
*
------------------------------------------------------------------------------------------------------------------------
- * read shim instants 18 32
15 0.1 17914.8 1.0X
- * read instants with commit metadata 19 25
5 0.1 19403.1 0.9X
+ * read slim instants 494 521
27 0.5 1899.6 1.0X
+ * read instants with commit metadata 2544 2625
116 0.1 9785.9 0.2X
+ * read start time 156 177
26 1.7 601.1 3.2X
Review Comment:
The archived timeline instants loading firstly buffer all the instants in
memory then sort all the instants by start time, that sort takes time.
--
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
To unsubscribe, e-mail: [email protected]
For queries about this service, please contact Infrastructure at:
[email protected]