andygrove commented on issue #670: URL: https://github.com/apache/datafusion-comet/issues/670#issuecomment-2234100899
Benchmark runs @ sf=100 suggest that reading decimal from parquet could potentially be a performance issue. ``` TPCDS Micro Benchmarks: Best Time(ms) Avg Time(ms) Stdev(ms) Rate(M/s) Per Row(ns) Relative ------------------------------------------------------------------------------------------------------------------------ add_many_decimals 20502 20648 208 14.0 71.2 1.0X add_many_decimals 20498 20544 65 14.1 71.2 1.0X add_many_decimals: Comet (Scan) 28143 28161 26 10.2 97.7 0.7X add_many_decimals: Comet (Scan, Exec) 19323 19497 246 14.9 67.1 1.1X TPCDS Micro Benchmarks: Best Time(ms) Avg Time(ms) Stdev(ms) Rate(M/s) Per Row(ns) Relative -------------------------------------------------------------------------------------------------------------------------------- agg_sum_decimals_no_grouping 10552 10583 44 27.3 36.6 1.0X agg_sum_decimals_no_grouping 10406 10450 61 27.7 36.1 1.0X agg_sum_decimals_no_grouping: Comet (Scan) 46013 46278 375 6.3 159.8 0.2X agg_sum_decimals_no_grouping: Comet (Scan, Exec) 13840 13956 164 20.8 48.1 0.8X ``` -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: github-unsubscr...@datafusion.apache.org For queries about this service, please contact Infrastructure at: us...@infra.apache.org --------------------------------------------------------------------- To unsubscribe, e-mail: github-unsubscr...@datafusion.apache.org For additional commands, e-mail: github-h...@datafusion.apache.org