jychen7 commented on issue #5276: URL: https://github.com/apache/arrow-datafusion/issues/5276#issuecomment-1504223131
btw, glad that https://github.com/apache/arrow-datafusion/issues/5325 fixed the regression on `q9`

```
SELECT "RegionID", SUM("AdvEngineID"), COUNT(*) AS c, AVG("ResolutionWidth"), COUNT(DISTINCT "UserID") FROM hits GROUP BY "RegionID" ORDER BY c DESC LIMIT 10;
```

I also created an issue for q32: #5969

```
SELECT "WatchID", "ClientIP", COUNT(*) AS c, SUM("IsRefresh"), AVG("ResolutionWidth") FROM hits GROUP BY "WatchID", "ClientIP" ORDER BY c DESC LIMIT 10
```

Summary of queries that are 2x slower than DuckDB

| ClickBench | DataFusion v18 (parquet) | DataFusion v22 (parquet) | DuckDB 2022-11 (parquet) | Issue Tracker |
|-|-|-|-|-|
| q0 | 0.22s | 0.22s | 0.03s | |
| q4 | 2.66s | 2.7s | 0.79s | |
| q8 | 2.87s | 3s | 0.9s | |
| q9 | 52s | 3.5s | 1.29s | #5325 |
| q12 | 2s | 2s | 0.76s | |
| q13 | 3.62s | 3.7s | 1.21s | |
| q14 | 2.08s | 2.08s | 0.84s | |
| q15 | 3.42s | 0.84s | | |
| q16 | 4.99s | 5.0s | 1.86s | |
| q17 | 4.32s | 4.34s | 1.81s | |
| q18 | 10.74s | 10.9s | 3.49s | |
| q30 | 2.88s | 2.94s | 0.92s | |
| q31 | 5.22s | 4.4s | 1.14s | |
| q32 | - | - | 5.28s | #5969 |
| q33 | 8.54s | 8.7s | 3.15s | |
| q34 | 9.01s | 9.2s | 3.19s | |
| q35 | 3.66s | 3.5s | 0.88s | |
| q36 | 0.4s | 0.4s | 0.18s | |
| q39 | 0.77s | 0.77s | 0.32s | |

--
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: [email protected]

For queries about this service, please contact Infrastructure at:
[email protected]
