TheBuilderJR opened a new issue, #13785: URL: https://github.com/apache/datafusion/issues/13785
### Describe the bug

Based on the published benchmarks, I expected to see performance improvements, but I haven't seen any. Statistics are turned on, and in theory the optimizations from the last few releases should be kicking in, but they don't seem to be. Is there a guide on how to debug this? Are the main optimizations used in the benchmarks still hidden behind feature flags? If so, is there a guide on how to turn these flags on to optimize for performance?

### To Reproduce

Create an 8 GB file of random data and run `SELECT * FROM table ORDER BY timestamp` twice. Both runs take over 100 s.

### Expected behavior

The first run may be slow, but I expected the second run to at least be faster. Ideally the first run would also use the file statistics to run faster.

### Additional context

_No response_
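
The reproduction steps in this report can be sketched roughly as follows. This is a hedged illustration, not the reporter's actual script: the issue does not say which file format, schema, or API was used, so the CSV format, the `value` column, the table name `t` (chosen because `table` is a reserved word), and the helper names `write_random_csv` and `time_sorted_scan` are all assumptions. The query half also assumes the `datafusion` Python package is installed.

```python
import csv
import random
import time


def write_random_csv(path, rows, seed=0):
    """Write `rows` rows of random data with a `timestamp` column.

    Hypothetical helper: the issue does not specify the schema or format.
    Returns the number of data rows written.
    """
    rng = random.Random(seed)
    with open(path, "w", newline="") as f:
        writer = csv.writer(f)
        writer.writerow(["timestamp", "value"])
        for _ in range(rows):
            # Random, unsorted timestamps so ORDER BY has real work to do.
            writer.writerow([rng.randrange(0, 2_000_000_000), rng.random()])
    return rows


def time_sorted_scan(path, runs=2):
    """Run the ORDER BY query `runs` times; return wall-clock seconds per run.

    Hypothetical helper. The import is lazy so that data generation still
    works when the `datafusion` package is not installed.
    """
    from datafusion import SessionContext

    ctx = SessionContext()
    ctx.register_csv("t", path)
    timings = []
    for _ in range(runs):
        start = time.perf_counter()
        # collect() forces full execution of the sorted scan.
        ctx.sql("SELECT * FROM t ORDER BY timestamp").collect()
        timings.append(time.perf_counter() - start)
    return timings
```

Starting with a small row count confirms the script works before raising `rows` until the file approaches the 8 GB described above. Comparing the two entries returned by `time_sorted_scan` shows whether the second run is any faster than the first.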
