TheBuilderJR opened a new issue, #13785:
URL: https://github.com/apache/datafusion/issues/13785

   ### Describe the bug
   
   I expected based on the published benchmarks to have improvements, but I 
haven't seen any. I do see statistics are turned on and in theory optimizations 
in the last few releases should be kicking in, but they don't seem to be? Is 
there any guide on how to debug this? Are the main optimizations used in the 
benchmarks still hidden behind feature flags? If so is there a guide on how to 
turn on these flags to optimize for performance?
   
   ### To Reproduce
   
   Create a 8gb file of random data, run SELECT * FROM table ORDER by timestamp 
two times, see both times take over 100s
   
   ### Expected behavior
   
   Maybe first time is slow, but I expected second time to at least be faster. 
Ideally first time also utilizes the file statistics to run faster
   
   ### Additional context
   
   _No response_


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: [email protected]

For queries about this service, please contact Infrastructure at:
[email protected]


---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

Reply via email to