EmilyMatt opened a new issue, #17146:
URL: https://github.com/apache/datafusion/issues/17146

   ### Is your feature request related to a problem or challenge?
   
   During benchmarking I've realized that the sort operations(especially the 
complex ones like multi-level sorts) don't really give me a lot of information 
to go on(generally spill info and the baseline metrics)
   I wanted to look into what happens in the actual sort and what takes time 
there, so I created additional metrics which I found helpful and I think can be 
useful for anyone else (I'll submit a PR shortly).
   
   ### Describe the solution you'd like
   
   What I found useful were timers regarding the evaluation of the actual sort 
columns, the actual sort on the indices, and the following take algorithm.
   but this can be extended further, into "how many levels were in this 
multi-level merge", "how much time were we awaiting spill tasks", or any other 
metrics that comes to mind.
   
   ### Describe alternatives you've considered
   
   _No response_
   
   ### Additional context
   
   _No response_


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: github-unsubscr...@datafusion.apache.org.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


---------------------------------------------------------------------
To unsubscribe, e-mail: github-unsubscr...@datafusion.apache.org
For additional commands, e-mail: github-h...@datafusion.apache.org

Reply via email to