[GitHub] [arrow-datafusion] tustvold commented on issue #5230: Use Arrow Row Format in SortExec

via GitHub Fri, 03 Mar 2023 06:40:07 -0800


tustvold commented on issue #5230:
URL: 
https://github.com/apache/arrow-datafusion/issues/5230#issuecomment-1453629481


   >  we are seeing regression when the execute call is sorting a single batch 
of size 12500 (total benchmark input size is 100000, broken up into 8 
partitions)
   
   Do you see a similar regression in the single partition case, but if you 
instead reduce the size of the total benchmark down by a factor of 8? I could 
understand it if there were dictionaries involved, but the worst regression 
appears to be "sort mixed tuple preserve partitioning" which is just strings 
and primitives...


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: [email protected]

For queries about this service, please contact Infrastructure at:
[email protected]

[GitHub] [arrow-datafusion] tustvold commented on issue #5230: Use Arrow Row Format in SortExec

Reply via email to