pitrou opened a new pull request #11273:
URL: https://github.com/apache/arrow/pull/11273


When sorting a table, rechunk it homogeneously as record batches, so that the
cost of chunked indexing is paid once for all columns rather than once per column.

This helps performance when the first sort column has low cardinality,
yielding up to a 60% speedup on the sorting benchmarks.
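
To illustrate the idea, here is a minimal pure-Python sketch; it is not the Arrow C++ implementation from this pull request. It models a table as columns with differing chunk layouts, splits every column at the union of all chunk boundaries (the "record batches"), and then resolves a logical row index to a (batch, offset) pair once for all sort keys instead of once per column. The helper names (`chunk_offsets`, `resolve`, `rechunk`, `sort_indices_chunked`, `sort_indices_batched`) are hypothetical and chosen for this example only.

```python
from bisect import bisect_right
from itertools import accumulate


def chunk_offsets(chunks):
    """Start offset of each chunk, e.g. lengths [3, 2] -> [0, 3]."""
    return [0] + list(accumulate(len(c) for c in chunks))[:-1]


def resolve(offsets, i):
    """Map logical row i to (chunk_index, index_in_chunk) via binary search."""
    c = bisect_right(offsets, i) - 1
    return c, i - offsets[c]


def rechunk(columns):
    """Split every column at the union of all chunk boundaries.

    Returns a list of batches; each batch holds one slice per column and
    all slices in a batch have the same length. (This sketch copies data;
    a real implementation would slice without copying.)
    """
    n = sum(len(c) for c in columns[0])
    bounds = sorted({0, n} | {o for col in columns for o in chunk_offsets(col)})
    flat = [[v for chunk in col for v in chunk] for col in columns]
    return [[f[a:b] for f in flat] for a, b in zip(bounds, bounds[1:])]


def sort_indices_chunked(columns):
    """Sort row indices, resolving the chunk separately for every column."""
    offs = [chunk_offsets(col) for col in columns]

    def key(i):
        values = []
        for col, off in zip(columns, offs):
            c, j = resolve(off, i)  # one resolution per column
            values.append(col[c][j])
        return tuple(values)

    n = sum(len(c) for c in columns[0])
    return sorted(range(n), key=key)


def sort_indices_batched(batches):
    """Sort row indices, resolving the batch once for all columns."""
    off = chunk_offsets([b[0] for b in batches])

    def key(i):
        b, j = resolve(off, i)  # one resolution shared by all columns
        return tuple(col[j] for col in batches[b])

    n = sum(len(b[0]) for b in batches)
    return sorted(range(n), key=key)


# Two columns holding the same five rows with different chunk layouts.
columns = [
    [[1, 1, 2], [1, 2]],  # first sort key, low cardinality
    [[5, 3], [4, 2, 1]],  # second sort key
]
batches = rechunk(columns)
print(sort_indices_chunked(columns))  # [3, 1, 0, 4, 2]
print(sort_indices_batched(batches))  # [3, 1, 0, 4, 2]
```

Python's `sorted` computes each key only once, so this sketch shows the indexing pattern rather than the actual cost profile. In a comparison-based sort, ties in a low-cardinality first column force the comparison to read the later columns as well, which is presumably where sharing a single index resolution across columns helps.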



