mbutrovich commented on issue #21543: URL: https://github.com/apache/datafusion/issues/21543#issuecomment-4237566750
> I think this is a good idea to pursue as well -- I also wonder if we have already created data in the row format, we could avoid the second copy entirely perhaps by keeping a list of sorted indices and then merging using those rather than copying the data again 🤔 Yeah I was looking at this right now, in fact: could a radix sort in row format benefit the merge phase? It's more plumbing and might be one of the later PRs in a sequence of changes, but on paper it seems like a good idea. -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: [email protected] For queries about this service, please contact Infrastructure at: [email protected] --------------------------------------------------------------------- To unsubscribe, e-mail: [email protected] For additional commands, e-mail: [email protected]
