alamb commented on PR #22010: URL: https://github.com/apache/datafusion/pull/22010#issuecomment-4481358430
> Thanks @alamb and @Dandandan for the reviews! I'm pulling this one in now, will likely invest some more time in this operator to catch some other low hanging fruits

LOL I am not sure there are many **low hanging** fruits in the operator, but I think any improvements you can make will make a big difference to performance. RepartitionExec is one of the most performance critical operators we have in the system, I think.

Here is one idea for your amusement (I think it could be a pretty big win for queries that repartition a lot of rows, especially if those rows are strings):
- https://github.com/apache/datafusion/issues/11680
