korowa commented on PR #5057: URL: https://github.com/apache/arrow-datafusion/pull/5057#issuecomment-1404084910
@tustvold , thank you for the comments! Initially my intention was to handle scan planning as early as possible, so `ListingTable` looked like proper place for this - it actually holds parallelism settings in `options` attribute, and fetches required metadata. But, yeah, now I see that physical optimizer, and especially its `repartitioning` rule is much better suited for repatitioning ParquetExec 🤔 . I guess I'll convert this PR to draft and come up a bit later with updated version of this optimizer rule. -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: [email protected] For queries about this service, please contact Infrastructure at: [email protected]
