cloud-fan commented on pull request #28778: URL: https://github.com/apache/spark/pull/28778#issuecomment-642593763
After more thoughts, I'm wondering what's the real use case of it. The default parallelism depends on the cluster resources, and it looks weird if different sessions can have different default parallelism. Looking at the changes in this PR, I think most of them don't really need a per-session config to tune it. The only place looks reasonable is where we split file partitions. Maybe we can just add a new config to do fine-grained control of the file partition splitting? ---------------------------------------------------------------- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. For queries about this service, please contact Infrastructure at: [email protected] --------------------------------------------------------------------- To unsubscribe, e-mail: [email protected] For additional commands, e-mail: [email protected]
