milenkovicm commented on PR #60: URL: https://github.com/apache/datafusion-ray/pull/60#issuecomment-2643370553
> > Something desirable is to be able to specify the number of workers for the query. If we did this, maybe `--workers`? This way you can have a predictable resource allocation, worker wise, and having more concurrent queries on the cluster might be more manageable. > Would `--workers` be set per session or overall ray cluster? I apologise if I give too many references to ballista, but my brain is wired to that concept at the moment. Overall cluster parallelism in ballista is tied up to sum of executor parallelism. Specific session context parallelism can be set as session configuration parameter. Would it make sense to use `datafusion.execution.target_partitions` to control `--workers`? Ballista had `ballista.shuffle.partitions` session configuration option which would set `datafusion.execution.target_partitions` to desired task parallelism. -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: github-unsubscr...@datafusion.apache.org For queries about this service, please contact Infrastructure at: us...@infra.apache.org --------------------------------------------------------------------- To unsubscribe, e-mail: github-unsubscr...@datafusion.apache.org For additional commands, e-mail: github-h...@datafusion.apache.org