milenkovicm commented on PR #60:
URL: https://github.com/apache/datafusion-ray/pull/60#issuecomment-2643370553

   > 
   > Something desirable is to be able to specify the number of workers for the 
query. If we did this, maybe `--workers`? This way you can have a predictable 
resource allocation, worker wise, and having more concurrent queries on the 
cluster might be more manageable.
   >
   
   Would `--workers` be set per session or overall ray cluster? 
   I apologise if I give too many references to ballista, but my brain is wired 
to that concept at the moment. 
   Overall cluster parallelism in ballista is tied up to sum of executor 
parallelism. Specific session context parallelism can be set as session 
configuration parameter.
   
   Would it make sense to use `datafusion.execution.target_partitions` to 
control `--workers`? Ballista had `ballista.shuffle.partitions` session 
configuration option which would set  `datafusion.execution.target_partitions` 
to desired task parallelism.
   
   
   


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: github-unsubscr...@datafusion.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


---------------------------------------------------------------------
To unsubscribe, e-mail: github-unsubscr...@datafusion.apache.org
For additional commands, e-mail: github-h...@datafusion.apache.org

Reply via email to