2010YOUY01 commented on issue #12088:
URL: https://github.com/apache/datafusion/issues/12088#issuecomment-2308014959

   > @2010YOUY01 I'm personally not very familiar with exchange based 
parallelism. Could you point me in the direction of a good paper/resource on 
the topic. Assuming `How Query Engines Work` has a good section on it: 
https://howqueryengineswork.com/12-parallel-query.html
   
   I think this doc has a really good explanation 
https://github.com/apache/datafusion/blob/932adabcc04faf65b8c8670b2385dc13a0f849f4/datafusion/expr-common/src/accumulator.rs#L151-L197
   If run a query with `target_partitions = 2`, the dataflow will look like the 
figure in that doc
   
   


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: github-unsubscr...@datafusion.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


---------------------------------------------------------------------
To unsubscribe, e-mail: github-unsubscr...@datafusion.apache.org
For additional commands, e-mail: github-h...@datafusion.apache.org

Reply via email to