2010YOUY01 commented on issue #12088: URL: https://github.com/apache/datafusion/issues/12088#issuecomment-2308014959
> @2010YOUY01 I'm personally not very familiar with exchange based parallelism. Could you point me in the direction of a good paper/resource on the topic. Assuming `How Query Engines Work` has a good section on it: https://howqueryengineswork.com/12-parallel-query.html

I think this doc has a really good explanation: https://github.com/apache/datafusion/blob/932adabcc04faf65b8c8670b2385dc13a0f849f4/datafusion/expr-common/src/accumulator.rs#L151-L197

If you run a query with `target_partitions = 2`, the dataflow will look like the figure in that doc.
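For readers who have not opened the linked doc comment, here is a rough sketch of the kind of dataflow it describes, assuming the figure shows the usual two-phase plan: a partial aggregate per input partition, a hash repartition (the "exchange") on the group key, then a final aggregate per output partition. This is plain Python, not DataFusion code; every function name below is hypothetical, and `zlib.crc32` is only a stand-in for whatever hash function the engine actually uses.

```python
import zlib
from collections import defaultdict

TARGET_PARTITIONS = 2  # mirrors `target_partitions = 2` from the comment


def partition_of(key: str, n: int) -> int:
    # Deterministic stand-in for the engine's hash function (assumption).
    return zlib.crc32(key.encode("utf-8")) % n


def partial_aggregate(rows):
    # Phase 1: each input partition sums its own rows per group key.
    state = defaultdict(int)
    for key, value in rows:
        state[key] += value
    return dict(state)


def hash_repartition(partial_states, n):
    # Exchange: route every (key, partial_sum) pair by hash of the key,
    # so all partial states for one key land in the same output partition.
    outputs = [[] for _ in range(n)]
    for state in partial_states:
        for key, partial_sum in state.items():
            outputs[partition_of(key, n)].append((key, partial_sum))
    return outputs


def final_aggregate(pairs):
    # Phase 2: merge the partial sums that arrived for each key.
    merged = defaultdict(int)
    for key, partial_sum in pairs:
        merged[key] += partial_sum
    return dict(merged)


def run(input_partitions, n=TARGET_PARTITIONS):
    partials = [partial_aggregate(rows) for rows in input_partitions]
    exchanged = hash_repartition(partials, n)
    return [final_aggregate(pairs) for pairs in exchanged]


# Two input partitions of (group_key, value) rows; made-up sample data.
INPUT_PARTITIONS = [
    [("a", 1), ("b", 2), ("a", 3)],
    [("b", 4), ("c", 5), ("a", 6)],
]

if __name__ == "__main__":
    for i, result in enumerate(run(INPUT_PARTITIONS)):
        print(f"output partition {i}: {result}")
```

Because the exchange routes by key, each group ends up in exactly one output partition, so the final phase can run on both partitions in parallel without further coordination.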