crepererum commented on issue #8777: URL: https://github.com/apache/arrow-datafusion/issues/8777#issuecomment-1894093051
> where the same stream can be consumed at different rates potentially needing to buffer the entire intermediate result or else the plan will deadlock I think this is very similar to our repartitioning code and the trade-offs and problems we see there. The reason is that a repartition is basically also as single input with mulitiple consumers. Just think of it like the same data but with a column "bool: belongs to this output" added. -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: [email protected] For queries about this service, please contact Infrastructure at: [email protected]
