alamb commented on issue #866:
URL: 
https://github.com/apache/arrow-datafusion/issues/866#issuecomment-897869195


   > If possible, can we have number of output streams/partitions per operator 
and their corresponding output rows, too? I am not sure if they are captured in 
repartition or not. If IIRC, the repartitioning only happens with certain 
setups.
   
   One thing perhaps we could do is to capture the statistics for each output 
partition and then add some way to aggregate them together. I think @andygrove  
suggested something like this on 
https://github.com/apache/arrow-datafusion/issues/679#issuecomment-874187741 
though in the context of aggregating for distributed queries


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: [email protected]

For queries about this service, please contact Infrastructure at:
[email protected]


Reply via email to