Github user ppadma commented on the issue:
https://github.com/apache/drill/pull/1227
@Ben-Zvi Thanks a lot for the review. I have updated the PR to address the
review comments. Please take a look.
Regarding spill files, here are my thoughts.
For the build side, I am using aggregate statistics, i.e. the average over all
batches. On the probe side, I am using the stats for each incoming batch and
adjusting the output row count accordingly. So, we can skip applying sizing for
batches spilled from the build side and continue doing what I am doing on the
probe side.
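
To make the idea above concrete, here is a rough sketch of the calculation. It is only an illustration, not the code in the PR: the class name, the method names, the byte budget and the 65536 row cap are all assumptions made for this example. The build side keeps a running average row width, and the probe side recomputes the output row count for every incoming batch.

```java
// Hypothetical sketch only: none of these names are Drill APIs or PR code.
class JoinOutputSizing {
    // Assumed upper bound on the number of rows in one output batch.
    static final int MAX_ROWS = 65536;

    // Running totals over all build-side batches seen so far.
    private long buildBytes;
    private long buildRows;

    // Build side: fold each incoming batch into the aggregate statistics.
    void addBuildBatch(long batchBytes, int rowCount) {
        buildBytes += batchBytes;
        buildRows += rowCount;
    }

    // Average build-side row width in bytes, rounded up.
    long avgBuildRowWidth() {
        return buildRows == 0 ? 0 : ceilDiv(buildBytes, buildRows);
    }

    // Probe side: called for every incoming probe batch to adjust the
    // output row count to the given output batch budget (in bytes).
    int outputRowCount(long outputBatchBytes, long probeBatchBytes, int probeRowCount) {
        long probeRowWidth = probeRowCount == 0 ? 0 : ceilDiv(probeBatchBytes, probeRowCount);
        long outRowWidth = avgBuildRowWidth() + probeRowWidth;
        if (outRowWidth == 0) {
            return MAX_ROWS;
        }
        long rows = outputBatchBytes / outRowWidth;
        // Always emit at least one row, never more than the assumed cap.
        return (int) Math.max(1, Math.min(MAX_ROWS, rows));
    }

    private static long ceilDiv(long a, long b) {
        return (a + b - 1) / b;
    }

    public static void main(String[] args) {
        JoinOutputSizing sizing = new JoinOutputSizing();
        sizing.addBuildBatch(1000, 10);
        sizing.addBuildBatch(3000, 10);
        // 16 MB budget is an arbitrary value chosen for the example.
        System.out.println(sizing.outputRowCount(16L * 1024 * 1024, 50_000, 500));
    }
}
```

In this sketch the build-side average does not depend on any single batch, so batches read back from spill would not need to be measured again, while the probe-side calculation runs per batch either way.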
Once your code is merged in, I will refactor the code as needed.
---