rtpsw commented on PR #14352:
URL: https://github.com/apache/arrow/pull/14352#issuecomment-1328266070

   > > I'm still pretty reluctant to add code to handle chunked arrays. I feel 
it adds complexity that we will end up maintaining when chunked arrays don't 
really have a place in a streaming execution engine (since we process things 
one batch at a time usually).
   > 
   > This is understandable. I'll try to drop support for chunked arrays in 
this PR and report back on what seems to break; we may be able to find an 
alternative approach.
   
   My investigation suggests that the reason for introducing chunks in the 
first place is that the testers use tables, and their implementing class 
`SimpleTable` has `ChunkedArray` columns (even after `CombineChunks`) that the 
aggregation code needs to handle. Therefore, if we remove support for chunked 
arrays in the aggregation code, then it won't work nicely with table inputs. 
AFAIU, aggregating tables is a valid use case that should be supported. 
@westonpace, let me know your thoughts.
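
To illustrate the point about table columns, here is a minimal Python sketch. It is a toy model, not Arrow's API: `ToyChunkedArray`, `sum_array`, and `sum_column` are hypothetical names invented for this example. It shows that combining chunks still leaves a chunked wrapper around a single chunk, so a table-facing aggregation has to unwrap chunks either way, and that doing so can be a thin loop over an array-only kernel.

```python
from dataclasses import dataclass
from typing import List


@dataclass
class ToyChunkedArray:
    """Hypothetical stand-in for a chunked column: a list of contiguous chunks."""
    chunks: List[List[int]]

    def combine_chunks(self) -> "ToyChunkedArray":
        # Concatenate all chunks into one; the result is still a chunked wrapper.
        merged = [value for chunk in self.chunks for value in chunk]
        return ToyChunkedArray([merged])


def sum_array(values: List[int]) -> int:
    """Aggregation kernel that only understands one contiguous array."""
    return sum(values)


def sum_column(column: ToyChunkedArray) -> int:
    """Table-facing aggregation: apply the array kernel to each chunk in turn."""
    return sum(sum_array(chunk) for chunk in column.chunks)


column = ToyChunkedArray([[1, 2], [3], [4, 5]])
combined = column.combine_chunks()

# The combined column has a single chunk but keeps the chunked type,
# and both forms aggregate to the same value.
print(type(combined).__name__, len(combined.chunks))
print(sum_column(column), sum_column(combined))
```

In this sketch the chunk handling lives entirely in `sum_column`, so the array kernel itself never sees a chunked input. Whether the same separation is practical in the actual aggregation code is an open question for this PR.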


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: [email protected]

For queries about this service, please contact Infrastructure at:
[email protected]
