Rachelint commented on PR #11758:
URL: https://github.com/apache/datafusion/pull/11758#issuecomment-2283482260

   @JasonLi-cn As I think, maybe we should impl the special block based 
`GroupValues` impls:
   - We pass the `block size` when initializing it
   - It manage the inner values block by block
   - It return all blocks with internal `block size`
   We can always make the `block size == batch size`, so we can totally avoid 
any split operators.
   
   I think the `GroupValues` impls maybe should not care about the `batch 
size`? And we just do the `split and merge` work in the 
`GroupedHashAggregateStream::poll` if the unfortunately, the `batch size != 
block size` (usually they will equal)?
   
   I am making a try about it in #11943 , and have done some related code 
changes.


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: [email protected]

For queries about this service, please contact Infrastructure at:
[email protected]


---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

Reply via email to