westonpace opened a new issue, #14866:
URL: https://github.com/apache/arrow/issues/14866

   ### Describe the enhancement requested
   
   Currently there are two ways to compute a group by.  The supported way is to 
use an aggregate node in an exec plan.  The second (internal) way is to use the 
internal function `arrow::internal::GroupBy`.  This internal function 
simulates, but does not actually use, an aggregate node.
   
   The internal implementation has caused issues in the past where we did not 
notice an error in the aggregate node's invocation of aggregate kernels since 
we use the internal function for testing aggregates and it behaves slightly 
differently.  The internal implementation also requires maintenance and 
significantly complicated #14352 .
   
   I would like to remove the internal implementation.  Unfortunately, the 
internal implementation is used by tests, benchmarks, and pyarrow.  However, we 
should be able to update those bindings to a friendly wrapper around exec plans.
   
   ### Component(s)
   
   C++


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: [email protected]

For queries about this service, please contact Infrastructure at:
[email protected]

Reply via email to