There is an open pull request (https://github.com/apache/arrow/pull/11624) to expose these. They should be accessible in Arrow 7.0.
On Sat, Oct 30, 2021 at 10:05 AM Harshit Gupta <[email protected]> wrote: > 1. The hash_aggregate functions are in pyarrow.compute, but they're not > directly callable. Looks like GroupBy is unavailable in Python. > 2. Can you provide an example of hash_sum from pc.compute? > > Thanks >
