Thanks.

Lirong Jian
HashData Inc.


Wes McKinney <wesmck...@gmail.com> 于2020年5月7日周四 下午10:44写道:

> Feel free to follow ARROW-5002
>
> On Thu, May 7, 2020 at 2:52 AM Lirong Jian <jianlir...@gmail.com> wrote:
> >
> > Any update of this task?
> >
> > We are very interested in this feature. Thanks.
> >
> > Lirong Jian
> > HashData Inc.
> >
> >
> > Chendi.Xue (Jira) <j...@apache.org> 于2019年11月14日周四 下午2:02写道:
> >
> > > Chendi.Xue created ARROW-7165:
> > > ---------------------------------
> > >
> > >              Summary: [C++] Arrow Compute Group By Support
> > >                  Key: ARROW-7165
> > >                  URL: https://issues.apache.org/jira/browse/ARROW-7165
> > >              Project: Apache Arrow
> > >           Issue Type: New Feature
> > >           Components: C++ - Compute
> > >             Reporter: Chendi.Xue
> > >
> > >
> > > Not sure if there is any plan to support groupby in arrow?
> > >
> > > Here is some to do in my mind:
> > >  # To make current arrow/compute/kernels/hash supporting received a
> > > memo_table as input, so multiple array will be able to get dictencode
> and
> > > valuecount based on same hashmap with a unified index.
> > >  # To add a split array function instead of using take multiple time to
> > > split one array to several ones.
> > >  # so the output array can use current funcs under compute/kernels,
> such
> > > as sum/count/sort to support group by.
> > >
> > > But this is some of my basic idea, wanna know if there is a on going
> plan
> > > on this?
> > >
> > >
> > >
> > > --
> > > This message was sent by Atlassian Jira
> > > (v8.3.4#803005)
> > >
>

Reply via email to