[C++] Dataset API simplification

2021-03-25 Thread Weston Pace
This is a bit of a follow-up on https://issues.apache.org/jira/browse/ARROW-11782 and also a bit of a consequence of my work on https://issues.apache.org/jira/browse/ARROW-7001 (nested scan parallelism). I think the current dataset interface should be simplified. Currently, we have Dataset ->* Fra

Re: [RESULT] [VOTE] Accept donation of Rust Ballista project

2021-03-25 Thread Andy Grove
I have started a thread on gene...@incubator.apache.org regarding IP clearance. https://lists.apache.org/x/thread.html/rf59e9b18e09ed02fb2aa2ed085704f1b987be889d244cf1a7178463d@%3Cgeneral.incubator.apache.org%3E On Thu, Mar 25, 2021 at 8:03 AM Andy Grove wrote: > Re-sending with result subject

[RESULT] [VOTE] Accept donation of Rust Ballista project

2021-03-25 Thread Andy Grove
Re-sending with result subject line. On Thu, Mar 25, 2021 at 7:28 AM Andy Grove wrote: > Thank you all for voting. > > The vote passes with 8 binding votes from PMC members and 8 non-binding > votes. > > I will begin the process of contacting contributors and asking them to > submit CLAs and I w

Re: [VOTE] Accept donation of Rust Ballista project

2021-03-25 Thread Andy Grove
Thank you all for voting. The vote passes with 8 binding votes from PMC members and 8 non-binding votes. I will begin the process of contacting contributors and asking them to submit CLAs and I will also reach out to the Apache Incubator team about the process in case we cannot obtain CLAs from a

[NIGHTLY] Arrow Build Report for Job nightly-2021-03-25-0

2021-03-25 Thread Crossbow
Arrow Build Report for Job nightly-2021-03-25-0 All tasks: https://github.com/ursacomputing/crossbow/branches/all?query=nightly-2021-03-25-0 Failed Tasks: - conda-linux-gcc-py37-aarch64: URL: https://github.com/ursacomputing/crossbow/branches/all?query=nightly-2021-03-25-0-drone-conda-linux

Re: sparse data array

2021-03-25 Thread Jorge Cardoso Leitão
Would it be an option to use a StructArray for that? One array with the values, and one with the repetitions: Int32([1, 2, 2, 2, 3, 3, 3, 3, 3, 3, 1, 2]) -> StructArray([ "values": Int32([1, 2, 3, 1, 2]), "repetitions": UInt32([1, 3, 6, 1, 1]), ]) It does not have the same API, but I thi
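
The values/repetitions layout suggested in this message is a run-length encoding. A minimal sketch of the idea follows, using plain Python lists rather than any Arrow API; the function names `rle_encode` and `rle_decode` and the dict layout are hypothetical and chosen only for illustration.

```python
from itertools import groupby


def rle_encode(data):
    """Collapse consecutive equal items into parallel value/count lists.

    Hypothetical helper: mirrors the "values" and "repetitions" fields
    of the struct layout described in the message, not an Arrow function.
    """
    values, repetitions = [], []
    for value, run in groupby(data):
        values.append(value)
        # Count the length of this run of equal items.
        repetitions.append(sum(1 for _ in run))
    return {"values": values, "repetitions": repetitions}


def rle_decode(encoded):
    """Expand the value/count lists back into the original sequence."""
    out = []
    for value, count in zip(encoded["values"], encoded["repetitions"]):
        out.extend([value] * count)
    return out
```

Decoding an encoded sequence should give back the original list, and the repetition counts always sum to the original length. In Arrow itself the two lists would presumably be the child arrays of a struct, which is why random access by logical index is not as direct as with a plain array.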

Re: sparse data array

2021-03-25 Thread Kirill Lykov
Thanks for the answer. I asked about it because we need it and I was about to write a summer intern proposal for a student to work on it. Looks like it could work fine. On Wed, Mar 24, 2021 at 3:49 PM Wes McKinney wrote: > The SparseTensor stuff is something else entirely (that's matrices > where