Re: [VOTE][RUST][DataFusion] Release Apache Arrow DataFusion 37.1.0 RC2

2024-04-19 Thread Andy Grove
+1 (binding)

Verified on Ubuntu 24.04.4 LTS.

Thanks, Andrew.

On Thu, Apr 18, 2024 at 8:04 PM L. C. Hsieh  wrote:

> +1 (binding)
>
> Verified on M3 Mac.
>
> Thanks Andrew.
>
> On Thu, Apr 18, 2024 at 2:20 PM Andrew Lamb  wrote:
> >
> > I would like to propose a release of Apache Arrow DataFusion
> Implementation,
> > version 37.1.0.
> >
> > Note this is the second RC (the first RC[4] did not include the change to
> > the version numbers[5] :facepalm:). I apologize for the runaround.
> >
> >
> > This release candidate is based on commit:
> > aee976aa1a75514c7dbb33ef47527b3ba99081dd [1]
> > The proposed release tarball and signatures are hosted at [2].
> > The changelog is located at [3].
> >
> > Please download, verify checksums and signatures, run the unit tests, and
> > vote
> > on the release. The vote will be open for at least 72 hours.
> >
> > Only votes from PMC members are binding, but all members of the community
> > are
> > encouraged to test the release and vote with "(non-binding)".
> >
> > The standard verification procedure is documented at
> >
> https://github.com/apache/arrow-datafusion/blob/main/dev/release/README.md#verifying-release-candidates
> > .
> >
> > [ ] +1 Release this as Apache Arrow DataFusion 37.1.0
> > [ ] +0
> > [ ] -1 Do not release this as Apache Arrow DataFusion 37.1.0 because...
> >
> > Here is my vote:
> >
> > +1
> >
> > [1]:
> >
> https://github.com/apache/arrow-datafusion/tree/aee976aa1a75514c7dbb33ef47527b3ba99081dd
> > [2]:
> >
> https://dist.apache.org/repos/dist/dev/arrow/apache-arrow-datafusion-37.1.0-rc2
> > [3]:
> >
> https://github.com/apache/arrow-datafusion/blob/aee976aa1a75514c7dbb33ef47527b3ba99081dd/CHANGELOG.md
> > [4]: https://lists.apache.org/thread/33bkbrlkqv962y0topx9rlqg19g5q2vk
> > [5]: https://github.com/apache/arrow-datafusion/pull/10128
>


Re: [VOTE][RUST][DataFusion] Release Apache Arrow DataFusion 37.1.0 RC1

2024-04-18 Thread Andy Grove
+1

Verified on Ubuntu 22.04.4 LTS

Note that I still had to set RUST_MIN_STACK to avoid a stack overflow. I
don't know if that is still expected.

On Thu, Apr 18, 2024 at 8:01 AM Andrew Lamb  wrote:

> Hi,
>
> I would like to propose a release of Apache Arrow DataFusion
> Implementation,
> version 37.1.0, a patch release with some bug fixes. Please see [4] for
> details.
> There is a failing CI test which only affects development tools [6].
>
> While DataFusion is now officially its own top level Apache project, we do
> not yet have enough infrastructure (email lists) setup to do voting
> there[5], so I would like to do this one last time on the arrow list.
>
> This release candidate is based on commit:
> d4eb72c30d45c0f3f359c64f41a6caed30abe750 [1]
> The proposed release tarball and signatures are hosted at [2].
> The changelog is located at [3].
>
> Please download, verify checksums and signatures, run the unit tests, and
> vote
> on the release. The vote will be open for at least 72 hours.
>
> Only votes from PMC members are binding, but all members of the community
> are
> encouraged to test the release and vote with "(non-binding)".
>
> The standard verification procedure is documented at
>
> https://github.com/apache/arrow-datafusion/blob/main/dev/release/README.md#verifying-release-candidates
> .
>
> [ ] +1 Release this as Apache Arrow DataFusion 37.1.0
> [ ] +0
> [ ] -1 Do not release this as Apache Arrow DataFusion 37.1.0 because...
>
> Here is my vote:
>
> +1
>
> [1]:
>
> https://github.com/apache/arrow-datafusion/tree/d4eb72c30d45c0f3f359c64f41a6caed30abe750
> [2]:
>
> https://dist.apache.org/repos/dist/dev/arrow/apache-arrow-datafusion-37.1.0-rc1
> [3]:
>
> https://github.com/apache/arrow-datafusion/blob/d4eb72c30d45c0f3f359c64f41a6caed30abe750/CHANGELOG.md
> [4]: https://github.com/apache/arrow-datafusion/issues/9904
> [5]: https://github.com/apache/arrow-datafusion/issues/9691
> [6]:
>
> https://github.com/apache/arrow-datafusion/pull/10128#issuecomment-2063655318
> .
>


Re: Arrow board report due April 10

2024-04-05 Thread Andy Grove
Thanks for all the contributions so far. Although the report does not need
to be submitted until 4/10, I plan on submitting it tomorrow (4/6) since I
am traveling from Sunday. Please let me know if this is an issue.

Thanks,

Andy.

On Sat, Mar 30, 2024 at 11:06 AM Andy Grove  wrote:

> It is time for us to submit another board report. I have created a Google
> document [1], which is currently a copy of the previous report, so that we
> can collaborate on preparing this.
>
> I would appreciate it if contributors of each subproject could update the
> relevant section of the report.
>
> Thanks,
>
> Andy.
>
> [1]
> https://docs.google.com/document/d/1q6uBW4MNijY8cThZf0d-XZTcCxAOhUgp1sUG6_eOBMY/edit?usp=sharing
>
>
>


Re: [RESULT][VOTE][RUST][DataFusion] Release Apache Arrow DataFusion 37.0.0 RC2

2024-04-04 Thread Andy Grove
Thanks for finishing the release process, Andrew.

On Thu, Apr 4, 2024 at 2:57 PM Andrew Lamb  wrote:

> The release is available here:
>   https://dist.apache.org/repos/dist/release/arrow/arrow-datafusion-37.0.0
>
> The release has also been published to crates.io:
> https://crates.io/crates/datafusion/37.0.0
>
> Thank you all for your help!
>
> Andrew
>
>
> On Thu, Apr 4, 2024 at 4:31 PM Andrew Lamb  wrote:
>
> > With 4 +1 votes (3 binding) the release is approved. I will now upload to
> > crates.io
> >
> > On Mon, Apr 1, 2024 at 11:59 AM Andrew Lamb 
> wrote:
> >
> >> +1
> >>
> >> I verified on M3 mac
> >>
> >> Thanks Andy for the quick turnaround on making a new RC
> >>
> >>
> >> Andrew
> >>
> >>
> >> On Mon, Apr 1, 2024 at 1:51 AM Jean-Baptiste Onofré 
> >> wrote:
> >>
> >>> +1 (non binding)
> >>>
> >>> - Hashed and signatures are OK
> >>> - ASF headers are present on expected file
> >>> - No binary file found in the source distribution
> >>> - Checked on MacOS M3
> >>>
> >>> Regards
> >>> JB
> >>>
> >>> On Mon, Apr 1, 2024 at 12:07 AM Andy Grove 
> >>> wrote:
> >>> >
> >>> > Subject: [VOTE][RUST][DataFusion] Release Apache Arrow DataFusion
> >>> 37.0.0 RC2
> >>> > Hi,
> >>> >
> >>> > I would like to propose a release of Apache Arrow DataFusion
> >>> Implementation,
> >>> > version 37.0.0.
> >>> >
> >>> > This release candidate is based on commit:
> >>> > 1fa25ae5d50c5f34f17e77e9f635f854ef5e7642 [1]
> >>> > The proposed release tarball and signatures are hosted at [2].
> >>> > The changelog is located at [3].
> >>> >
> >>> > Please download, verify checksums and signatures, run the unit tests,
> >>> and
> >>> > vote
> >>> > on the release. The vote will be open for at least 72 hours.
> >>> >
> >>> > Only votes from PMC members are binding, but all members of the
> >>> community
> >>> > are
> >>> > encouraged to test the release and vote with "(non-binding)".
> >>> >
> >>> > The standard verification procedure is documented at
> >>> >
> >>>
> https://github.com/apache/arrow-datafusion/blob/main/dev/release/README.md#verifying-release-candidates
> >>> > .
> >>> >
> >>> > [ ] +1 Release this as Apache Arrow DataFusion 37.0.0
> >>> > [ ] +0
> >>> > [ ] -1 Do not release this as Apache Arrow DataFusion 37.0.0
> because...
> >>> >
> >>> > Here is my vote:
> >>> >
> >>> > +1
> >>> >
> >>> > [1]:
> >>> >
> >>>
> https://github.com/apache/arrow-datafusion/tree/1fa25ae5d50c5f34f17e77e9f635f854ef5e7642
> >>> > [2]:
> >>> >
> >>>
> https://dist.apache.org/repos/dist/dev/arrow/apache-arrow-datafusion-37.0.0-rc2
> >>> > [3]:
> >>> >
> >>>
> https://github.com/apache/arrow-datafusion/blob/1fa25ae5d50c5f34f17e77e9f635f854ef5e7642/CHANGELOG.md
> >>> > -
> >>> > Running rat license checker on
> >>> >
> >>>
> /home/andy/git/apache/arrow-datafusion/dev/dist/apache-arrow-datafusion-37.0.0-rc2/apache-arrow-datafusion-37.0.0.tar.gz
> >>> > NOT APPROVED: .github/workflows/pr_benchmarks.yml
> >>> > (apache-arrow-datafusion-37.0.0/.github/workflows/pr_benchmarks.yml):
> >>> false
> >>> > NOT APPROVED: .github/workflows/pr_comment.yml
> >>> > (apache-arrow-datafusion-37.0.0/.github/workflows/pr_comment.yml):
> >>> false
> >>> > 2 unapproved licences. Check rat report: rat.txt
> >>> > (base) andy@ripper:~/git/apache/arrow-datafusion$
> >>> >
> >>>
> GH_TOKEN=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
> >>> > ./dev/release/create-tarball.sh 37.0.0 2
> >>> > Attempting to create  from tag 37.0.0-rc2
> >>> > Draft email for dev@arrow.apache.org mailing list
> >>> >
> >>> > -
> >

[VOTE][RUST][DataFusion] Release Apache Arrow DataFusion 37.0.0 RC2

2024-03-31 Thread Andy Grove
Subject: [VOTE][RUST][DataFusion] Release Apache Arrow DataFusion 37.0.0 RC2
Hi,

I would like to propose a release of Apache Arrow DataFusion Implementation,
version 37.0.0.

This release candidate is based on commit:
1fa25ae5d50c5f34f17e77e9f635f854ef5e7642 [1]
The proposed release tarball and signatures are hosted at [2].
The changelog is located at [3].

Please download, verify checksums and signatures, run the unit tests, and
vote
on the release. The vote will be open for at least 72 hours.

Only votes from PMC members are binding, but all members of the community
are
encouraged to test the release and vote with "(non-binding)".

The standard verification procedure is documented at
https://github.com/apache/arrow-datafusion/blob/main/dev/release/README.md#verifying-release-candidates
.

[ ] +1 Release this as Apache Arrow DataFusion 37.0.0
[ ] +0
[ ] -1 Do not release this as Apache Arrow DataFusion 37.0.0 because...

Here is my vote:

+1

[1]:
https://github.com/apache/arrow-datafusion/tree/1fa25ae5d50c5f34f17e77e9f635f854ef5e7642
[2]:
https://dist.apache.org/repos/dist/dev/arrow/apache-arrow-datafusion-37.0.0-rc2
[3]:
https://github.com/apache/arrow-datafusion/blob/1fa25ae5d50c5f34f17e77e9f635f854ef5e7642/CHANGELOG.md
-
Running rat license checker on
/home/andy/git/apache/arrow-datafusion/dev/dist/apache-arrow-datafusion-37.0.0-rc2/apache-arrow-datafusion-37.0.0.tar.gz
NOT APPROVED: .github/workflows/pr_benchmarks.yml
(apache-arrow-datafusion-37.0.0/.github/workflows/pr_benchmarks.yml): false
NOT APPROVED: .github/workflows/pr_comment.yml
(apache-arrow-datafusion-37.0.0/.github/workflows/pr_comment.yml): false
2 unapproved licences. Check rat report: rat.txt
(base) andy@ripper:~/git/apache/arrow-datafusion$
GH_TOKEN=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
./dev/release/create-tarball.sh 37.0.0 2
Attempting to create  from tag 37.0.0-rc2
Draft email for dev@arrow.apache.org mailing list

-
To: dev@arrow.apache.org
Hi,

I would like to propose a release of Apache Arrow DataFusion Implementation,
version 37.0.0.

This release candidate is based on commit:
1fa25ae5d50c5f34f17e77e9f635f854ef5e7642 [1]
The proposed release tarball and signatures are hosted at [2].
The changelog is located at [3].

Please download, verify checksums and signatures, run the unit tests, and
vote
on the release. The vote will be open for at least 72 hours.

Only votes from PMC members are binding, but all members of the community
are
encouraged to test the release and vote with "(non-binding)".

The standard verification procedure is documented at
https://github.com/apache/arrow-datafusion/blob/main/dev/release/README.md#verifying-release-candidates
.

[ ] +1 Release this as Apache Arrow DataFusion 37.0.0
[ ] +0
[ ] -1 Do not release this as Apache Arrow DataFusion 37.0.0 because...

Here is my vote:

+1 (verified on Ubuntu)

[1]:
https://github.com/apache/arrow-datafusion/tree/1fa25ae5d50c5f34f17e77e9f635f854ef5e7642
[2]:
https://dist.apache.org/repos/dist/dev/arrow/apache-arrow-datafusion-37.0.0-rc2
[3]:
https://github.com/apache/arrow-datafusion/blob/1fa25ae5d50c5f34f17e77e9f635f854ef5e7642/CHANGELOG.md


Arrow board report due April 10

2024-03-30 Thread Andy Grove
It is time for us to submit another board report. I have created a Google
document [1], which is currently a copy of the previous report, so that we
can collaborate on preparing this.

I would appreciate it if contributors of each subproject could update the
relevant section of the report.

Thanks,

Andy.

[1]
https://docs.google.com/document/d/1q6uBW4MNijY8cThZf0d-XZTcCxAOhUgp1sUG6_eOBMY/edit?usp=sharing


Re: [VOTE][RUST][DataFusion] Release Apache Arrow DataFusion 37.0.0 RC1

2024-03-28 Thread Andy Grove
Yes, I also saw this (both on Ubuntu and Mac), and I had to set
RUST_MIN_STACK=300 for the tests to pass.

I filed https://github.com/apache/arrow-datafusion/issues/9848 to improve
the release verification documentation to mention this.


On Thu, Mar 28, 2024 at 7:59 PM L. C. Hsieh  wrote:

> I got the following error when running verify-release-candidate.sh:
>
> thread 'tpcds_physical_q54' has overflowed its stack
> fatal runtime error: stack overflow
> error: test failed, to rerun pass `-p datafusion --test tpcds_planning`
>
>
> On Thu, Mar 28, 2024 at 4:22 PM Andy Grove  wrote:
> >
> > Hi,
> >
> > I would like to propose a release of Apache Arrow DataFusion
> Implementation,
> > version 37.0.0.
> >
> > This release candidate is based on commit:
> > 799be5e76bd631608b2357dbbe600afc2cebc359 [1]
> > The proposed release tarball and signatures are hosted at [2].
> > The changelog is located at [3].
> >
> > Please download, verify checksums and signatures, run the unit tests, and
> > vote
> > on the release. The vote will be open for at least 72 hours.
> >
> > Only votes from PMC members are binding, but all members of the community
> > are
> > encouraged to test the release and vote with "(non-binding)".
> >
> > The standard verification procedure is documented at
> >
> https://github.com/apache/arrow-datafusion/blob/main/dev/release/README.md#verifying-release-candidates
> > .
> >
> > [ ] +1 Release this as Apache Arrow DataFusion 37.0.0
> > [ ] +0
> > [ ] -1 Do not release this as Apache Arrow DataFusion 37.0.0 because...
> >
> > Here is my vote:
> >
> > +1
> >
> > *NOTE: I had to set RUST_MIN_STACK=300 for the tests to pass.*
> >
> > [1]:
> >
> https://github.com/apache/arrow-datafusion/tree/799be5e76bd631608b2357dbbe600afc2cebc359
> > [2]:
> >
> https://dist.apache.org/repos/dist/dev/arrow/apache-arrow-datafusion-37.0.0-rc1
> > [3]:
> >
> https://github.com/apache/arrow-datafusion/blob/799be5e76bd631608b2357dbbe600afc2cebc359/CHANGELOG.md
>


[VOTE][RUST][DataFusion] Release Apache Arrow DataFusion 37.0.0 RC1

2024-03-28 Thread Andy Grove
Hi,

I would like to propose a release of Apache Arrow DataFusion Implementation,
version 37.0.0.

This release candidate is based on commit:
799be5e76bd631608b2357dbbe600afc2cebc359 [1]
The proposed release tarball and signatures are hosted at [2].
The changelog is located at [3].

Please download, verify checksums and signatures, run the unit tests, and
vote
on the release. The vote will be open for at least 72 hours.

Only votes from PMC members are binding, but all members of the community
are
encouraged to test the release and vote with "(non-binding)".

The standard verification procedure is documented at
https://github.com/apache/arrow-datafusion/blob/main/dev/release/README.md#verifying-release-candidates
.

[ ] +1 Release this as Apache Arrow DataFusion 37.0.0
[ ] +0
[ ] -1 Do not release this as Apache Arrow DataFusion 37.0.0 because...

Here is my vote:

+1

*NOTE: I had to set RUST_MIN_STACK=300 for the tests to pass.*

[1]:
https://github.com/apache/arrow-datafusion/tree/799be5e76bd631608b2357dbbe600afc2cebc359
[2]:
https://dist.apache.org/repos/dist/dev/arrow/apache-arrow-datafusion-37.0.0-rc1
[3]:
https://github.com/apache/arrow-datafusion/blob/799be5e76bd631608b2357dbbe600afc2cebc359/CHANGELOG.md


[RESULT][VOTE][RUST][DataFusion] Release DataFusion Python Bindings 36.0.0 RC1

2024-03-10 Thread Andy Grove
On Sun, Mar 10, 2024 at 5:26 PM Andy Grove  wrote:

> Thank you both. The vote passes with three binding +1 votes.
>
> The release is now complete.
>
> On Sun, Mar 10, 2024 at 4:24 PM QP Hou  wrote:
>
>> +1 (binding)
>>
>> On Sun, Mar 10, 2024 at 10:18 AM Andy Grove 
>> wrote:
>> >
>> > Bumping this email thread. We need one more +1 PMC vote.
>> >
>> > Thanks,
>> >
>> > Andy.
>> >
>> > On Sun, Mar 3, 2024 at 8:31 PM L. C. Hsieh  wrote:
>> >
>> > > +1 (binding)
>> > >
>> > > Verified on M3 Mac.
>> > >
>> > > Thanks Andy.
>> > >
>> > > On Sun, Mar 3, 2024 at 6:53 PM Andy Grove 
>> wrote:
>> > > >
>> > > > Hi,
>> > > >
>> > > > I would like to propose a release of Apache Arrow DataFusion Python
>> > > > Bindings,
>> > > > version 36.0.0.
>> > > >
>> > > > This release candidate is based on commit:
>> > > > 3a82be08c458358a3c07587c2b4d9ffbaf646ca2 [1]
>> > > > The proposed release tarball and signatures are hosted at [2].
>> > > > The changelog is located at [3].
>> > > > The Python wheels are located at [4].
>> > > >
>> > > > Please download, verify checksums and signatures, run the unit
>> tests, and
>> > > > vote
>> > > > on the release. The vote will be open for at least 72 hours.
>> > > >
>> > > > Only votes from PMC members are binding, but all members of the
>> community
>> > > > are
>> > > > encouraged to test the release and vote with "(non-binding)".
>> > > >
>> > > > The standard verification procedure is documented at
>> > > >
>> > >
>> https://github.com/apache/arrow-datafusion-python/blob/main/dev/release/README.md#verifying-release-candidates
>> > > > .
>> > > >
>> > > > [ ] +1 Release this as Apache Arrow DataFusion Python 36.0.0
>> > > > [ ] +0
>> > > > [ ] -1 Do not release this as Apache Arrow DataFusion Python 36.0.0
>> > > > because...
>> > > >
>> > > > Here is my vote:
>> > > >
>> > > > +1
>> > > >
>> > > > [1]:
>> > > >
>> > >
>> https://github.com/apache/arrow-datafusion-python/tree/3a82be08c458358a3c07587c2b4d9ffbaf646ca2
>> > > > [2]:
>> > > >
>> > >
>> https://dist.apache.org/repos/dist/dev/arrow/apache-arrow-datafusion-python-36.0.0-rc1
>> > > > [3]:
>> > > >
>> > >
>> https://github.com/apache/arrow-datafusion-python/blob/3a82be08c458358a3c07587c2b4d9ffbaf646ca2/CHANGELOG.md
>> > > > [4]: https://test.pypi.org/project/datafusion/36.0.0/
>> > >
>>
>


Re: [VOTE][RUST][DataFusion] Release DataFusion Python Bindings 36.0.0 RC1

2024-03-10 Thread Andy Grove
Thank you both. The vote passes with three binding +1 votes.

The release is now complete.

On Sun, Mar 10, 2024 at 4:24 PM QP Hou  wrote:

> +1 (binding)
>
> On Sun, Mar 10, 2024 at 10:18 AM Andy Grove  wrote:
> >
> > Bumping this email thread. We need one more +1 PMC vote.
> >
> > Thanks,
> >
> > Andy.
> >
> > On Sun, Mar 3, 2024 at 8:31 PM L. C. Hsieh  wrote:
> >
> > > +1 (binding)
> > >
> > > Verified on M3 Mac.
> > >
> > > Thanks Andy.
> > >
> > > On Sun, Mar 3, 2024 at 6:53 PM Andy Grove 
> wrote:
> > > >
> > > > Hi,
> > > >
> > > > I would like to propose a release of Apache Arrow DataFusion Python
> > > > Bindings,
> > > > version 36.0.0.
> > > >
> > > > This release candidate is based on commit:
> > > > 3a82be08c458358a3c07587c2b4d9ffbaf646ca2 [1]
> > > > The proposed release tarball and signatures are hosted at [2].
> > > > The changelog is located at [3].
> > > > The Python wheels are located at [4].
> > > >
> > > > Please download, verify checksums and signatures, run the unit
> tests, and
> > > > vote
> > > > on the release. The vote will be open for at least 72 hours.
> > > >
> > > > Only votes from PMC members are binding, but all members of the
> community
> > > > are
> > > > encouraged to test the release and vote with "(non-binding)".
> > > >
> > > > The standard verification procedure is documented at
> > > >
> > >
> https://github.com/apache/arrow-datafusion-python/blob/main/dev/release/README.md#verifying-release-candidates
> > > > .
> > > >
> > > > [ ] +1 Release this as Apache Arrow DataFusion Python 36.0.0
> > > > [ ] +0
> > > > [ ] -1 Do not release this as Apache Arrow DataFusion Python 36.0.0
> > > > because...
> > > >
> > > > Here is my vote:
> > > >
> > > > +1
> > > >
> > > > [1]:
> > > >
> > >
> https://github.com/apache/arrow-datafusion-python/tree/3a82be08c458358a3c07587c2b4d9ffbaf646ca2
> > > > [2]:
> > > >
> > >
> https://dist.apache.org/repos/dist/dev/arrow/apache-arrow-datafusion-python-36.0.0-rc1
> > > > [3]:
> > > >
> > >
> https://github.com/apache/arrow-datafusion-python/blob/3a82be08c458358a3c07587c2b4d9ffbaf646ca2/CHANGELOG.md
> > > > [4]: https://test.pypi.org/project/datafusion/36.0.0/
> > >
>


Re: [VOTE][RUST][DataFusion] Release DataFusion Python Bindings 36.0.0 RC1

2024-03-10 Thread Andy Grove
Bumping this email thread. We need one more +1 PMC vote.

Thanks,

Andy.

On Sun, Mar 3, 2024 at 8:31 PM L. C. Hsieh  wrote:

> +1 (binding)
>
> Verified on M3 Mac.
>
> Thanks Andy.
>
> On Sun, Mar 3, 2024 at 6:53 PM Andy Grove  wrote:
> >
> > Hi,
> >
> > I would like to propose a release of Apache Arrow DataFusion Python
> > Bindings,
> > version 36.0.0.
> >
> > This release candidate is based on commit:
> > 3a82be08c458358a3c07587c2b4d9ffbaf646ca2 [1]
> > The proposed release tarball and signatures are hosted at [2].
> > The changelog is located at [3].
> > The Python wheels are located at [4].
> >
> > Please download, verify checksums and signatures, run the unit tests, and
> > vote
> > on the release. The vote will be open for at least 72 hours.
> >
> > Only votes from PMC members are binding, but all members of the community
> > are
> > encouraged to test the release and vote with "(non-binding)".
> >
> > The standard verification procedure is documented at
> >
> https://github.com/apache/arrow-datafusion-python/blob/main/dev/release/README.md#verifying-release-candidates
> > .
> >
> > [ ] +1 Release this as Apache Arrow DataFusion Python 36.0.0
> > [ ] +0
> > [ ] -1 Do not release this as Apache Arrow DataFusion Python 36.0.0
> > because...
> >
> > Here is my vote:
> >
> > +1
> >
> > [1]:
> >
> https://github.com/apache/arrow-datafusion-python/tree/3a82be08c458358a3c07587c2b4d9ffbaf646ca2
> > [2]:
> >
> https://dist.apache.org/repos/dist/dev/arrow/apache-arrow-datafusion-python-36.0.0-rc1
> > [3]:
> >
> https://github.com/apache/arrow-datafusion-python/blob/3a82be08c458358a3c07587c2b4d9ffbaf646ca2/CHANGELOG.md
> > [4]: https://test.pypi.org/project/datafusion/36.0.0/
>


[VOTE][RUST][DataFusion] Release DataFusion Python Bindings 36.0.0 RC1

2024-03-03 Thread Andy Grove
Hi,

I would like to propose a release of Apache Arrow DataFusion Python
Bindings,
version 36.0.0.

This release candidate is based on commit:
3a82be08c458358a3c07587c2b4d9ffbaf646ca2 [1]
The proposed release tarball and signatures are hosted at [2].
The changelog is located at [3].
The Python wheels are located at [4].

Please download, verify checksums and signatures, run the unit tests, and
vote
on the release. The vote will be open for at least 72 hours.

Only votes from PMC members are binding, but all members of the community
are
encouraged to test the release and vote with "(non-binding)".

The standard verification procedure is documented at
https://github.com/apache/arrow-datafusion-python/blob/main/dev/release/README.md#verifying-release-candidates
.

[ ] +1 Release this as Apache Arrow DataFusion Python 36.0.0
[ ] +0
[ ] -1 Do not release this as Apache Arrow DataFusion Python 36.0.0
because...

Here is my vote:

+1

[1]:
https://github.com/apache/arrow-datafusion-python/tree/3a82be08c458358a3c07587c2b4d9ffbaf646ca2
[2]:
https://dist.apache.org/repos/dist/dev/arrow/apache-arrow-datafusion-python-36.0.0-rc1
[3]:
https://github.com/apache/arrow-datafusion-python/blob/3a82be08c458358a3c07587c2b4d9ffbaf646ca2/CHANGELOG.md
[4]: https://test.pypi.org/project/datafusion/36.0.0/


Re: [VOTE] Move Arrow DataFusion Subproject to new Top Level Apache Project

2024-03-01 Thread Andy Grove
+1 (binding)

On Fri, Mar 1, 2024 at 6:20 AM Weston Pace  wrote:

> +1 (binding)
>
> On Fri, Mar 1, 2024 at 3:33 AM Andrew Lamb  wrote:
>
> > Hello,
> >
> > As we have discussed[1][2] I would like to vote on the proposal to
> > create a new Apache Top Level Project for DataFusion. The text of the
> > proposed resolution and background document is copy/pasted below
> >
> > If the community is in favor of this, we plan to submit the resolution
> > to the ASF board for approval with the next Arrow report (for the
> > April 2024 board meeting).
> >
> > The vote will be open for at least 7 days.
> >
> > [ ] +1 Accept this Proposal
> > [ ] +0
> > [ ] -1 Do not accept this proposal because...
> >
> > Andrew
> >
> > [1] https://lists.apache.org/thread/c150t1s1x0kcb3r03cjyx31kqs5oc341
> > [2] https://github.com/apache/arrow-datafusion/discussions/6475
> >
> > -- Proposed Resolution -
> >
> > Resolution to Create the Apache DataFusion Project from the Apache
> > Arrow DataFusion Sub Project
> >
> > =
> >
> > X. Establish the Apache DataFusion Project
> >
> > WHEREAS, the Board of Directors deems it to be in the best
> > interests of the Foundation and consistent with the
> > Foundation's purpose to establish a Project Management
> > Committee charged with the creation and maintenance of
> > open-source software related to an extensible query engine
> > for distribution at no charge to the public.
> >
> > NOW, THEREFORE, BE IT RESOLVED, that a Project Management
> > Committee (PMC), to be known as the "Apache DataFusion Project",
> > be and hereby is established pursuant to Bylaws of the
> > Foundation; and be it further
> >
> > RESOLVED, that the Apache DataFusion Project be and hereby is
> > responsible for the creation and maintenance of software
> > related to an extensible query engine; and be it further
> >
> > RESOLVED, that the office of "Vice President, Apache DataFusion" be
> > and hereby is created, the person holding such office to
> > serve at the direction of the Board of Directors as the chair
> > of the Apache DataFusion Project, and to have primary responsibility
> > for management of the projects within the scope of
> > responsibility of the Apache DataFusion Project; and be it further
> >
> > RESOLVED, that the persons listed immediately below be and
> > hereby are appointed to serve as the initial members of the
> > Apache DataFusion Project:
> >
> > * Andy Grove (agr...@apache.org)
> > * Andrew Lamb (al...@apache.org)
> > * Daniël Heres (dhe...@apache.org)
> > * Jie Wen (jake...@apache.org)
> > * Kun Liu (liu...@apache.org)
> > * Liang-Chi Hsieh (vii...@apache.org)
> > * Qingping Hou: (ho...@apache.org)
> > * Wes McKinney(w...@apache.org)
> > * Will Jones (wjones...@apache.org)
> >
> > RESOLVED, that the Apache DataFusion Project be and hereby
> > is tasked with the migration and rationalization of the Apache
> > Arrow DataFusion sub-project; and be it further
> >
> > RESOLVED, that all responsibilities pertaining to the Apache
> > Arrow DataFusion sub-project encumbered upon the
> > Apache Arrow Project are hereafter discharged.
> >
> > NOW, THEREFORE, BE IT FURTHER RESOLVED, that Andrew Lamb
> > be appointed to the office of Vice President, Apache DataFusion, to
> > serve in accordance with and subject to the direction of the
> > Board of Directors and the Bylaws of the Foundation until
> > death, resignation, retirement, removal or disqualification,
> > or until a successor is appointed.
> > =
> >
> >
> > ---
> >
> >
> > Summary:
> >
> > We propose creating a new top level project, Apache DataFusion, from
> > an existing sub project of Apache Arrow to facilitate additional
> > community and project growth.
> >
> > Abstract
> >
> > Apache Arrow DataFusion[1]  is a very fast, extensible query engine
> > for building high-quality data-centric systems in Rust, using the
> > Apache Arrow in-memory format. DataFusion offers SQL and Dataframe
> > APIs, excellent performance, built-in support for CSV, Parquet, JSON,
> > and Avro, extensive customization, and a great community.
> >
> > [1] https://arrow.apache.org/datafusion/
> >
> >
> > Proposal
> >
> > We propose creating a new top leve

Re: [DISCUSS] Move sqlparser-rs back into DataFusion project?

2024-02-29 Thread Andy Grove
I will put this proposal on hold for now and restart the conversation later
this year once DataFusion is a top-level ASF project.

Thanks again for all the feedback.

Andy.

On Wed, Feb 28, 2024 at 9:58 AM Andy Grove  wrote:

> Thanks for all the feedback so far.
>
> It does seem that the least contentious way to do this would be to follow
> Andrew's suggestion of having a separate
> apache/[arrow-]datafusion-sqlparser repository as this will ensure that we
> do not end up adding any DataFusion dependencies to the sqlparser project,
> and that it continues to have its own release process.
>
> The main benefit here is that it would bring it under ASF governance and
> allow those who have permission from their employers to contribute to
> Apache Arrow/DataFusion to be able to help with the maintenance burden.
>
> Andy.
>
>
>
> On Wed, Feb 28, 2024 at 4:28 AM Andrew Lamb  wrote:
>
>> One potential way "moving sqlparser-rs into DataFusion" could look is that
>> code/repo is moved from the sqlparser-rs [1] organization to the apache
>> organization. For example
>>
>> https://github.com/sqlparser-rs/sqlparser-rs
>> to
>> https://github.com/apache/datafusion-sqlparser
>>
>> We could continue development separately from any other code, release it
>> as
>> a separate artifact, but use the same overarching governance structure
>> (voting on releases, committer access, etc)
>>
>> To follow this model, I think the largest work item would be to run the IP
>> clearance process, and since sqlparser-rs has many distinct contributors
>> that may take a while
>>
>> Andrew
>>
>>
>>
>> On Wed, Feb 28, 2024 at 1:45 AM Aldrin 
>> wrote:
>>
>> > Maybe it would be valuable to more explicitly define "moving back into
>> > DataFusion project".
>> >
>> > I assumed it meant absorbing into the datafusion repo, but it occurs to
>> me
>> > that may not be the case. Then, how would sqlparser-rs be "moved"?
>> >
>> >
>> >
>> > # --
>> > # Aldrin
>> >
>> >
>> > https://github.com/drin/
>> > https://gitlab.com/octalene
>> > https://keybase.io/octalene
>> >
>> >
>> > On Tuesday, February 27th, 2024 at 16:20, Chak-Pong Chung <
>> > chakpongch...@gmail.com> wrote:
>> >
>> > > There are cases where people need datafusion but not a SQL parser. For
>> > > example, people building a composable query engine for graph or other
>> > data
>> > > modality may not choose SQL as the DSL. Decoupling them seems to be a
>> > good
>> > > idea.
>> > >
>> >
>> > > On Tue, Feb 27, 2024, 6:20 AM Mehmet Ozan Kabak o...@synnada.ai
>> wrote:
>> > >
>> >
>> > > > In this case, maybe we can bring sqlparser-rs into the ASF umbrella
>> > > > following the arrow-datafusion model?
>> > > >
>> >
>> > > > Once DataFusion becomes a top-level project, we could move it to
>> > > > datafusion-sqlparser-rs — it would be a quasi-independent project
>> just
>> > like
>> > > > how DataFusion is today w.r.t. Arrow. But it would get most
>> benefits of
>> > > > having a community behind it.
>> > > >
>> >
>> > > > > On Feb 27, 2024, at 2:11 AM, Andrew Lamb al...@influxdata.com
>> wrote:
>> > > > >
>> >
>> > > > > Julian, thank you for your insight. I very much agree with it.
>> > > > >
>> >
>> > > > > > I think the ASF is wrong on this. I think it needs to provide a
>> > home
>> > > > > > for medium-sized projects such as sqlparser-rs in an existing
>> > > > > > top-level project;
>> > > > >
>> >
>> > > > > It could be said that DataFusion fits this model -- it isn't
>> really
>> > an
>> > > > > "Arrow" project but needed a place to live and grow, and the Arrow
>> > ASF
>> > > > > community provided that.
>> > > > >
>> >
>> > > > > Andrew
>> > > > >
>> >
>> > > > > On Mon, Feb 26, 2024 at 1:09 PM Julian Hyde jh...@apache.org
>> wrote:
>> > > > >
>> >
>> > > > > > I am torn on this.
>> > > > > >
>> >
>> > &

Re: [DISCUSS] Move sqlparser-rs back into DataFusion project?

2024-02-28 Thread Andy Grove
ot much overlap between DataFusion and
> sqlparser-rs
> > > > > > users - but it takes a lot of effort to create and run a
> top-level
> > > > > > project, and DataFusion is already up and running.
> > > > > >
> >
> > > > > > The tension is that people want to consume components that they
> > > > > > perceive to be standalone, and yet the ASF wants to create
> > communities
> > > > > > that produce either a single large component or sets of
> > highly-coupled
> > > > > > components. The ASF used to do 'umbrella projects' whose
> > sub-projects
> > > > > > were in the same subject area but had little or no dependencies.
> > For
> > > > > > example, Apache DB [ https://db.apache.org/ ] has JDO, Derby and
> > > > > > Torque. And commons included many useful Java libraries. Umbrella
> > > > > > projects caused problems during the Jakarta and Hadoop eras, and
> > now
> > > > > > are strongly discouraged at the ASF.
> > > > > >
> >
> > > > > > I think the ASF is wrong on this. I think it needs to provide a
> > home
> > > > > > for medium-sized projects such as sqlparser-rs in an existing
> > > > > > top-level project; maybe those projects grow into top-level
> > projects,
> > > > > > or maybe they remain medium-sized projects. This is especially
> > > > > > necessary in the Rust community, where there are many exciting
> > > > > > projects, but they are almost all happening outside ASF. (This is
> > > > > > exactly where Java was in ~2005. Maybe we need a rust-commons or
> > > > > > rust-db?)
> > > > > >
> >
> > > > > > My conclusion is to leave sqlparser-rs where it is for now, but
> to
> > > > > > continue talking about what might be an attractive home for it in
> > ASF.
> > > > > >
> >
> > > > > > Julian
> > > > > >
> >
> > > > > > On Mon, Feb 26, 2024 at 8:12 AM Andrew Lamb al...@influxdata.com
> > > > > > wrote:
> > > > > >
> >
> > > > > > > Sorry for the late reply,
> > > > > > >
> >
> > > > > > > I think sqlparser-rs users are quite a bit more varied than
> > DataFusion
> > > > > > > and
> > > > > > > there is not a large overlap between the contributors of the
> two
> > > > > > > projects.
> > > > > > > I currently seem to be the one reviewing / merging most
> > sqlparser-rs
> > > > > > > reviews, and I would definitely love some more help.
> > > > > > >
> >
> > > > > > > However, given that the project is not an Apache project, I did
> > not
> > > > > > > have
> > > > > > > good luck attracting help. A related discussion is here 1.
> > > > > > >
> >
> > > > > > > If the DataFusion community would like to accelerate releases,
> > we can
> > > > > > > also
> > > > > > > try to do that without bringing it into Apache governance.
> > > > > > > Specifically,
> > > > > > > it
> > > > > > > would be great to have help reviewing the PRs -- the actual
> > release
> > > > > > > process
> > > > > > > is pretty low overhead. The reviews are what take the vast
> > majority of
> > > > > > > the
> > > > > > > maintenance time.
> > > > > > >
> >
> > > > > > > Andrew
> > > > > > >
> >
> > > > > > > On Sat, Feb 17, 2024 at 4:44 PM Aldrin
> octalene@pm.me.invalid
> > > > > > > wrote:
> > > > > > >
> >
> > > > > > > > do users of sqlparser-rs mostly use datafusion? I don't know
> > the
> > > > > > > > community, but it seems like it would be an annoying change
> > for users
> > > > > > > > who
> > > > > > > > use it with a different query engine. Just a thought
> > > > > > > >
> >
> > > > > > > > Sent from Proton Mail https://proton.me/mail/home for iOS
> > >

Re: [VOTE][RUST][DataFusion] Release Apache Arrow DataFusion 36.0.1 RC1

2024-02-25 Thread Andy Grove
The vote passes with 3 binding +1 votes. Thanks, everyone.

Unfortunately, in the rush to make this patch release, I forgot to actually
update the version numbers in the crates, so I am unable to publish this to
crates.io.

Given that nobody has complained about the circular dependency in 36.0.0, I
think I will just wait until we release 37.0.0 to resolve this.

Andy.

On Tue, Feb 20, 2024 at 12:00 AM vin jake  wrote:

> +1 (binding)
>
> Verified on my M1 Macbook.
>
> Thanks Andy
>
> On Tue, Feb 20, 2024 at 5:11 AM Andy Grove  wrote:
>
> > Hi,
> >
> > I would like to propose a release of Apache Arrow DataFusion, version
> > 36.0.1.
> >
> > *This is a patch release to fix an issue that prevented 36.0.0 from being
> > published to crates.io <http://crates.io>*
> >
> > This release candidate is based on commit:
> > e53ccd0756644e6522e6f8c41c4497b47e4f4ceb [1]
> > The proposed release tarball and signatures are hosted at [2].
> > The changelog is located at [3].
> >
> > Please download, verify checksums and signatures, run the unit tests, and
> > vote
> > on the release. The vote will be open for at least 72 hours.
> >
> > Only votes from PMC members are binding, but all members of the community
> > are
> > encouraged to test the release and vote with "(non-binding)".
> >
> > The standard verification procedure is documented at
> >
> >
> https://github.com/apache/arrow-datafusion/blob/main/dev/release/README.md#verifying-release-candidates
> > .
> >
> > [ ] +1 Release this as Apache Arrow DataFusion 36.0.1
> > [ ] +0
> > [ ] -1 Do not release this as Apache Arrow DataFusion 36.0.1 because...
> >
> > Here is my vote:
> >
> > +1
> >
> > [1]:
> >
> >
> https://github.com/apache/arrow-datafusion/tree/e53ccd0756644e6522e6f8c41c4497b47e4f4ceb
> > [2]:
> >
> >
> https://dist.apache.org/repos/dist/dev/arrow/apache-arrow-datafusion-36.0.1-rc1
> > [3]:
> >
> >
> https://github.com/apache/arrow-datafusion/blob/e53ccd0756644e6522e6f8c41c4497b47e4f4ceb/CHANGELOG.md
> >
>


[RESULT][VOTE][RUST][DataFusion] Release Apache Arrow DataFusion 36.0.0 RC1

2024-02-20 Thread Andy Grove
On Tue, Feb 20, 2024 at 7:08 AM Andy Grove  wrote:

> I went ahead and published with --no-verify, and I started a vote on
> 36.0.1 to resolve the circular dependency issue
>
> On Mon, Feb 19, 2024 at 12:16 PM Andy Grove  wrote:
>
>> The vote passes with three binding votes. Thank you all.
>>
>> The source release has been published, but unfortunately, I cannot
>> publish it to crates.io because a circular dependency has been
>> introduced between crates. I have filed an issue to track this.
>>
>> https://github.com/apache/arrow-datafusion/issues/9277
>>
>> I propose that we invest in a CI check for this somehow, as we have seen
>> this at least once before, and this kind of change can potentially be
>> disruptive to undo. I have filed an issue for this as well:
>>
>> https://github.com/apache/arrow-datafusion/issues/9278
>>
>> Thanks,
>>
>> Andy.
>>
>> On Sat, Feb 17, 2024 at 2:25 AM Andrew Lamb  wrote:
>>
>>> +1 (binding)
>>>
>>> Verified on M3 Mac
>>>
>>> Thank you for keeping the release training humming Andy
>>>
>>> Andrew
>>>
>>> On Fri, Feb 16, 2024 at 12:23 PM L. C. Hsieh  wrote:
>>>
>>> > +1 (binding)
>>> >
>>> > Verified on M3 Mac.
>>> >
>>> > Thanks Andy.
>>> >
>>> >
>>> > On Fri, Feb 16, 2024 at 9:08 AM Andy Grove 
>>> wrote:
>>> > >
>>> > > Hi,
>>> > >
>>> > > I would like to propose a release of Apache Arrow DataFusion
>>> > Implementation,
>>> > > version 36.0.0.
>>> > >
>>> > > This release candidate is based on commit:
>>> > > bf6f83b3d228fb386f9b4b20c254fa58e2412660 [1]
>>> > > The proposed release tarball and signatures are hosted at [2].
>>> > > The changelog is located at [3].
>>> > >
>>> > > Please download, verify checksums and signatures, run the unit
>>> tests, and
>>> > > vote
>>> > > on the release. The vote will be open for at least 72 hours.
>>> > >
>>> > > Only votes from PMC members are binding, but all members of the
>>> community
>>> > > are
>>> > > encouraged to test the release and vote with "(non-binding)".
>>> > >
>>> > > The standard verification procedure is documented at
>>> > >
>>> >
>>> https://github.com/apache/arrow-datafusion/blob/main/dev/release/README.md#verifying-release-candidates
>>> > > .
>>> > >
>>> > > [ ] +1 Release this as Apache Arrow DataFusion 36.0.0
>>> > > [ ] +0
>>> > > [ ] -1 Do not release this as Apache Arrow DataFusion 36.0.0
>>> because...
>>> > >
>>> > > Here is my vote:
>>> > >
>>> > > +1
>>> > >
>>> > > [1]:
>>> > >
>>> >
>>> https://github.com/apache/arrow-datafusion/tree/bf6f83b3d228fb386f9b4b20c254fa58e2412660
>>> > > [2]:
>>> > >
>>> >
>>> https://dist.apache.org/repos/dist/dev/arrow/apache-arrow-datafusion-36.0.0-rc1
>>> > > [3]:
>>> > >
>>> >
>>> https://github.com/apache/arrow-datafusion/blob/bf6f83b3d228fb386f9b4b20c254fa58e2412660/CHANGELOG.md
>>> >
>>>
>>


Re: [VOTE][RUST][DataFusion] Release Apache Arrow DataFusion 36.0.0 RC1

2024-02-20 Thread Andy Grove
I went ahead and published with --no-verify, and I started a vote on 36.0.1
to resolve the circular dependency issue

On Mon, Feb 19, 2024 at 12:16 PM Andy Grove  wrote:

> The vote passes with three binding votes. Thank you all.
>
> The source release has been published, but unfortunately, I cannot publish
> it to crates.io because a circular dependency has been introduced between
> crates. I have filed an issue to track this.
>
> https://github.com/apache/arrow-datafusion/issues/9277
>
> I propose that we invest in a CI check for this somehow, as we have seen
> this at least once before, and this kind of change can potentially be
> disruptive to undo. I have filed an issue for this as well:
>
> https://github.com/apache/arrow-datafusion/issues/9278
>
> Thanks,
>
> Andy.
>
> On Sat, Feb 17, 2024 at 2:25 AM Andrew Lamb  wrote:
>
>> +1 (binding)
>>
>> Verified on M3 Mac
>>
>> Thank you for keeping the release training humming Andy
>>
>> Andrew
>>
>> On Fri, Feb 16, 2024 at 12:23 PM L. C. Hsieh  wrote:
>>
>> > +1 (binding)
>> >
>> > Verified on M3 Mac.
>> >
>> > Thanks Andy.
>> >
>> >
>> > On Fri, Feb 16, 2024 at 9:08 AM Andy Grove 
>> wrote:
>> > >
>> > > Hi,
>> > >
>> > > I would like to propose a release of Apache Arrow DataFusion
>> > Implementation,
>> > > version 36.0.0.
>> > >
>> > > This release candidate is based on commit:
>> > > bf6f83b3d228fb386f9b4b20c254fa58e2412660 [1]
>> > > The proposed release tarball and signatures are hosted at [2].
>> > > The changelog is located at [3].
>> > >
>> > > Please download, verify checksums and signatures, run the unit tests,
>> and
>> > > vote
>> > > on the release. The vote will be open for at least 72 hours.
>> > >
>> > > Only votes from PMC members are binding, but all members of the
>> community
>> > > are
>> > > encouraged to test the release and vote with "(non-binding)".
>> > >
>> > > The standard verification procedure is documented at
>> > >
>> >
>> https://github.com/apache/arrow-datafusion/blob/main/dev/release/README.md#verifying-release-candidates
>> > > .
>> > >
>> > > [ ] +1 Release this as Apache Arrow DataFusion 36.0.0
>> > > [ ] +0
>> > > [ ] -1 Do not release this as Apache Arrow DataFusion 36.0.0
>> because...
>> > >
>> > > Here is my vote:
>> > >
>> > > +1
>> > >
>> > > [1]:
>> > >
>> >
>> https://github.com/apache/arrow-datafusion/tree/bf6f83b3d228fb386f9b4b20c254fa58e2412660
>> > > [2]:
>> > >
>> >
>> https://dist.apache.org/repos/dist/dev/arrow/apache-arrow-datafusion-36.0.0-rc1
>> > > [3]:
>> > >
>> >
>> https://github.com/apache/arrow-datafusion/blob/bf6f83b3d228fb386f9b4b20c254fa58e2412660/CHANGELOG.md
>> >
>>
>


[VOTE][RUST][DataFusion] Release Apache Arrow DataFusion 36.0.1 RC1

2024-02-19 Thread Andy Grove
Hi,

I would like to propose a release of Apache Arrow DataFusion, version
36.0.1.

*This is a patch release to fix an issue that prevented 36.0.0 from being
published to crates.io *

This release candidate is based on commit:
e53ccd0756644e6522e6f8c41c4497b47e4f4ceb [1]
The proposed release tarball and signatures are hosted at [2].
The changelog is located at [3].

Please download, verify checksums and signatures, run the unit tests, and
vote
on the release. The vote will be open for at least 72 hours.

Only votes from PMC members are binding, but all members of the community
are
encouraged to test the release and vote with "(non-binding)".

The standard verification procedure is documented at
https://github.com/apache/arrow-datafusion/blob/main/dev/release/README.md#verifying-release-candidates
.

[ ] +1 Release this as Apache Arrow DataFusion 36.0.1
[ ] +0
[ ] -1 Do not release this as Apache Arrow DataFusion 36.0.1 because...

Here is my vote:

+1

[1]:
https://github.com/apache/arrow-datafusion/tree/e53ccd0756644e6522e6f8c41c4497b47e4f4ceb
[2]:
https://dist.apache.org/repos/dist/dev/arrow/apache-arrow-datafusion-36.0.1-rc1
[3]:
https://github.com/apache/arrow-datafusion/blob/e53ccd0756644e6522e6f8c41c4497b47e4f4ceb/CHANGELOG.md


Re: [VOTE][RUST][DataFusion] Release Apache Arrow DataFusion 36.0.0 RC1

2024-02-19 Thread Andy Grove
The vote passes with three binding votes. Thank you all.

The source release has been published, but unfortunately, I cannot publish
it to crates.io because a circular dependency has been introduced between
crates. I have filed an issue to track this.

https://github.com/apache/arrow-datafusion/issues/9277

I propose that we invest in a CI check for this somehow, as we have seen
this at least once before, and this kind of change can potentially be
disruptive to undo. I have filed an issue for this as well:

https://github.com/apache/arrow-datafusion/issues/9278

Thanks,

Andy.

On Sat, Feb 17, 2024 at 2:25 AM Andrew Lamb  wrote:

> +1 (binding)
>
> Verified on M3 Mac
>
> Thank you for keeping the release training humming Andy
>
> Andrew
>
> On Fri, Feb 16, 2024 at 12:23 PM L. C. Hsieh  wrote:
>
> > +1 (binding)
> >
> > Verified on M3 Mac.
> >
> > Thanks Andy.
> >
> >
> > On Fri, Feb 16, 2024 at 9:08 AM Andy Grove 
> wrote:
> > >
> > > Hi,
> > >
> > > I would like to propose a release of Apache Arrow DataFusion
> > Implementation,
> > > version 36.0.0.
> > >
> > > This release candidate is based on commit:
> > > bf6f83b3d228fb386f9b4b20c254fa58e2412660 [1]
> > > The proposed release tarball and signatures are hosted at [2].
> > > The changelog is located at [3].
> > >
> > > Please download, verify checksums and signatures, run the unit tests,
> and
> > > vote
> > > on the release. The vote will be open for at least 72 hours.
> > >
> > > Only votes from PMC members are binding, but all members of the
> community
> > > are
> > > encouraged to test the release and vote with "(non-binding)".
> > >
> > > The standard verification procedure is documented at
> > >
> >
> https://github.com/apache/arrow-datafusion/blob/main/dev/release/README.md#verifying-release-candidates
> > > .
> > >
> > > [ ] +1 Release this as Apache Arrow DataFusion 36.0.0
> > > [ ] +0
> > > [ ] -1 Do not release this as Apache Arrow DataFusion 36.0.0 because...
> > >
> > > Here is my vote:
> > >
> > > +1
> > >
> > > [1]:
> > >
> >
> https://github.com/apache/arrow-datafusion/tree/bf6f83b3d228fb386f9b4b20c254fa58e2412660
> > > [2]:
> > >
> >
> https://dist.apache.org/repos/dist/dev/arrow/apache-arrow-datafusion-36.0.0-rc1
> > > [3]:
> > >
> >
> https://github.com/apache/arrow-datafusion/blob/bf6f83b3d228fb386f9b4b20c254fa58e2412660/CHANGELOG.md
> >
>


Re: [DISCUSS] Move sqlparser-rs back into DataFusion project?

2024-02-17 Thread Andy Grove
I agree that it simplifies shipping new SQL features in DataFusion since we
can develop the changes in the parser concurrently with the changes in
other DataFusion crates and then release them all together.

The name of the crate would not need to change, so downstream users should
see no impact.

We would need to decide if we want to keep a separate version number or
bring it in line with DataFusion version numbers (I have no preference
either way).



On Sat, Feb 17, 2024 at 11:09 AM Mehmet Ozan Kabak  wrote:

> Doing this will probably reduce the time-to-ship for DataFusion features
> that need parsing support due to increased convenience, so I’m inclined to
> see it in a positive light.
>
> What would be the impact of doing this on people who use only
> sqlparser-rs, if any?
>
> > On Feb 17, 2024, at 7:16 PM, Andy Grove  wrote:
> >
> > The sqlparser-rs project [1] seems to have become the de-facto SQL parser
> > for Rust, with almost 4 million downloads so far. This was originally
> part
> > of DataFusion very early on, and I moved it into a separate project
> because
> > it seemed useful for other projects. This was before DataFusion was known
> > as a composable query engine, and with hindsight, I probably should have
> > left it as part of the DataFusion project.
> >
> > Now that DataFusion has a reputation as a composable query engine, I
> think
> > it would make sense to move this code back into DataFusion, where it
> would
> > benefit from a larger community of maintainers.
> >
> > I would like to hear thoughts from the Apache Arrow / DataFusion
> community.
> > Does this seem like a good idea?
> >
> > Thanks,
> >
> > Andy.
> >
> > [1] https://github.com/sqlparser-rs/sqlparser-rs
>
>


[DISCUSS] Move sqlparser-rs back into DataFusion project?

2024-02-17 Thread Andy Grove
The sqlparser-rs project [1] seems to have become the de-facto SQL parser
for Rust, with almost 4 million downloads so far. This was originally part
of DataFusion very early on, and I moved it into a separate project because
it seemed useful for other projects. This was before DataFusion was known
as a composable query engine, and with hindsight, I probably should have
left it as part of the DataFusion project.

Now that DataFusion has a reputation as a composable query engine, I think
it would make sense to move this code back into DataFusion, where it would
benefit from a larger community of maintainers.

I would like to hear thoughts from the Apache Arrow / DataFusion community.
Does this seem like a good idea?

Thanks,

Andy.

[1] https://github.com/sqlparser-rs/sqlparser-rs


[VOTE][RUST][DataFusion] Release Apache Arrow DataFusion 36.0.0 RC1

2024-02-16 Thread Andy Grove
Hi,

I would like to propose a release of Apache Arrow DataFusion Implementation,
version 36.0.0.

This release candidate is based on commit:
bf6f83b3d228fb386f9b4b20c254fa58e2412660 [1]
The proposed release tarball and signatures are hosted at [2].
The changelog is located at [3].

Please download, verify checksums and signatures, run the unit tests, and
vote
on the release. The vote will be open for at least 72 hours.

Only votes from PMC members are binding, but all members of the community
are
encouraged to test the release and vote with "(non-binding)".

The standard verification procedure is documented at
https://github.com/apache/arrow-datafusion/blob/main/dev/release/README.md#verifying-release-candidates
.

[ ] +1 Release this as Apache Arrow DataFusion 36.0.0
[ ] +0
[ ] -1 Do not release this as Apache Arrow DataFusion 36.0.0 because...

Here is my vote:

+1

[1]:
https://github.com/apache/arrow-datafusion/tree/bf6f83b3d228fb386f9b4b20c254fa58e2412660
[2]:
https://dist.apache.org/repos/dist/dev/arrow/apache-arrow-datafusion-36.0.0-rc1
[3]:
https://github.com/apache/arrow-datafusion/blob/bf6f83b3d228fb386f9b4b20c254fa58e2412660/CHANGELOG.md


Re: [DataFusion] Choosing new logos for DataFusion

2024-02-08 Thread Andy Grove
The DataFusion community is close to choosing a new logo for when
DataFusion becomes a top-level project. The discussions have been happening
in a GitHub issue [1], which is technically "on the mailing list," but I
wanted to mention it here again to make sure everyone has a chance to
provide their input. Please add any feedback on the GitHub issue.

Thanks,

Andy.

[1] https://github.com/apache/arrow-datafusion/issues/8788

On Sat, Jan 13, 2024 at 12:08 PM Andy Grove  wrote:

> There are two related discussions in the community about creating new
> logos for DataFusion, for the following reasons:
>
> 1) Our current logo does not have Apache Arrow attribution and is also
> lacking a trademark symbol. [1]
> 2) We need a new version of the logo for the anticipated graduation of
> DataFusion to a top-level ASF project. [2]
>
> It would be great to get more feedback on these issues.
>
> Thanks,
>
> Andy.
>
> [1] https://github.com/apache/arrow-datafusion/issues/8754
> [2] https://github.com/apache/arrow-datafusion/issues/8788
>


[RESULT][VOTE][RUST][Ballista] Release Apache Arrow Ballista 0.12.0 RC4

2024-02-06 Thread Andy Grove
On Tue, Feb 6, 2024 at 5:29 PM Andy Grove  wrote:

> The vote passes with three +1 binding votes.
>
> Source release:
> https://dist.apache.org/repos/dist/release/arrow/arrow-ballista-0.12.0
>
> I have also published the crates to crates.io
>
> On Sun, Feb 4, 2024 at 4:48 AM Andrew Lamb  wrote:
>
>> +1 (binding)
>>
>> Verified in M3 Mac
>>
>> Thanks Andy
>>
>>
>> On Sat, Feb 3, 2024 at 5:35 PM L. C. Hsieh  wrote:
>>
>> > +1 (binding)
>> >
>> > Verified on M1 Mac.
>> >
>> > Thanks Andy.
>> >
>> > On Sat, Feb 3, 2024 at 2:15 PM Andy Grove 
>> wrote:
>> > >
>> > > Hi,
>> > >
>> > > I would like to propose a release of Apache Arrow Ballista
>> > Implementation,
>> > > version 0.12.0.
>> > >
>> > > This release candidate is based on commit:
>> > > a8ee11e55cfae4b7418f7044580318d33be9669e [1]
>> > > The proposed release tarball and signatures are hosted at [2].
>> > > The changelog is located at [3].
>> > >
>> > > Please download, verify checksums and signatures, run the unit tests,
>> and
>> > > vote
>> > > on the release. The vote will be open for at least 72 hours.
>> > >
>> > > Only votes from PMC members are binding, but all members of the
>> community
>> > > are
>> > > encouraged to test the release and vote with "(non-binding)".
>> > >
>> > > The standard verification procedure is documented at
>> > >
>> >
>> https://github.com/apache/arrow-ballista/blob/main/dev/release/README.md#verifying-release-candidates
>> > > .
>> > >
>> > > [ ] +1 Release this as Apache Arrow Ballista 0.12.0
>> > > [ ] +0
>> > > [ ] -1 Do not release this as Apache Arrow Ballista 0.12.0 because...
>> > >
>> > > [1]:
>> > >
>> >
>> https://github.com/apache/arrow-ballista/tree/a8ee11e55cfae4b7418f7044580318d33be9669e
>> > > [2]:
>> > >
>> >
>> https://dist.apache.org/repos/dist/dev/arrow/apache-arrow-ballista-0.12.0-rc4
>> > > [3]:
>> > >
>> >
>> https://github.com/apache/arrow-ballista/blob/a8ee11e55cfae4b7418f7044580318d33be9669e/CHANGELOG.md
>> >
>>
>


Re: [VOTE][RUST][Ballista] Release Apache Arrow Ballista 0.12.0 RC4

2024-02-06 Thread Andy Grove
The vote passes with three +1 binding votes.

Source release:
https://dist.apache.org/repos/dist/release/arrow/arrow-ballista-0.12.0

I have also published the crates to crates.io

On Sun, Feb 4, 2024 at 4:48 AM Andrew Lamb  wrote:

> +1 (binding)
>
> Verified in M3 Mac
>
> Thanks Andy
>
>
> On Sat, Feb 3, 2024 at 5:35 PM L. C. Hsieh  wrote:
>
> > +1 (binding)
> >
> > Verified on M1 Mac.
> >
> > Thanks Andy.
> >
> > On Sat, Feb 3, 2024 at 2:15 PM Andy Grove  wrote:
> > >
> > > Hi,
> > >
> > > I would like to propose a release of Apache Arrow Ballista
> > Implementation,
> > > version 0.12.0.
> > >
> > > This release candidate is based on commit:
> > > a8ee11e55cfae4b7418f7044580318d33be9669e [1]
> > > The proposed release tarball and signatures are hosted at [2].
> > > The changelog is located at [3].
> > >
> > > Please download, verify checksums and signatures, run the unit tests,
> and
> > > vote
> > > on the release. The vote will be open for at least 72 hours.
> > >
> > > Only votes from PMC members are binding, but all members of the
> community
> > > are
> > > encouraged to test the release and vote with "(non-binding)".
> > >
> > > The standard verification procedure is documented at
> > >
> >
> https://github.com/apache/arrow-ballista/blob/main/dev/release/README.md#verifying-release-candidates
> > > .
> > >
> > > [ ] +1 Release this as Apache Arrow Ballista 0.12.0
> > > [ ] +0
> > > [ ] -1 Do not release this as Apache Arrow Ballista 0.12.0 because...
> > >
> > > [1]:
> > >
> >
> https://github.com/apache/arrow-ballista/tree/a8ee11e55cfae4b7418f7044580318d33be9669e
> > > [2]:
> > >
> >
> https://dist.apache.org/repos/dist/dev/arrow/apache-arrow-ballista-0.12.0-rc4
> > > [3]:
> > >
> >
> https://github.com/apache/arrow-ballista/blob/a8ee11e55cfae4b7418f7044580318d33be9669e/CHANGELOG.md
> >
>


[RESULT][VOTE][RUST][DataFusion] Release DataFusion Python Bindings 35.0.0 RC1

2024-02-04 Thread Andy Grove
On Sun, Feb 4, 2024 at 4:56 PM Andy Grove  wrote:

> The vote passes with 3 binding +1 votes. Thanks, everyone. I have
> published the release.
>
> Andy.
>
> On Fri, Feb 2, 2024 at 4:16 AM Andrew Lamb  wrote:
>
>> +1 (binding)
>>
>> Verified on M3 mac
>>
>> As before it seems as if python 3.11 isn't supported in the verification
>> script, only python 3.10. When I used 3.10 everything looks good.
>>
>> Thanks a lot,
>> Andrew
>>
>> On Thu, Feb 1, 2024 at 7:27 PM L. C. Hsieh  wrote:
>>
>> > +1 (binding)
>> >
>> > Verified on M1 Mac.
>> >
>> > Thanks Andy.
>> >
>> > On Thu, Feb 1, 2024 at 3:53 PM Andy Grove 
>> wrote:
>> > >
>> > > Hi,
>> > >
>> > > I would like to propose a release of Apache Arrow DataFusion Python
>> > > Bindings,
>> > > version 35.0.0.
>> > >
>> > > This release candidate is based on commit:
>> > > bef6cb66599588c096dae59ddfd707053e5741cd [1]
>> > > The proposed release tarball and signatures are hosted at [2].
>> > > The changelog is located at [3].
>> > > The Python wheels are located at [4].
>> > >
>> > > Please download, verify checksums and signatures, run the unit tests,
>> and
>> > > vote
>> > > on the release. The vote will be open for at least 72 hours.
>> > >
>> > > Only votes from PMC members are binding, but all members of the
>> community
>> > > are
>> > > encouraged to test the release and vote with "(non-binding)".
>> > >
>> > > The standard verification procedure is documented at
>> > >
>> >
>> https://github.com/apache/arrow-datafusion-python/blob/main/dev/release/README.md#verifying-release-candidates
>> > > .
>> > >
>> > > [ ] +1 Release this as Apache Arrow DataFusion Python 35.0.0
>> > > [ ] +0
>> > > [ ] -1 Do not release this as Apache Arrow DataFusion Python 35.0.0
>> > > because...
>> > >
>> > > Here is my vote:
>> > >
>> > > +1
>> > >
>> > > [1]:
>> > >
>> >
>> https://github.com/apache/arrow-datafusion-python/tree/bef6cb66599588c096dae59ddfd707053e5741cd
>> > > [2]:
>> > >
>> >
>> https://dist.apache.org/repos/dist/dev/arrow/apache-arrow-datafusion-python-35.0.0-rc1
>> > > [3]:
>> > >
>> >
>> https://github.com/apache/arrow-datafusion-python/blob/bef6cb66599588c096dae59ddfd707053e5741cd/CHANGELOG.md
>> > > [4]: https://test.pypi.org/project/datafusion/35.0.0/
>> >
>>
>


Re: [VOTE][RUST][DataFusion] Release DataFusion Python Bindings 35.0.0 RC1

2024-02-04 Thread Andy Grove
The vote passes with 3 binding +1 votes. Thanks, everyone. I have published
the release.

Andy.

On Fri, Feb 2, 2024 at 4:16 AM Andrew Lamb  wrote:

> +1 (binding)
>
> Verified on M3 mac
>
> As before it seems as if python 3.11 isn't supported in the verification
> script, only python 3.10. When I used 3.10 everything looks good.
>
> Thanks a lot,
> Andrew
>
> On Thu, Feb 1, 2024 at 7:27 PM L. C. Hsieh  wrote:
>
> > +1 (binding)
> >
> > Verified on M1 Mac.
> >
> > Thanks Andy.
> >
> > On Thu, Feb 1, 2024 at 3:53 PM Andy Grove  wrote:
> > >
> > > Hi,
> > >
> > > I would like to propose a release of Apache Arrow DataFusion Python
> > > Bindings,
> > > version 35.0.0.
> > >
> > > This release candidate is based on commit:
> > > bef6cb66599588c096dae59ddfd707053e5741cd [1]
> > > The proposed release tarball and signatures are hosted at [2].
> > > The changelog is located at [3].
> > > The Python wheels are located at [4].
> > >
> > > Please download, verify checksums and signatures, run the unit tests,
> and
> > > vote
> > > on the release. The vote will be open for at least 72 hours.
> > >
> > > Only votes from PMC members are binding, but all members of the
> community
> > > are
> > > encouraged to test the release and vote with "(non-binding)".
> > >
> > > The standard verification procedure is documented at
> > >
> >
> https://github.com/apache/arrow-datafusion-python/blob/main/dev/release/README.md#verifying-release-candidates
> > > .
> > >
> > > [ ] +1 Release this as Apache Arrow DataFusion Python 35.0.0
> > > [ ] +0
> > > [ ] -1 Do not release this as Apache Arrow DataFusion Python 35.0.0
> > > because...
> > >
> > > Here is my vote:
> > >
> > > +1
> > >
> > > [1]:
> > >
> >
> https://github.com/apache/arrow-datafusion-python/tree/bef6cb66599588c096dae59ddfd707053e5741cd
> > > [2]:
> > >
> >
> https://dist.apache.org/repos/dist/dev/arrow/apache-arrow-datafusion-python-35.0.0-rc1
> > > [3]:
> > >
> >
> https://github.com/apache/arrow-datafusion-python/blob/bef6cb66599588c096dae59ddfd707053e5741cd/CHANGELOG.md
> > > [4]: https://test.pypi.org/project/datafusion/35.0.0/
> >
>


[DISCUSS] Ballista Python Bindings

2024-02-04 Thread Andy Grove
The current Ballista Python bindings [1] were created by cloning the
DataFusion Python bindings and then making some modifications. The
resulting codebase proved to be challenging to maintain and has not been
maintained for almost a year. This repository contains around 1,100 lines
of Rust code.

I propose that we archive this repository and adopt a new Python client
that only exposes SQL capabilities rather than providing both SQL and
DataFrame APIs. I have a PR [2] up for a new client, and this only contains
75 lines of Rust code. This new client uses the datafusion-python crate as
a dependency rather than duplicating code.

My hope is that this much leaner implementation will be easier to maintain
and keep up-to-date with Ballista releases. We can add the DataFrame API in
the future as a thin wrapper around the datafusion-python dependency if the
project gains enough traction.

If there are no objections, I will go ahead and archive the old repository
in the next week or two (and update the README to point to the new client).

Thanks,

Andy.

[1] https://github.com/apache/arrow-ballista-python

[2] https://github.com/apache/arrow-ballista/pull/970


[VOTE][RUST][Ballista] Release Apache Arrow Ballista 0.12.0 RC4

2024-02-03 Thread Andy Grove
Hi,

I would like to propose a release of Apache Arrow Ballista Implementation,
version 0.12.0.

This release candidate is based on commit:
a8ee11e55cfae4b7418f7044580318d33be9669e [1]
The proposed release tarball and signatures are hosted at [2].
The changelog is located at [3].

Please download, verify checksums and signatures, run the unit tests, and
vote
on the release. The vote will be open for at least 72 hours.

Only votes from PMC members are binding, but all members of the community
are
encouraged to test the release and vote with "(non-binding)".

The standard verification procedure is documented at
https://github.com/apache/arrow-ballista/blob/main/dev/release/README.md#verifying-release-candidates
.

[ ] +1 Release this as Apache Arrow Ballista 0.12.0
[ ] +0
[ ] -1 Do not release this as Apache Arrow Ballista 0.12.0 because...

[1]:
https://github.com/apache/arrow-ballista/tree/a8ee11e55cfae4b7418f7044580318d33be9669e
[2]:
https://dist.apache.org/repos/dist/dev/arrow/apache-arrow-ballista-0.12.0-rc4
[3]:
https://github.com/apache/arrow-ballista/blob/a8ee11e55cfae4b7418f7044580318d33be9669e/CHANGELOG.md


[VOTE][RUST][DataFusion] Release DataFusion Python Bindings 35.0.0 RC1

2024-02-01 Thread Andy Grove
Hi,

I would like to propose a release of Apache Arrow DataFusion Python
Bindings,
version 35.0.0.

This release candidate is based on commit:
bef6cb66599588c096dae59ddfd707053e5741cd [1]
The proposed release tarball and signatures are hosted at [2].
The changelog is located at [3].
The Python wheels are located at [4].

Please download, verify checksums and signatures, run the unit tests, and
vote
on the release. The vote will be open for at least 72 hours.

Only votes from PMC members are binding, but all members of the community
are
encouraged to test the release and vote with "(non-binding)".

The standard verification procedure is documented at
https://github.com/apache/arrow-datafusion-python/blob/main/dev/release/README.md#verifying-release-candidates
.

[ ] +1 Release this as Apache Arrow DataFusion Python 35.0.0
[ ] +0
[ ] -1 Do not release this as Apache Arrow DataFusion Python 35.0.0
because...

Here is my vote:

+1

[1]:
https://github.com/apache/arrow-datafusion-python/tree/bef6cb66599588c096dae59ddfd707053e5741cd
[2]:
https://dist.apache.org/repos/dist/dev/arrow/apache-arrow-datafusion-python-35.0.0-rc1
[3]:
https://github.com/apache/arrow-datafusion-python/blob/bef6cb66599588c096dae59ddfd707053e5741cd/CHANGELOG.md
[4]: https://test.pypi.org/project/datafusion/35.0.0/


[RESULT][VOTE] Accept donation of Comet Spark native engine

2024-01-30 Thread Andy Grove
On Tue, Jan 30, 2024 at 12:52 PM Andy Grove  wrote:

> The vote passes with 17 votes (11 binding).
>
> Thank you to everyone who took the time to vote.
>
> The next step will be to complete the IP clearance process.
>
> Thanks,
>
> Andy.
>
> On Tue, Jan 30, 2024 at 1:31 AM Peter Toth  wrote:
>
>> +1
>>
>> Parth Chandra  ezt írta (időpont: 2024. jan. 29., H,
>> 19:07):
>>
>> > +1 (non-binding)
>> >
>> > On Sat, Jan 27, 2024 at 7:44 AM Andy Grove 
>> wrote:
>> >
>> > > Hello,
>> > >
>> > > This vote is to determine if the Arrow PMC is in favor of accepting
>> the
>> > > donation of Comet (a Spark native engine that is powered by DataFusion
>> > and
>> > > the Rust implementation of Arrow).
>> > >
>> > > The donation was previously discussed on the mailing list [1].
>> > >
>> > > The proposed donation is at [2].
>> > >
>> > > The Arrow PMC will start the IP clearance process if the vote passes.
>> > There
>> > > is a Google document [3] where the community is working on the draft
>> > > contents for the IP clearance form.
>> > >
>> > > The vote will be open for at least 72 hours.
>> > >
>> > > [ ] +1 : Accept the donation
>> > > [ ] 0 : No opinion
>> > > [ ] -1 : Reject donation because...
>> > >
>> > > My vote: +1
>> > >
>> > > Thanks,
>> > >
>> > > Andy.
>> > >
>> > >
>> > > [1] https://lists.apache.org/thread/0q1rb11jtpopc7vt1ffdzro0omblsh0s
>> > > [2] https://github.com/apache/arrow-datafusion-comet/pull/1
>> > > [3]
>> > >
>> > >
>> >
>> https://docs.google.com/document/d/1azmxE1LERNUdnpzqDO5ortKTsPKrhNgQC4oZSmXa8x4/edit?usp=sharing
>> > >
>> >
>>
>


Re: [VOTE] Accept donation of Comet Spark native engine

2024-01-30 Thread Andy Grove
The vote passes with 17 votes (11 binding).

Thank you to everyone who took the time to vote.

The next step will be to complete the IP clearance process.

Thanks,

Andy.

On Tue, Jan 30, 2024 at 1:31 AM Peter Toth  wrote:

> +1
>
> Parth Chandra  ezt írta (időpont: 2024. jan. 29., H,
> 19:07):
>
> > +1 (non-binding)
> >
> > On Sat, Jan 27, 2024 at 7:44 AM Andy Grove 
> wrote:
> >
> > > Hello,
> > >
> > > This vote is to determine if the Arrow PMC is in favor of accepting the
> > > donation of Comet (a Spark native engine that is powered by DataFusion
> > and
> > > the Rust implementation of Arrow).
> > >
> > > The donation was previously discussed on the mailing list [1].
> > >
> > > The proposed donation is at [2].
> > >
> > > The Arrow PMC will start the IP clearance process if the vote passes.
> > There
> > > is a Google document [3] where the community is working on the draft
> > > contents for the IP clearance form.
> > >
> > > The vote will be open for at least 72 hours.
> > >
> > > [ ] +1 : Accept the donation
> > > [ ] 0 : No opinion
> > > [ ] -1 : Reject donation because...
> > >
> > > My vote: +1
> > >
> > > Thanks,
> > >
> > > Andy.
> > >
> > >
> > > [1] https://lists.apache.org/thread/0q1rb11jtpopc7vt1ffdzro0omblsh0s
> > > [2] https://github.com/apache/arrow-datafusion-comet/pull/1
> > > [3]
> > >
> > >
> >
> https://docs.google.com/document/d/1azmxE1LERNUdnpzqDO5ortKTsPKrhNgQC4oZSmXa8x4/edit?usp=sharing
> > >
> >
>


[VOTE] Accept donation of Comet Spark native engine

2024-01-27 Thread Andy Grove
Hello,

This vote is to determine if the Arrow PMC is in favor of accepting the
donation of Comet (a Spark native engine that is powered by DataFusion and
the Rust implementation of Arrow).

The donation was previously discussed on the mailing list [1].

The proposed donation is at [2].

The Arrow PMC will start the IP clearance process if the vote passes. There
is a Google document [3] where the community is working on the draft
contents for the IP clearance form.

The vote will be open for at least 72 hours.

[ ] +1 : Accept the donation
[ ] 0 : No opinion
[ ] -1 : Reject donation because...

My vote: +1

Thanks,

Andy.


[1] https://lists.apache.org/thread/0q1rb11jtpopc7vt1ffdzro0omblsh0s
[2] https://github.com/apache/arrow-datafusion-comet/pull/1
[3]
https://docs.google.com/document/d/1azmxE1LERNUdnpzqDO5ortKTsPKrhNgQC4oZSmXa8x4/edit?usp=sharing


[RESULT][VOTE][RUST][DataFusion] Release Apache Arrow DataFusion 35.0.0 RC1

2024-01-25 Thread Andy Grove
On Thu, Jan 25, 2024 at 8:33 AM Andy Grove  wrote:

> The vote passes with three binding +1 votes. Thanks, everyone.
>
> The release is available at
> https://dist.apache.org/repos/dist/release/arrow/arrow-datafusion-35.0.0/
>
> On Sun, Jan 21, 2024 at 12:38 PM L. C. Hsieh  wrote:
>
>> +1 (binding)
>>
>> Agreed with Andrew. This looks like a test only issue.
>> I think we should address the Expr PartialOrd further
>> (https://github.com/apache/arrow-datafusion/issues/8932), but it
>> should not block the release.
>>
>> Thanks Andy.
>>
>> On Sun, Jan 21, 2024 at 3:13 AM Andrew Lamb  wrote:
>> >
>> > +1 (binding)
>> >
>> > I verified it on Mac (M3).
>> >
>> > I got the same error in test_partial_ord and I agree it looks very much
>> the
>> > the same as https://github.com/apache/arrow-datafusion/pull/8908 -- a
>> test
>> > only issue that should not block the release
>> >
>> > Thanks Andy
>> >
>> >
>> > On Sat, Jan 20, 2024 at 10:43 AM Andy Grove 
>> wrote:
>> >
>> > > Hi,
>> > >
>> > > I would like to propose a release of Apache Arrow DataFusion
>> > > Implementation,
>> > > version 35.0.0.
>> > >
>> > > This release candidate is based on commit:
>> > > e58446bbe9ebe3f5a2aae1abd3c17a694070b0d1 [1]
>> > > The proposed release tarball and signatures are hosted at [2].
>> > > The changelog is located at [3].
>> > >
>> > > Please download, verify checksums and signatures, run the unit tests,
>> and
>> > > vote
>> > > on the release. The vote will be open for at least 72 hours.
>> > >
>> > > Only votes from PMC members are binding, but all members of the
>> community
>> > > are
>> > > encouraged to test the release and vote with "(non-binding)".
>> > >
>> > > The standard verification procedure is documented at
>> > >
>> > >
>> https://github.com/apache/arrow-datafusion/blob/main/dev/release/README.md#verifying-release-candidates
>> > > .
>> > >
>> > > [ ] +1 Release this as Apache Arrow DataFusion 35.0.0
>> > > [ ] +0
>> > > [ ] -1 Do not release this as Apache Arrow DataFusion 35.0.0
>> because...
>> > >
>> > > Here is my vote:
>> > >
>> > > +1
>> > >
>> > > [1]:
>> > >
>> > >
>> https://github.com/apache/arrow-datafusion/tree/e58446bbe9ebe3f5a2aae1abd3c17a694070b0d1
>> > > [2]:
>> > >
>> > >
>> https://dist.apache.org/repos/dist/dev/arrow/apache-arrow-datafusion-35.0.0-rc1
>> > > [3]:
>> > >
>> > >
>> https://github.com/apache/arrow-datafusion/blob/e58446bbe9ebe3f5a2aae1abd3c17a694070b0d1/CHANGELOG.md
>> > >
>>
>


Re: [VOTE][RUST][DataFusion] Release Apache Arrow DataFusion 35.0.0 RC1

2024-01-25 Thread Andy Grove
The vote passes with three binding +1 votes. Thanks, everyone.

The release is available at
https://dist.apache.org/repos/dist/release/arrow/arrow-datafusion-35.0.0/

On Sun, Jan 21, 2024 at 12:38 PM L. C. Hsieh  wrote:

> +1 (binding)
>
> Agreed with Andrew. This looks like a test only issue.
> I think we should address the Expr PartialOrd further
> (https://github.com/apache/arrow-datafusion/issues/8932), but it
> should not block the release.
>
> Thanks Andy.
>
> On Sun, Jan 21, 2024 at 3:13 AM Andrew Lamb  wrote:
> >
> > +1 (binding)
> >
> > I verified it on Mac (M3).
> >
> > I got the same error in test_partial_ord and I agree it looks very much
> the
> > the same as https://github.com/apache/arrow-datafusion/pull/8908 -- a
> test
> > only issue that should not block the release
> >
> > Thanks Andy
> >
> >
> > On Sat, Jan 20, 2024 at 10:43 AM Andy Grove 
> wrote:
> >
> > > Hi,
> > >
> > > I would like to propose a release of Apache Arrow DataFusion
> > > Implementation,
> > > version 35.0.0.
> > >
> > > This release candidate is based on commit:
> > > e58446bbe9ebe3f5a2aae1abd3c17a694070b0d1 [1]
> > > The proposed release tarball and signatures are hosted at [2].
> > > The changelog is located at [3].
> > >
> > > Please download, verify checksums and signatures, run the unit tests,
> and
> > > vote
> > > on the release. The vote will be open for at least 72 hours.
> > >
> > > Only votes from PMC members are binding, but all members of the
> community
> > > are
> > > encouraged to test the release and vote with "(non-binding)".
> > >
> > > The standard verification procedure is documented at
> > >
> > >
> https://github.com/apache/arrow-datafusion/blob/main/dev/release/README.md#verifying-release-candidates
> > > .
> > >
> > > [ ] +1 Release this as Apache Arrow DataFusion 35.0.0
> > > [ ] +0
> > > [ ] -1 Do not release this as Apache Arrow DataFusion 35.0.0 because...
> > >
> > > Here is my vote:
> > >
> > > +1
> > >
> > > [1]:
> > >
> > >
> https://github.com/apache/arrow-datafusion/tree/e58446bbe9ebe3f5a2aae1abd3c17a694070b0d1
> > > [2]:
> > >
> > >
> https://dist.apache.org/repos/dist/dev/arrow/apache-arrow-datafusion-35.0.0-rc1
> > > [3]:
> > >
> > >
> https://github.com/apache/arrow-datafusion/blob/e58446bbe9ebe3f5a2aae1abd3c17a694070b0d1/CHANGELOG.md
> > >
>


[VOTE][RUST][DataFusion] Release Apache Arrow DataFusion 35.0.0 RC1

2024-01-20 Thread Andy Grove
Hi,

I would like to propose a release of Apache Arrow DataFusion Implementation,
version 35.0.0.

This release candidate is based on commit:
e58446bbe9ebe3f5a2aae1abd3c17a694070b0d1 [1]
The proposed release tarball and signatures are hosted at [2].
The changelog is located at [3].

Please download, verify checksums and signatures, run the unit tests, and
vote
on the release. The vote will be open for at least 72 hours.

Only votes from PMC members are binding, but all members of the community
are
encouraged to test the release and vote with "(non-binding)".

The standard verification procedure is documented at
https://github.com/apache/arrow-datafusion/blob/main/dev/release/README.md#verifying-release-candidates
.

[ ] +1 Release this as Apache Arrow DataFusion 35.0.0
[ ] +0
[ ] -1 Do not release this as Apache Arrow DataFusion 35.0.0 because...

Here is my vote:

+1

[1]:
https://github.com/apache/arrow-datafusion/tree/e58446bbe9ebe3f5a2aae1abd3c17a694070b0d1
[2]:
https://dist.apache.org/repos/dist/dev/arrow/apache-arrow-datafusion-35.0.0-rc1
[3]:
https://github.com/apache/arrow-datafusion/blob/e58446bbe9ebe3f5a2aae1abd3c17a694070b0d1/CHANGELOG.md


Re: [DISCUSS] Donation of a Spark native engine based on DataFusion & Arrow

2024-01-15 Thread Andy Grove
Hi Chao,

I have created https://github.com/apache/arrow-datafusion-comet and you
should be able to create a PR against the repo.

Thanks,

Andy.

Andy.

On Fri, Jan 12, 2024 at 3:45 PM Chao Sun  wrote:

> Thanks all for the positive support!
>
> Andy, we plan to name the project Comet (BTW if you have better
> suggestions please let us know). Could you help to create a repo named
> arrow-datafusion-comet or arrow-comet? We'll clean up our internal
> repo and prepare for the donation in the next few days. Thanks for the
> help!
>
> Best,
> Chao
>
>
>
> On Fri, Jan 12, 2024 at 7:09 AM Andy Grove  wrote:
> >
> > I think the next step here would be to create a new repo so that Chao can
> > create a PR for the contribution, and then we can proceed to a vote.
> >
> > Chao - do you have a proposal for the name of the project? Given that
> this
> > is being donated to Apache Arrow, the repo name will start with "arrow-".
> > Also, given that this is more of a DataFusion sub-project, I think it
> would
> > make sense to prefix the repo name with "arrow-datafusion-" and then
> rename
> > to "datafusion-" once we move the DataFusion projects to the new
> top-level
> > project.
> >
> > If the vote passes, we must complete the IP clearance process before the
> PR
> > is accepted [1].
> >
> > [1] https://incubator.apache.org/ip-clearance/
> >
> >
> >
> > On Fri, Jan 12, 2024 at 12:36 AM Albert  wrote:
> >
> > > Like Andrew Lamb mentioned, blaze-rs has similar goals, I'd really be
> > > interested to know some comparisons when the donations are made.
> > > All in all, I look forward to the new native project for spark
> > > acceleration.
> > >
> > > On Thu, Jan 11, 2024 at 9:50 PM Andrew Lamb 
> wrote:
> > >
> > > > I am very supportive of this donation. I know of at least one other
> > > > DataFusion-based project, blaze-rs[1], which has the same design
> goal and
> > > > bringing this project into the ASF may help consolidate these efforts
> > > >
> > > > As Andy said, I believe it was very valuable to have a major consumer
> > > > project (e.g. DataFusion) to help drive the definition and
> implementation
> > > > of arrow-rs implementation. We never achieved the same synergy with
> > > > Ballista and DataFusion but I think it is more likely with a more
> > > actively
> > > > maintained Spark accelerator.
> > > >
> > > > I am not sure it affects this discussion, but the Gluten project,
> based
> > > on
> > > > Velox, was accepted yesterday[2] into the Apache Incubator[2].
> While the
> > > > functionality may be similar, the technology (Rust vs C/C++) and the
> > > > communities are different so having both in the same (big) tent of
> the
> > > ASF
> > > > doesn't seem concerning to me.
> > > >
> > > > Also, as Chao says, I think this new sub project would naturally
> move to
> > > a
> > > > new DataFusion top level project when we get there (we plan a
> proposed
> > > > resolution April ASF board meeting)
> > > >
> > > > Looking forward to seeing more!
> > > > Andrew
> > > >
> > > > [1]: https://github.com/blaze-init/blaze
> > > > [2]:
> https://lists.apache.org/thread/6lrozds10jn9gknj9rf74lqbh7j55pq6
> > > >
> > > > On Wed, Jan 10, 2024 at 5:10 PM Andy Grove 
> > > wrote:
> > > >
> > > > > Hi Chao,
> > > > >
> > > > > This sounds like a really interesting project. I am interested in
> > > seeing
> > > > > how it compares to Spark RAPIDS (the project that I work on at
> NVIDIA)
> > > > and
> > > > > Intel's Gluten project (that works with Velox).
> > > > >
> > > > > I can see the following benefits of having this project being under
> > > > Apache
> > > > > Arrow governance:
> > > > >
> > > > > - Assuming that this is a drop-in replacement that doesn't require
> > > users
> > > > to
> > > > > change their code (as I imagine is the case), then it could lead to
> > > > greater
> > > > > adoption of DataFusion, especially for more demanding use cases
> where
> > > > > processing on a single node is not possible.
> > > > > - Given that it has a deep integration with t

[DataFusion] Choosing new logos for DataFusion

2024-01-13 Thread Andy Grove
There are two related discussions in the community about creating new logos
for DataFusion, for the following reasons:

1) Our current logo does not have Apache Arrow attribution and is also
lacking a trademark symbol. [1]
2) We need a new version of the logo for the anticipated graduation of
DataFusion to a top-level ASF project. [2]

It would be great to get more feedback on these issues.

Thanks,

Andy.

[1] https://github.com/apache/arrow-datafusion/issues/8754
[2] https://github.com/apache/arrow-datafusion/issues/8788


Ensuring compliance with ASF branding policy

2024-01-13 Thread Andy Grove
It was recently brought to the attention of the Arrow PMC that some of our
documentation and other web content is not fully compliant with the ASF
branding policy.

I filed an issue in the mono repo with the details and to track the work
needed across the various language implementations and sub-projects [1],
but I wanted to bring this to everyone's attention, hence this email.

Thank you to everyone who is already contributing towards this effort.

Andy.

[1] https://github.com/apache/arrow/issues/39462


Re: [DISCUSS] Donation of a Spark native engine based on DataFusion & Arrow

2024-01-12 Thread Andy Grove
I think the next step here would be to create a new repo so that Chao can
create a PR for the contribution, and then we can proceed to a vote.

Chao - do you have a proposal for the name of the project? Given that this
is being donated to Apache Arrow, the repo name will start with "arrow-".
Also, given that this is more of a DataFusion sub-project, I think it would
make sense to prefix the repo name with "arrow-datafusion-" and then rename
to "datafusion-" once we move the DataFusion projects to the new top-level
project.

If the vote passes, we must complete the IP clearance process before the PR
is accepted [1].

[1] https://incubator.apache.org/ip-clearance/



On Fri, Jan 12, 2024 at 12:36 AM Albert  wrote:

> Like Andrew Lamb mentioned, blaze-rs has similar goals, I'd really be
> interested to know some comparisons when the donations are made.
> All in all, I look forward to the new native project for spark
> acceleration.
>
> On Thu, Jan 11, 2024 at 9:50 PM Andrew Lamb  wrote:
>
> > I am very supportive of this donation. I know of at least one other
> > DataFusion-based project, blaze-rs[1], which has the same design goal and
> > bringing this project into the ASF may help consolidate these efforts
> >
> > As Andy said, I believe it was very valuable to have a major consumer
> > project (e.g. DataFusion) to help drive the definition and implementation
> > of arrow-rs implementation. We never achieved the same synergy with
> > Ballista and DataFusion but I think it is more likely with a more
> actively
> > maintained Spark accelerator.
> >
> > I am not sure it affects this discussion, but the Gluten project, based
> on
> > Velox, was accepted yesterday[2] into the Apache Incubator[2].  While the
> > functionality may be similar, the technology (Rust vs C/C++) and the
> > communities are different so having both in the same (big) tent of the
> ASF
> > doesn't seem concerning to me.
> >
> > Also, as Chao says, I think this new sub project would naturally move to
> a
> > new DataFusion top level project when we get there (we plan a proposed
> > resolution April ASF board meeting)
> >
> > Looking forward to seeing more!
> > Andrew
> >
> > [1]: https://github.com/blaze-init/blaze
> > [2]: https://lists.apache.org/thread/6lrozds10jn9gknj9rf74lqbh7j55pq6
> >
> > On Wed, Jan 10, 2024 at 5:10 PM Andy Grove 
> wrote:
> >
> > > Hi Chao,
> > >
> > > This sounds like a really interesting project. I am interested in
> seeing
> > > how it compares to Spark RAPIDS (the project that I work on at NVIDIA)
> > and
> > > Intel's Gluten project (that works with Velox).
> > >
> > > I can see the following benefits of having this project being under
> > Apache
> > > Arrow governance:
> > >
> > > - Assuming that this is a drop-in replacement that doesn't require
> users
> > to
> > > change their code (as I imagine is the case), then it could lead to
> > greater
> > > adoption of DataFusion, especially for more demanding use cases where
> > > processing on a single node is not possible.
> > > - Given that it has a deep integration with the Rust implementation of
> > > Arrow as well as DataFusion, and given the overlap of committers
> between
> > > these projects, having them under the same governance and communication
> > > channels will generally be more efficient than if this project is
> > separate.
> > > - Hopefully this leads to more upstream contributions to DataFusion,
> > > perhaps even allowing other projects such as Ballista to benefit from
> > > Spark-compatible operators and expressions in the future.
> > > - Having another project that uses DataFusion as a dependency could
> help
> > > with stabilizing the public APIs and generally driving more innovation.
> > >
> > > Given these points, I would be supportive of a donation. I see it as
> > being
> > > similar to the Ballista project, which is already part of Arrow (and we
> > > plan to move along with DataFusion once it becomes a top-level
> project).
> > >
> > > Thanks,
> > >
> > > Andy.
> > >
> > > On Wed, Jan 10, 2024 at 2:28 PM Chao Sun  wrote:
> > >
> > > > Hi all,
> > > >
> > > > We have been working on a native execution engine for Apache Spark
> > > > that is heavily based on DataFusion and Arrow. Our goal is to
> > > > accelerate Spark query execution via delegating Spark's physical plan
> > > > execution to Da

Re: [DISCUSS] Donation of a Spark native engine based on DataFusion & Arrow

2024-01-10 Thread Andy Grove
Hi Chao,

This sounds like a really interesting project. I am interested in seeing
how it compares to Spark RAPIDS (the project that I work on at NVIDIA) and
Intel's Gluten project (that works with Velox).

I can see the following benefits of having this project being under Apache
Arrow governance:

- Assuming that this is a drop-in replacement that doesn't require users to
change their code (as I imagine is the case), then it could lead to greater
adoption of DataFusion, especially for more demanding use cases where
processing on a single node is not possible.
- Given that it has a deep integration with the Rust implementation of
Arrow as well as DataFusion, and given the overlap of committers between
these projects, having them under the same governance and communication
channels will generally be more efficient than if this project is separate.
- Hopefully this leads to more upstream contributions to DataFusion,
perhaps even allowing other projects such as Ballista to benefit from
Spark-compatible operators and expressions in the future.
- Having another project that uses DataFusion as a dependency could help
with stabilizing the public APIs and generally driving more innovation.

Given these points, I would be supportive of a donation. I see it as being
similar to the Ballista project, which is already part of Arrow (and we
plan to move along with DataFusion once it becomes a top-level project).

Thanks,

Andy.

On Wed, Jan 10, 2024 at 2:28 PM Chao Sun  wrote:

> Hi all,
>
> We have been working on a native execution engine for Apache Spark
> that is heavily based on DataFusion and Arrow. Our goal is to
> accelerate Spark query execution via delegating Spark's physical plan
> execution to DataFusion's highly modular execution framework, while
> still maintaining the same semantics to Spark users (i.e., no Spark
> behavior change from the end users' point of view). Several of us are
> Spark and/or Arrow committers. At the moment, the project is under
> active development and not yet feature complete. However, some of the
> existing functionalities are relatively mature and have been put in
> production for a while now.
>
> Given the current momentum towards accelerating Spark through native
> vectorized execution, we believe open sourcing this work will benefit
> other Spark users too. In addition, we think the project itself can
> also leverage the vibrant and strong community behind Arrow and
> DataFusion, and grow faster. Because of this, we are exploring the
> possibility of contributing this project to the Apache Software
> Foundation (ASF) under the Apache Arrow project umbrella.
>
> We'd very much like to hear your opinion on this. Thanks.
>
> Best,
> Chao
>


Re: [CROWDSOURCING] 2024 ASF Board Report -- January 10, 2024

2024-01-10 Thread Andy Grove
generically interacting with object store systems such as AWS S3, Google
Cloud
Storage, and Azure Blob Storage. This crate has seen significant adoption
outside of the arrow community, for example the crates.io service itself.


### C (GLib)

No update

### MATLAB

We are currently working on integrating with the project release tooling to
make it possible to distribute pre-built MLTBX files for easy installation
of
the MATLAB interface.

### Python

There has been ongoing work on improving interoperability with other Python
projects for example adding C Data Interface PyCapsule protocol and
implementing the usage of capsules in ADBC and nanoarrow-python. We have
also
implemented the DLPack protocol on Arrow Arrays that is used to move the
data
to ML libraries.

A critical security vulnerability was discovered in PyArrow versions 0.14.0
to
14.0.0 that allowed arbitrary code execution when loading a malicious Arrow
IPC, Feather, or Parquet data file (CVE-2023-47248). The vulnerability was
patched in PyArrow version 14.0.1. A hotfix package was released to patch
the
vulnerability in all other versions of PyArrow for users unable to
immediately
upgrade.

### R

Completed large parts of a major rework of the Arrow R package build system.
These changes aim to reduce maintenance burden and streamline
new-contributor
experience e.g. by automating the use of nightly builds which enables
contributions to the R package without having to setup a C++ development
environment.

### Ruby

Added some convenient APIs.

### Swift

Improved Flight SQL implementation.

## Community Health:
Community communication continues to be strong.

There have been 5 blog posts published to https://arrow.apache.org/blog/  in
the last 3 months.

The mailing lists are active




On Sun, Jan 7, 2024 at 12:16 PM Andy Grove  wrote:

> Thanks for the updates so far. There are still no updates for many of the
> language implementations, and it would be good to get 1-2 lines for each of
> them if possible.
>
> We are currently missing updates for the C (GLib), C++, Go, Java,
> JavaScript, Julia, MATLAB, Python, R, Ruby, and Swift Arrow
> implementations, as well as the Acera, nanoarrow, Arrow Flight, and Arrow
> Flight SQL adapter for PostgreSQL subprojects.
>
> The report is due this Wednesday, January 10.
>
> I will endeavor to start this process for future board reports earlier so
> we have more time to complete this.
>
> Thanks,
>
> Andy.
>
>
>
> On Thu, Jan 4, 2024 at 4:08 PM Andy Grove  wrote:
>
>> Hello Arrow Community,
>>
>> Please add any comments or board content directly to [1] or reply to
>> this email and I will incorporate your comments. You can see what we
>> currently have at the end of this email.
>>
>> One of the responsibilities of being part of the Apache Software
>> Foundation
>> (ASF) is to regularly summarize the state of the project in a quarterly
>> update to the ASF board. I plan to submit the next report on January 10,
>> 2024
>>
>> While this is partly an administrative reporting exercise, I think it is
>> also valuable to reflect on past accomplishments and think about goals for
>> the future.
>>
>> Historically, Arrow has crowd sourced the content which has worked well.
>> It would be especially interesting and valuable for members of the various
>> language
>> implementation communities and subprojects could provide a sentence or two
>> updates
>>
>> Thank you,
>> Andy
>>
>> [1]
>> https://docs.google.com/document/d/1wZkDTcaR-fZwT5QUd6sFeYmmNKZXEhSrTu5U0glLAQg/edit?usp=sharing
>>
>>
>> ---
>>
>>
>> 2024-01-10 Arrow ASF Board Report
>>
>> Arrow PMC Chair Note: Please add any relevant comments / content to this
>> document. Andy Grove will submit to the ASF board on January 10, 2024
>> (about one week prior to the scheduled
>> <https://svn.apache.org/repos/private/committers/board/calendar.txt>
>> board meeting).
>>
>> The rationale and process for this report:
>> https://www.apache.org/foundation/board/reporting
>>
>> Past report: 2023-10-11 Arrow ASF Board Report
>> <https://docs.google.com/document/d/1MU5cxzVuAIuDb6KXOAkwT4ze7IBGHKks_l92gxZeTbg/edit>
>>
>> The metrics in this report are derived from
>> https://reporter.apache.org/wizard/?arrow
>>
>> ## Description:
>>
>> The mission of Apache Arrow is the creation and maintenance of software
>>
>> related to columnar in-memory processing and data interchange. More
>>
>> information can be found at https://arrow.apache.org/overview/
>>
>> ## Project Status:
>>
>> Current project status: Ongoing (high activity)
>>
>> Is

Re: [CROWDSOURCING] 2024 ASF Board Report -- January 10, 2024

2024-01-07 Thread Andy Grove
Thanks for the updates so far. There are still no updates for many of the
language implementations, and it would be good to get 1-2 lines for each of
them if possible.

We are currently missing updates for the C (GLib), C++, Go, Java,
JavaScript, Julia, MATLAB, Python, R, Ruby, and Swift Arrow
implementations, as well as the Acera, nanoarrow, Arrow Flight, and Arrow
Flight SQL adapter for PostgreSQL subprojects.

The report is due this Wednesday, January 10.

I will endeavor to start this process for future board reports earlier so
we have more time to complete this.

Thanks,

Andy.



On Thu, Jan 4, 2024 at 4:08 PM Andy Grove  wrote:

> Hello Arrow Community,
>
> Please add any comments or board content directly to [1] or reply to
> this email and I will incorporate your comments. You can see what we
> currently have at the end of this email.
>
> One of the responsibilities of being part of the Apache Software Foundation
> (ASF) is to regularly summarize the state of the project in a quarterly
> update to the ASF board. I plan to submit the next report on January 10,
> 2024
>
> While this is partly an administrative reporting exercise, I think it is
> also valuable to reflect on past accomplishments and think about goals for
> the future.
>
> Historically, Arrow has crowd sourced the content which has worked well.
> It would be especially interesting and valuable for members of the various
> language
> implementation communities and subprojects could provide a sentence or two
> updates
>
> Thank you,
> Andy
>
> [1]
> https://docs.google.com/document/d/1wZkDTcaR-fZwT5QUd6sFeYmmNKZXEhSrTu5U0glLAQg/edit?usp=sharing
>
>
> ---
>
>
> 2024-01-10 Arrow ASF Board Report
>
> Arrow PMC Chair Note: Please add any relevant comments / content to this
> document. Andy Grove will submit to the ASF board on January 10, 2024
> (about one week prior to the scheduled
> <https://svn.apache.org/repos/private/committers/board/calendar.txt>
> board meeting).
>
> The rationale and process for this report:
> https://www.apache.org/foundation/board/reporting
>
> Past report: 2023-10-11 Arrow ASF Board Report
> <https://docs.google.com/document/d/1MU5cxzVuAIuDb6KXOAkwT4ze7IBGHKks_l92gxZeTbg/edit>
>
> The metrics in this report are derived from
> https://reporter.apache.org/wizard/?arrow
>
> ## Description:
>
> The mission of Apache Arrow is the creation and maintenance of software
>
> related to columnar in-memory processing and data interchange. More
>
> information can be found at https://arrow.apache.org/overview/
>
> ## Project Status:
>
> Current project status: Ongoing (high activity)
>
> Issues for the board: None
>
> ## Membership Data:
>
> There are currently 103 committers and 52 PMC members in this project.
>
> The Committer-to-PMC ratio is roughly 7:4.
>
> Community changes, past quarter:
>
> - Jonathan Keane was added to the PMC on 2023-10-13
>
> - Raúl Cumplido was added to the PMC on 2023-11-12
>
> - Curt Hagenlocher was added as committer on 2023-10-14
>
> - Felipe Oliveira Carvalho was added as committer on 2023-12-06
>
> - James Duong was added as committer on 2023-11-16
>
> - Xuwei Fu was added as committer on 2023-10-22
>
>
> ## Project Activity:
>
>
>
> ## Sub Project Updates
>
> Arrow has several subprojects, as listed on https://arrow.apache.org/
>
> ### ADBC
>
>
> ### Arrow Flight
>
>
>
> ### Arrow Flight SQL
>
>
>
> ### Arrow Flight SQL adapter for PostgreSQL
>
>
>
> ### DataFusion & Ballista
>
> DataFusion continues releasing regularly. We are working on a paper
> describing the system for ACM SIGMOD, and in general are trying to scale
> the project as it grows in popularity.
>
> Ballista is not very active but continues to receive occasional
> contributions.
>
>
> ### Acero
>
>
>
> ### nanoarrow
>
>
>
> ## Language Area Updates
>
> Arrow has at least 13 different language implementations, as explained in
>
> https://arrow.apache.org/overview/
>
> Arrow 14.0.0 was released from the monorepo:
>
> https://arrow.apache.org/blog/2023/11/01/14.0.0-release/
>
> ### C++
>
>
> ### C#
>
>
>
> ### Go
>
>
>
> ### Java
>
>
> ### JavaScript
>
>
> ### Julia
>
>
> ### Rust
>
>
> ### C (GLib)
>
>
> ### MATLAB
>
>
> ### Python
>
>
> ### R
>
>
> ### Ruby
>
>
> ### Swift
>
>
>
> ## Recent Releases:
> Recent releases:
>
>-
>
>14.0.2 was released on 2023-12-18.
>-
>
>RS-DATAFUSION-34.0.0 was released on 2023-12-17.
>-
>
>JULIA-2.

[CROWDSOURCING] 2024 ASF Board Report -- January 10, 2024

2024-01-04 Thread Andy Grove
Hello Arrow Community,

Please add any comments or board content directly to [1] or reply to
this email and I will incorporate your comments. You can see what we
currently have at the end of this email.

One of the responsibilities of being part of the Apache Software Foundation
(ASF) is to regularly summarize the state of the project in a quarterly
update to the ASF board. I plan to submit the next report on January 10,
2024

While this is partly an administrative reporting exercise, I think it is
also valuable to reflect on past accomplishments and think about goals for
the future.

Historically, Arrow has crowd sourced the content which has worked well.
It would be especially interesting and valuable for members of the various
language
implementation communities and subprojects could provide a sentence or two
updates

Thank you,
Andy

[1]
https://docs.google.com/document/d/1wZkDTcaR-fZwT5QUd6sFeYmmNKZXEhSrTu5U0glLAQg/edit?usp=sharing


---


2024-01-10 Arrow ASF Board Report

Arrow PMC Chair Note: Please add any relevant comments / content to this
document. Andy Grove will submit to the ASF board on January 10, 2024
(about one week prior to the scheduled
<https://svn.apache.org/repos/private/committers/board/calendar.txt> board
meeting).

The rationale and process for this report:
https://www.apache.org/foundation/board/reporting

Past report: 2023-10-11 Arrow ASF Board Report
<https://docs.google.com/document/d/1MU5cxzVuAIuDb6KXOAkwT4ze7IBGHKks_l92gxZeTbg/edit>

The metrics in this report are derived from
https://reporter.apache.org/wizard/?arrow

## Description:

The mission of Apache Arrow is the creation and maintenance of software

related to columnar in-memory processing and data interchange. More

information can be found at https://arrow.apache.org/overview/

## Project Status:

Current project status: Ongoing (high activity)

Issues for the board: None

## Membership Data:

There are currently 103 committers and 52 PMC members in this project.

The Committer-to-PMC ratio is roughly 7:4.

Community changes, past quarter:

- Jonathan Keane was added to the PMC on 2023-10-13

- Raúl Cumplido was added to the PMC on 2023-11-12

- Curt Hagenlocher was added as committer on 2023-10-14

- Felipe Oliveira Carvalho was added as committer on 2023-12-06

- James Duong was added as committer on 2023-11-16

- Xuwei Fu was added as committer on 2023-10-22


## Project Activity:



## Sub Project Updates

Arrow has several subprojects, as listed on https://arrow.apache.org/

### ADBC


### Arrow Flight



### Arrow Flight SQL



### Arrow Flight SQL adapter for PostgreSQL



### DataFusion & Ballista

DataFusion continues releasing regularly. We are working on a paper
describing the system for ACM SIGMOD, and in general are trying to scale
the project as it grows in popularity.

Ballista is not very active but continues to receive occasional
contributions.


### Acero



### nanoarrow



## Language Area Updates

Arrow has at least 13 different language implementations, as explained in

https://arrow.apache.org/overview/

Arrow 14.0.0 was released from the monorepo:

https://arrow.apache.org/blog/2023/11/01/14.0.0-release/

### C++


### C#



### Go



### Java


### JavaScript


### Julia


### Rust


### C (GLib)


### MATLAB


### Python


### R


### Ruby


### Swift



## Recent Releases:
Recent releases:

   -

   14.0.2 was released on 2023-12-18.
   -

   RS-DATAFUSION-34.0.0 was released on 2023-12-17.
   -

   JULIA-2.7.0 was released on 2023-12-10.
   -

   RS-DATAFUSION-PYTHON-33.0.0 was released on 2023-11-19.
   -

   RS-DATAFUSION-33.0.0 was released on 2023-11-16.
   -

   RS-48.0.1 was released on 2023-11-13.
   -

   RS-49.0.0 was released on 2023-11-13.
   -

   ADBC-0.8.0 was released on 2023-11-09.
   -

   14.0.1 was released on 2023-11-08.
   -

   RS-OS-0.8.0 was released on 2023-11-06.
   -

   14.0.0 was released on 2023-11-01.
   -

   RS-DATAFUSION-PYTHON-32.0.0 was released on 2023-10-25.
   -

   RS-48.0.0 was released on 2023-10-23.
   -

   RS-DATAFUSION-32.0.0 was released on 2023-10-12.




## Community Health:

Community communication continues to be strong.

There have been 5 blog posts published to https://arrow.apache.org/blog/  in

the last 3 months.

The mailing lists are active


dev@arrow.apache.org had a 51% increase in traffic in the past quarter (715
emails compared to 472)


For the mono repo:

2374 commits in the past quarter (-2% change)

249 code contributors in the past quarter (-2% change)

1752 PRs opened on GitHub, past quarter (-7% change)

1698 PRs closed on GitHub, past quarter (-5% change)

1364 issues opened on GitHub, past quarter (-14% change)

1052 issues closed on GitHub, past quarter (-18% change)


[RESULT][VOTE][RUST][DataFusion] Release DataFusion Python Bindings 34.0.0 RC1

2024-01-02 Thread Andy Grove
On Tue, Jan 2, 2024 at 5:51 PM Andy Grove  wrote:

> The vote passes with three binding +1 votes. Thanks, everyone.
>
> Source release:
> https://dist.apache.org/repos/dist/release/arrow/arrow-datafusion-python-34.0.0/
> PyPi: https://test.pypi.org/project/datafusion/
>
> On Tue, Jan 2, 2024 at 5:39 PM Andy Grove  wrote:
>
>> Hi Andrew,
>>
>> It looks like the issue is that numpy 1.21.3 requires a different Python
>> version:
>>
>> 1.21.3 Requires-Python>=3.7,<3.11
>>
>> I am guessing that you have a Python version that is not within that
>> range?
>>
>> I agree that this should not be a blocker.
>>
>> Thanks,
>>
>> Andy.
>>
>>
>>
>> On Fri, Dec 29, 2023 at 4:18 AM Andrew Lamb  wrote:
>>
>>> I had some trouble running the verification script --  got an error that
>>> a
>>> specific version of numpy was not available. I am running on an Apple M3
>>> Max.
>>>
>>> ERROR: No matching distribution found for numpy==1.21.3
>>>
>>> However that version does appear to be available:
>>> https://pypi.org/project/numpy/1.21.3/
>>>
>>> I was able to verify the release on a Ubuntu 22.04 / x86_64 machine so I
>>> don't think this is a release blocker
>>>
>>> Andrew
>>>
>>>
>>> Here are more details:
>>>
>>> $ ./dev/release/verify-release-candidate.sh 34.0.0 1
>>> + set -o pipefail
>>> +++ dirname ./dev/release/verify-release-candidate.sh
>>> ++ cd ./dev/release
>>> ++ pwd
>>> ...
>>> Successfully installed pip-23.3.2
>>> + python3 -m pip install -r requirements-310.txt
>>> Collecting attrs==21.2.0 (from -r requirements-310.txt (line 7))
>>>   Using cached attrs-21.2.0-py2.py3-none-any.whl (53 kB)
>>> Collecting black==21.9b0 (from -r requirements-310.txt (line 11))
>>>   Using cached black-21.9b0-py3-none-any.whl (148 kB)
>>> Collecting click==8.0.3 (from -r requirements-310.txt (line 15))
>>>   Using cached click-8.0.3-py3-none-any.whl (97 kB)
>>> Collecting flake8==4.0.1 (from -r requirements-310.txt (line 19))
>>>   Using cached flake8-4.0.1-py2.py3-none-any.whl (64 kB)
>>> Collecting iniconfig==1.1.1 (from -r requirements-310.txt (line 23))
>>>   Using cached iniconfig-1.1.1-py2.py3-none-any.whl (5.0 kB)
>>> Collecting isort==5.9.3 (from -r requirements-310.txt (line 27))
>>>   Using cached isort-5.9.3-py3-none-any.whl (106 kB)
>>> Collecting maturin==0.15.1 (from -r requirements-310.txt (line 31))
>>>   Using cached
>>>
>>> maturin-0.15.1-py3-none-macosx_10_9_x86_64.macosx_11_0_arm64.macosx_10_9_universal2.whl
>>> (14.5 MB)
>>> Collecting mccabe==0.6.1 (from -r requirements-310.txt (line 46))
>>>   Using cached mccabe-0.6.1-py2.py3-none-any.whl (8.6 kB)
>>> Collecting mypy==0.910 (from -r requirements-310.txt (line 50))
>>>   Using cached mypy-0.910-py3-none-any.whl (2.1 MB)
>>> Collecting mypy-extensions==0.4.3 (from -r requirements-310.txt (line
>>> 75))
>>>   Using cached mypy_extensions-0.4.3-py2.py3-none-any.whl (4.5 kB)
>>> ERROR: Ignored the following versions that require a different python
>>> version: 1.21.2 Requires-Python >=3.7,<3.11; 1.21.3 Requires-Python
>>> >=3.7,<3.11; 1.21.4 Requires-Python >=3.7,<3.11; 1.21.5 Requires-Python
>>> >=3.7,<3.11; 1.21.6 Requires-Python >=3.7,<3.11
>>> ERROR: Could not find a version that satisfies the requirement
>>> numpy==1.21.3 (from versions: 1.3.0, 1.4.1, 1.5.0, 1.5.1, 1.6.0, 1.6.1,
>>> 1.6.2, 1.7.0, 1.7.1, 1.7.2, 1.8.0, 1.8.1, 1.8.2, 1.9.0, 1.9.1, 1.9.2,
>>> 1.9.3, 1.10.0.post2, 1.10.1, 1.10.2, 1.10.4, 1.11.0, 1.11.1, 1.11.2,
>>> 1.11.3, 1.12.0, 1.12.1, 1.13.0, 1.13.1, 1.13.3, 1.14.0, 1.14.1, 1.14.2,
>>> 1.14.3, 1.14.4, 1.14.5, 1.14.6, 1.15.0, 1.15.1, 1.15.2, 1.15.3, 1.15.4,
>>> 1.16.0, 1.16.1, 1.16.2, 1.16.3, 1.16.4, 1.16.5, 1.16.6, 1.17.0, 1.17.1,
>>> 1.17.2, 1.17.3, 1.17.4, 1.17.5, 1.18.0, 1.18.1, 1.18.2, 1.18.3, 1.18.4,
>>> 1.18.5, 1.19.0, 1.19.1, 1.19.2, 1.19.3, 1.19.4, 1.19.5, 1.20.0, 1.20.1,
>>> 1.20.2, 1.20.3, 1.21.0, 1.21.1, 1.22.0, 1.22.1, 1.22.2, 1.22.3, 1.22.4,
>>> 1.23.0rc1, 1.23.0rc2, 1.23.0rc3, 1.23.0, 1.23.1, 1.23.2, 1.23.3, 1.23.4,
>>> 1.23.5, 1.24.0rc1, 1.24.0rc2, 1.24.0, 1.24.1, 1.24.2, 1.24.3, 1.24.4,
>>> 1.25.0rc1, 1.25.0, 1.25.1, 1.25.2, 1.26.0b1, 1.26.0rc1, 1.26.0, 1.26.1,
>>> 1.26.2)
>>> ERROR: No matching distribution found for numpy==1.21.3
>>&g

Re: [VOTE][RUST][DataFusion] Release DataFusion Python Bindings 34.0.0 RC1

2024-01-02 Thread Andy Grove
The vote passes with three binding +1 votes. Thanks, everyone.

Source release:
https://dist.apache.org/repos/dist/release/arrow/arrow-datafusion-python-34.0.0/
PyPi: https://test.pypi.org/project/datafusion/

On Tue, Jan 2, 2024 at 5:39 PM Andy Grove  wrote:

> Hi Andrew,
>
> It looks like the issue is that numpy 1.21.3 requires a different Python
> version:
>
> 1.21.3 Requires-Python>=3.7,<3.11
>
> I am guessing that you have a Python version that is not within that range?
>
> I agree that this should not be a blocker.
>
> Thanks,
>
> Andy.
>
>
>
> On Fri, Dec 29, 2023 at 4:18 AM Andrew Lamb  wrote:
>
>> I had some trouble running the verification script --  got an error that a
>> specific version of numpy was not available. I am running on an Apple M3
>> Max.
>>
>> ERROR: No matching distribution found for numpy==1.21.3
>>
>> However that version does appear to be available:
>> https://pypi.org/project/numpy/1.21.3/
>>
>> I was able to verify the release on a Ubuntu 22.04 / x86_64 machine so I
>> don't think this is a release blocker
>>
>> Andrew
>>
>>
>> Here are more details:
>>
>> $ ./dev/release/verify-release-candidate.sh 34.0.0 1
>> + set -o pipefail
>> +++ dirname ./dev/release/verify-release-candidate.sh
>> ++ cd ./dev/release
>> ++ pwd
>> ...
>> Successfully installed pip-23.3.2
>> + python3 -m pip install -r requirements-310.txt
>> Collecting attrs==21.2.0 (from -r requirements-310.txt (line 7))
>>   Using cached attrs-21.2.0-py2.py3-none-any.whl (53 kB)
>> Collecting black==21.9b0 (from -r requirements-310.txt (line 11))
>>   Using cached black-21.9b0-py3-none-any.whl (148 kB)
>> Collecting click==8.0.3 (from -r requirements-310.txt (line 15))
>>   Using cached click-8.0.3-py3-none-any.whl (97 kB)
>> Collecting flake8==4.0.1 (from -r requirements-310.txt (line 19))
>>   Using cached flake8-4.0.1-py2.py3-none-any.whl (64 kB)
>> Collecting iniconfig==1.1.1 (from -r requirements-310.txt (line 23))
>>   Using cached iniconfig-1.1.1-py2.py3-none-any.whl (5.0 kB)
>> Collecting isort==5.9.3 (from -r requirements-310.txt (line 27))
>>   Using cached isort-5.9.3-py3-none-any.whl (106 kB)
>> Collecting maturin==0.15.1 (from -r requirements-310.txt (line 31))
>>   Using cached
>>
>> maturin-0.15.1-py3-none-macosx_10_9_x86_64.macosx_11_0_arm64.macosx_10_9_universal2.whl
>> (14.5 MB)
>> Collecting mccabe==0.6.1 (from -r requirements-310.txt (line 46))
>>   Using cached mccabe-0.6.1-py2.py3-none-any.whl (8.6 kB)
>> Collecting mypy==0.910 (from -r requirements-310.txt (line 50))
>>   Using cached mypy-0.910-py3-none-any.whl (2.1 MB)
>> Collecting mypy-extensions==0.4.3 (from -r requirements-310.txt (line 75))
>>   Using cached mypy_extensions-0.4.3-py2.py3-none-any.whl (4.5 kB)
>> ERROR: Ignored the following versions that require a different python
>> version: 1.21.2 Requires-Python >=3.7,<3.11; 1.21.3 Requires-Python
>> >=3.7,<3.11; 1.21.4 Requires-Python >=3.7,<3.11; 1.21.5 Requires-Python
>> >=3.7,<3.11; 1.21.6 Requires-Python >=3.7,<3.11
>> ERROR: Could not find a version that satisfies the requirement
>> numpy==1.21.3 (from versions: 1.3.0, 1.4.1, 1.5.0, 1.5.1, 1.6.0, 1.6.1,
>> 1.6.2, 1.7.0, 1.7.1, 1.7.2, 1.8.0, 1.8.1, 1.8.2, 1.9.0, 1.9.1, 1.9.2,
>> 1.9.3, 1.10.0.post2, 1.10.1, 1.10.2, 1.10.4, 1.11.0, 1.11.1, 1.11.2,
>> 1.11.3, 1.12.0, 1.12.1, 1.13.0, 1.13.1, 1.13.3, 1.14.0, 1.14.1, 1.14.2,
>> 1.14.3, 1.14.4, 1.14.5, 1.14.6, 1.15.0, 1.15.1, 1.15.2, 1.15.3, 1.15.4,
>> 1.16.0, 1.16.1, 1.16.2, 1.16.3, 1.16.4, 1.16.5, 1.16.6, 1.17.0, 1.17.1,
>> 1.17.2, 1.17.3, 1.17.4, 1.17.5, 1.18.0, 1.18.1, 1.18.2, 1.18.3, 1.18.4,
>> 1.18.5, 1.19.0, 1.19.1, 1.19.2, 1.19.3, 1.19.4, 1.19.5, 1.20.0, 1.20.1,
>> 1.20.2, 1.20.3, 1.21.0, 1.21.1, 1.22.0, 1.22.1, 1.22.2, 1.22.3, 1.22.4,
>> 1.23.0rc1, 1.23.0rc2, 1.23.0rc3, 1.23.0, 1.23.1, 1.23.2, 1.23.3, 1.23.4,
>> 1.23.5, 1.24.0rc1, 1.24.0rc2, 1.24.0, 1.24.1, 1.24.2, 1.24.3, 1.24.4,
>> 1.25.0rc1, 1.25.0, 1.25.1, 1.25.2, 1.26.0b1, 1.26.0rc1, 1.26.0, 1.26.1,
>> 1.26.2)
>> ERROR: No matching distribution found for numpy==1.21.3
>> + cleanup
>> + '[' no = yes ']'
>> + echo 'Failed to verify release candidate. See
>>
>> /var/folders/1l/tg68jc6550gg8xqf1hr4mlwrgn/T/arrow-34.0.0.X.Oo74Ac7ank
>> for details.'
>> Failed to verify release candidate. See
>>
>> /var/folders/1l/tg68jc6550gg8xqf1hr4mlwrgn/T/arrow-34.0.0.X.Oo74Ac7ank
>> for details.
>> (venv) andrewlamb@Andrews-MacBook-Pro
>> :~/Downloads/apache-arrow-da

Re: [VOTE][RUST][DataFusion] Release DataFusion Python Bindings 34.0.0 RC1

2024-01-02 Thread Andy Grove
Hi Andrew,

It looks like the issue is that numpy 1.21.3 requires a different Python
version:

1.21.3 Requires-Python>=3.7,<3.11

I am guessing that you have a Python version that is not within that range?

I agree that this should not be a blocker.

Thanks,

Andy.



On Fri, Dec 29, 2023 at 4:18 AM Andrew Lamb  wrote:

> I had some trouble running the verification script --  got an error that a
> specific version of numpy was not available. I am running on an Apple M3
> Max.
>
> ERROR: No matching distribution found for numpy==1.21.3
>
> However that version does appear to be available:
> https://pypi.org/project/numpy/1.21.3/
>
> I was able to verify the release on a Ubuntu 22.04 / x86_64 machine so I
> don't think this is a release blocker
>
> Andrew
>
>
> Here are more details:
>
> $ ./dev/release/verify-release-candidate.sh 34.0.0 1
> + set -o pipefail
> +++ dirname ./dev/release/verify-release-candidate.sh
> ++ cd ./dev/release
> ++ pwd
> ...
> Successfully installed pip-23.3.2
> + python3 -m pip install -r requirements-310.txt
> Collecting attrs==21.2.0 (from -r requirements-310.txt (line 7))
>   Using cached attrs-21.2.0-py2.py3-none-any.whl (53 kB)
> Collecting black==21.9b0 (from -r requirements-310.txt (line 11))
>   Using cached black-21.9b0-py3-none-any.whl (148 kB)
> Collecting click==8.0.3 (from -r requirements-310.txt (line 15))
>   Using cached click-8.0.3-py3-none-any.whl (97 kB)
> Collecting flake8==4.0.1 (from -r requirements-310.txt (line 19))
>   Using cached flake8-4.0.1-py2.py3-none-any.whl (64 kB)
> Collecting iniconfig==1.1.1 (from -r requirements-310.txt (line 23))
>   Using cached iniconfig-1.1.1-py2.py3-none-any.whl (5.0 kB)
> Collecting isort==5.9.3 (from -r requirements-310.txt (line 27))
>   Using cached isort-5.9.3-py3-none-any.whl (106 kB)
> Collecting maturin==0.15.1 (from -r requirements-310.txt (line 31))
>   Using cached
>
> maturin-0.15.1-py3-none-macosx_10_9_x86_64.macosx_11_0_arm64.macosx_10_9_universal2.whl
> (14.5 MB)
> Collecting mccabe==0.6.1 (from -r requirements-310.txt (line 46))
>   Using cached mccabe-0.6.1-py2.py3-none-any.whl (8.6 kB)
> Collecting mypy==0.910 (from -r requirements-310.txt (line 50))
>   Using cached mypy-0.910-py3-none-any.whl (2.1 MB)
> Collecting mypy-extensions==0.4.3 (from -r requirements-310.txt (line 75))
>   Using cached mypy_extensions-0.4.3-py2.py3-none-any.whl (4.5 kB)
> ERROR: Ignored the following versions that require a different python
> version: 1.21.2 Requires-Python >=3.7,<3.11; 1.21.3 Requires-Python
> >=3.7,<3.11; 1.21.4 Requires-Python >=3.7,<3.11; 1.21.5 Requires-Python
> >=3.7,<3.11; 1.21.6 Requires-Python >=3.7,<3.11
> ERROR: Could not find a version that satisfies the requirement
> numpy==1.21.3 (from versions: 1.3.0, 1.4.1, 1.5.0, 1.5.1, 1.6.0, 1.6.1,
> 1.6.2, 1.7.0, 1.7.1, 1.7.2, 1.8.0, 1.8.1, 1.8.2, 1.9.0, 1.9.1, 1.9.2,
> 1.9.3, 1.10.0.post2, 1.10.1, 1.10.2, 1.10.4, 1.11.0, 1.11.1, 1.11.2,
> 1.11.3, 1.12.0, 1.12.1, 1.13.0, 1.13.1, 1.13.3, 1.14.0, 1.14.1, 1.14.2,
> 1.14.3, 1.14.4, 1.14.5, 1.14.6, 1.15.0, 1.15.1, 1.15.2, 1.15.3, 1.15.4,
> 1.16.0, 1.16.1, 1.16.2, 1.16.3, 1.16.4, 1.16.5, 1.16.6, 1.17.0, 1.17.1,
> 1.17.2, 1.17.3, 1.17.4, 1.17.5, 1.18.0, 1.18.1, 1.18.2, 1.18.3, 1.18.4,
> 1.18.5, 1.19.0, 1.19.1, 1.19.2, 1.19.3, 1.19.4, 1.19.5, 1.20.0, 1.20.1,
> 1.20.2, 1.20.3, 1.21.0, 1.21.1, 1.22.0, 1.22.1, 1.22.2, 1.22.3, 1.22.4,
> 1.23.0rc1, 1.23.0rc2, 1.23.0rc3, 1.23.0, 1.23.1, 1.23.2, 1.23.3, 1.23.4,
> 1.23.5, 1.24.0rc1, 1.24.0rc2, 1.24.0, 1.24.1, 1.24.2, 1.24.3, 1.24.4,
> 1.25.0rc1, 1.25.0, 1.25.1, 1.25.2, 1.26.0b1, 1.26.0rc1, 1.26.0, 1.26.1,
> 1.26.2)
> ERROR: No matching distribution found for numpy==1.21.3
> + cleanup
> + '[' no = yes ']'
> + echo 'Failed to verify release candidate. See
>
> /var/folders/1l/tg68jc6550gg8xqf1hr4mlwrgn/T/arrow-34.0.0.X.Oo74Ac7ank
> for details.'
> Failed to verify release candidate. See
>
> /var/folders/1l/tg68jc6550gg8xqf1hr4mlwrgn/T/arrow-34.0.0.X.Oo74Ac7ank
> for details.
> (venv) andrewlamb@Andrews-MacBook-Pro
> :~/Downloads/apache-arrow-datafusion-python-34.0.0$
>
>
> On Thu, Dec 28, 2023 at 9:18 PM vin jake  wrote:
>
> > +1 (binding) Verified on my M1 Mac.
> >
> > Thanks Andy!
> >
> > On Fri, Dec 29, 2023 at 5:42 AM Andy Grove 
> wrote:
> >
> > > Hi,
> > >
> > > I would like to propose a release of Apache Arrow DataFusion Python
> > > Bindings,
> > > version 34.0.0.
> > >
> > > This release candidate is based on commit:
> > > b22f82f3055941dc3599c9a18458a2de163ff4c0 [1]
> > > The proposed release tarball and signatures are hosted at [2].
> &

[VOTE][RUST][DataFusion] Release DataFusion Python Bindings 34.0.0 RC1

2023-12-28 Thread Andy Grove
Hi,

I would like to propose a release of Apache Arrow DataFusion Python
Bindings,
version 34.0.0.

This release candidate is based on commit:
b22f82f3055941dc3599c9a18458a2de163ff4c0 [1]
The proposed release tarball and signatures are hosted at [2].
The changelog is located at [3].
The Python wheels are located at [4].

Please download, verify checksums and signatures, run the unit tests, and
vote
on the release. The vote will be open for at least 72 hours.

Only votes from PMC members are binding, but all members of the community
are
encouraged to test the release and vote with "(non-binding)".

The standard verification procedure is documented at
https://github.com/apache/arrow-datafusion-python/blob/main/dev/release/README.md#verifying-release-candidates
.

[ ] +1 Release this as Apache Arrow DataFusion Python 34.0.0
[ ] +0
[ ] -1 Do not release this as Apache Arrow DataFusion Python 34.0.0
because...

Here is my vote:

+1

[1]:
https://github.com/apache/arrow-datafusion-python/tree/b22f82f3055941dc3599c9a18458a2de163ff4c0
[2]:
https://dist.apache.org/repos/dist/dev/arrow/apache-arrow-datafusion-python-34.0.0-rc1
[3]:
https://github.com/apache/arrow-datafusion-python/blob/b22f82f3055941dc3599c9a18458a2de163ff4c0/CHANGELOG.md
[4]: https://test.pypi.org/project/datafusion/34.0.0/


Re: [DISCUSS][RFC] Draft Proposal for new Top Level Project for DataFusion

2023-12-27 Thread Andy Grove
Thank you for creating the draft proposal, Andrew. I have reviewed this and
I think it looks great.

Andy.

On Wed, Dec 27, 2023 at 3:19 PM Andrew Lamb  wrote:

> I have created a draft proposal [1] to break DataFusion out to its own top
> level project. Please provide your feedback and suggestions.
>
> The proposal is included at the end of this email and in this Google Doc:
>
> https://docs.google.com/document/d/11WTNYS8KWScOt3ySTX39WVS6krPhUvHsuJRY9PZQx4g
> .
>
> Feel free to respond to this email or comment / make suggestions directly
> on the document.
>
> I would be especially grateful if people could review and comment on the
> proposed list of committers and PMC members.
>
> I hope everyone is not getting sick of hearing about this, but I think in
> this case it is better to over communicate than risk surprises.
>
> Andrew
>
> [1] https://github.com/apache/arrow-datafusion/issues/8491
>
>
> --
>
> DataFusion Top Level Project Proposal
> Dec 27, 2023
>
> [Editor’s note: This document is based on the proposal to the ASF board to
> create the Arrow project. One it is been reviewed, we plan to send it to
> the ASF board sometime in January or February 2024 for their consideration]
>
> To: The ASF (bo...@apache.org)
>
> Summary:
>
> We propose creating a new top level project, Apache DataFusion, from an
> existing sub project of Apache Arrow to facilitate additional community and
> project growth.
>
> 
> Apache DataFusion for Apache Top Level Project
>
> Abstract
>
> Apache Arrow DataFusion[1]  is a very fast, extensible query engine for
> building high-quality data-centric systems in Rust, using the Apache Arrow
> in-memory format. DataFusion offers SQL and Dataframe APIs, excellent
> performance, built-in support for CSV, Parquet, JSON, and Avro, extensive
> customization, and a great community.
>
> [1] https://arrow.apache.org/datafusion/
>
>
> Proposal
>
> We propose creating a new top level ASF project, Apache DataFusion,
> governed initially by a subset of the Arrow project’s PMC and committers.
> The project’s code is in four existing git repositories, currently governed
> by Apache Arrow which would transfer to the new top level project.
>
> Background
>
> When DataFusion was initially donated to the Arrow project, it did not have
> a strong enough community to stand on its own. It has since grown
> significantly, and benefited immensely from being part of Arrow and
> nurturing of the Apache Way, and now has a community strong enough to stand
> on its own and that would benefit from focused governance attention.
>
> The community has discussed this idea publicly for more than 6 months
> https://github.com/apache/arrow-datafusion/discussions/6475  and briefly
> on
> the Arrow PMC mailing list
> https://lists.apache.org/thread/thv2jdm6640l6gm88hy8jhk5prjww0cs. As of
> the
> time of this writing both had exclusively positive reactions.
>
> Several current members of the Arrow PMC are both active contributors to
> DataFusion and understand and believe deeply in the Apache Way, and play
> active governance roles in the Arrow project as PMC members and PMC chairs,
> guiding the community, and releasing software versions. With this existing
> governance experience and structure, the new top level project will be able
> to function well immediately and independently.
>
> Overview of DataFusion
>
> Current Status
>
> Meritocracy
>
> DataFusion has been developed as part of Apache Arrow and thus has been
> operating as a meritocracy. Many of the developers of DataFusion are Arrow
> PMC members or committers. The DataFusion project plans to continue adding
> new PMC and committers as the project matures and grows.
>
> Community
>
> The DataFusion development team seeks to foster the development and user
> communities. We hope that becoming a separate project will help both Arrow
> and DataFusion communities by being more focused.  Focused governance will
> make it easier to grow the community of committers and PMC members and make
> the organization more clear to others.
>
> Alignment
>
> The ASF is a natural host for DataFusion given that it is already the home
> of Arrow, Parquet, and other related distributed system, storage and query
> execution systems.
>
> Project Leadership
>
> Proposed Initial PMC
>
> We propose the following people as the initial DataFusion PMC members. This
> is a subset of the existing Arrow PMC members who contribute to DataFusion
> https://people.apache.org/phonebook.html?unix=arrow
>
> Andy Grove (agrove):  Arrow PMC Chair
> Andrew Lamb (alamb): Arrow PMC, past Arrow PMC Chair
> Daniël Heres (dhere

Re: [DISUCSS] [DATAFUSION] Repositories

2023-12-21 Thread Andy Grove
Hi Andrew,

There is also https://github.com/apache/arrow-ballista-python. It is
currently unmaintained but would be good to move it along with the main
Ballista repo.

Thanks,

Andy.

On Thu, Dec 21, 2023 at 6:33 AM Andrew Lamb  wrote:

> This is another chain related to the previous discussion [1], about
> planning to propose [2]
> "graduating" the DataFusion to its own top level Apache project.
>
> In this chain, I would like to discuss which of the current arrow
> repositories[3] to propose moving to the new PMC
>
> I propose the following three repositories move in their entirety
>
> https://github.com/apache/arrow-datafusion
> https://github.com/apache/arrow-ballista
> https://github.com/apache/arrow-datafusion-python
>
> I do now know of any code in any other arrow repositories that would be
> appropriate to move to a new Apache Project.
>
> Please let me know your thoughts,
> Andrew
>
> [1]: https://github.com/apache/arrow-datafusion/discussions/6475
> [2]: https://github.com/apache/arrow-datafusion/issues/8491
> [3]:
>
> https://github.com/orgs/apache/repositories?language==arrow==all
>


Re: [DISCUSS] [DATAFUSION] PMC for new DataFusion top level project

2023-12-20 Thread Andy Grove
This list LGTM.

On Wed, Dec 20, 2023 at 1:11 PM Andrew Lamb  wrote:

> Hello,
>
> As we have discussed previously [1], we are planning to propose [2]
> "graduating" the DataFusion to its own top level Apache project.
>
> I would like to discuss the initial PMC members for the new top level
> project.  The suggestion in [1] is
>
> > All existing Arrow Committers and PMC members who so desired, would start
> as committers or PMC members on the new DataFusion project (assuming this
> is allowed by the process)
>
> From what I can tell, this means the PMC would be the following from the
> current Arrow PMC [3]:
>
> Andy Grove (NVidia)
> Andrew Lamb (InfuxData)
> Daniël Heres (Coralogix)
> Jie Wen (SelectDB)
> Kun Liu (Ebay)
> Liang-Chi Hsieh (Apple)
> Qingping Hou (Scribd)
> Will Jones (VoltronData)
>
> I think Raphael Taylor-Davies has told me offline he would prefer to focus
> on Arrow and thus now join the DataFusion PMC, though it would be nice if
> he could confirm.
>
> We also need to propose a chair of the new PMC -- I am happy to help anyone
> who would like to do this role, or do it myself. Spending a year as the
> Arrow PMC chair gave me sufficient experience to make sure the process is
> smooth, in my opinion.
>
> If  the new project is approved and created then the initial PMC will
> invite the relevant existing Arrow commiters as DataFusion committers.
>
> Please let me know your thoughts and if there are other existing Arrow PMC
> members who should be included in the proposal for initial DataFusion PMC.
>
> Andrew
>
> p.s. As part of this process, I discovered Arrow's origin as a top level
> project came from splitting off from the Apache Drill project, which I had
> not previously known
>
>
> [1]: https://github.com/apache/arrow-datafusion/discussions/6475
> [2]: https://github.com/apache/arrow-datafusion/issues/8491
> [3]: https://arrow.apache.org/committers/
>


[RESULT][VOTE][RUST][DataFusion] Release Apache Arrow DataFusion 34.0.0 RC3

2023-12-17 Thread Andy Grove
On Sun, Dec 17, 2023 at 2:03 PM Andy Grove  wrote:

> The vote passes with 6 +1 votes (4 binding). Thank you to everyone who
> helped verify the release.
>
> Andy.
>
> On Fri, Dec 15, 2023 at 4:40 PM Will Jones 
> wrote:
>
>> +1 (binding). Verified on x86_64 Ubuntu 22.04.
>>
>> On Fri, Dec 15, 2023 at 1:13 AM Jean-Baptiste Onofré 
>> wrote:
>>
>> > +1 (non binding)
>> >
>> > Regards
>> > JB
>> >
>> > On Thu, Dec 14, 2023 at 9:52 PM Andy Grove 
>> wrote:
>> > >
>> > > Hi,
>> > >
>> > > I would like to propose a release of Apache Arrow DataFusion
>> > Implementation,
>> > > version 34.0.0.
>> > >
>> > > *Please note this is RC3 - I ran into some local testing issues with
>> RC2*
>> > >
>> > > This release candidate is based on commit:
>> > > 26933842e48d69f510f9461a1f2c87af587d5986 [1]
>> > > The proposed release tarball and signatures are hosted at [2].
>> > > The changelog is located at [3].
>> > >
>> > > Please download, verify checksums and signatures, run the unit tests,
>> and
>> > > vote
>> > > on the release. The vote will be open for at least 72 hours.
>> > >
>> > > Only votes from PMC members are binding, but all members of the
>> community
>> > > are
>> > > encouraged to test the release and vote with "(non-binding)".
>> > >
>> > > The standard verification procedure is documented at
>> > >
>> >
>> https://github.com/apache/arrow-datafusion/blob/main/dev/release/README.md#verifying-release-candidates
>> > > .
>> > >
>> > > [ ] +1 Release this as Apache Arrow DataFusion 34.0.0
>> > > [ ] +0
>> > > [ ] -1 Do not release this as Apache Arrow DataFusion 34.0.0
>> because...
>> > >
>> > > Here is my vote:
>> > >
>> > > +1
>> > >
>> > > [1]:
>> > >
>> >
>> https://github.com/apache/arrow-datafusion/tree/26933842e48d69f510f9461a1f2c87af587d5986
>> > > [2]:
>> > >
>> >
>> https://dist.apache.org/repos/dist/dev/arrow/apache-arrow-datafusion-34.0.0-rc3
>> > > [3]:
>> > >
>> >
>> https://github.com/apache/arrow-datafusion/blob/26933842e48d69f510f9461a1f2c87af587d5986/CHANGELOG.md
>> >
>>
>


Re: [VOTE][RUST][DataFusion] Release Apache Arrow DataFusion 34.0.0 RC3

2023-12-17 Thread Andy Grove
The vote passes with 6 +1 votes (4 binding). Thank you to everyone who
helped verify the release.

Andy.

On Fri, Dec 15, 2023 at 4:40 PM Will Jones  wrote:

> +1 (binding). Verified on x86_64 Ubuntu 22.04.
>
> On Fri, Dec 15, 2023 at 1:13 AM Jean-Baptiste Onofré 
> wrote:
>
> > +1 (non binding)
> >
> > Regards
> > JB
> >
> > On Thu, Dec 14, 2023 at 9:52 PM Andy Grove 
> wrote:
> > >
> > > Hi,
> > >
> > > I would like to propose a release of Apache Arrow DataFusion
> > Implementation,
> > > version 34.0.0.
> > >
> > > *Please note this is RC3 - I ran into some local testing issues with
> RC2*
> > >
> > > This release candidate is based on commit:
> > > 26933842e48d69f510f9461a1f2c87af587d5986 [1]
> > > The proposed release tarball and signatures are hosted at [2].
> > > The changelog is located at [3].
> > >
> > > Please download, verify checksums and signatures, run the unit tests,
> and
> > > vote
> > > on the release. The vote will be open for at least 72 hours.
> > >
> > > Only votes from PMC members are binding, but all members of the
> community
> > > are
> > > encouraged to test the release and vote with "(non-binding)".
> > >
> > > The standard verification procedure is documented at
> > >
> >
> https://github.com/apache/arrow-datafusion/blob/main/dev/release/README.md#verifying-release-candidates
> > > .
> > >
> > > [ ] +1 Release this as Apache Arrow DataFusion 34.0.0
> > > [ ] +0
> > > [ ] -1 Do not release this as Apache Arrow DataFusion 34.0.0 because...
> > >
> > > Here is my vote:
> > >
> > > +1
> > >
> > > [1]:
> > >
> >
> https://github.com/apache/arrow-datafusion/tree/26933842e48d69f510f9461a1f2c87af587d5986
> > > [2]:
> > >
> >
> https://dist.apache.org/repos/dist/dev/arrow/apache-arrow-datafusion-34.0.0-rc3
> > > [3]:
> > >
> >
> https://github.com/apache/arrow-datafusion/blob/26933842e48d69f510f9461a1f2c87af587d5986/CHANGELOG.md
> >
>


[VOTE][RUST][DataFusion] Release Apache Arrow DataFusion 34.0.0 RC3

2023-12-14 Thread Andy Grove
Hi,

I would like to propose a release of Apache Arrow DataFusion Implementation,
version 34.0.0.

*Please note this is RC3 - I ran into some local testing issues with RC2*

This release candidate is based on commit:
26933842e48d69f510f9461a1f2c87af587d5986 [1]
The proposed release tarball and signatures are hosted at [2].
The changelog is located at [3].

Please download, verify checksums and signatures, run the unit tests, and
vote
on the release. The vote will be open for at least 72 hours.

Only votes from PMC members are binding, but all members of the community
are
encouraged to test the release and vote with "(non-binding)".

The standard verification procedure is documented at
https://github.com/apache/arrow-datafusion/blob/main/dev/release/README.md#verifying-release-candidates
.

[ ] +1 Release this as Apache Arrow DataFusion 34.0.0
[ ] +0
[ ] -1 Do not release this as Apache Arrow DataFusion 34.0.0 because...

Here is my vote:

+1

[1]:
https://github.com/apache/arrow-datafusion/tree/26933842e48d69f510f9461a1f2c87af587d5986
[2]:
https://dist.apache.org/repos/dist/dev/arrow/apache-arrow-datafusion-34.0.0-rc3
[3]:
https://github.com/apache/arrow-datafusion/blob/26933842e48d69f510f9461a1f2c87af587d5986/CHANGELOG.md


Re: [VOTE][RUST][DataFusion] Release Apache Arrow DataFusion 34.0.0 RC1

2023-12-13 Thread Andy Grove
Thanks, Andrew.

That does seem quite serious. I agree it would be better to hold off on the
release until this is resolved.

On Wed, Dec 13, 2023 at 1:09 PM Andrew Lamb  wrote:

> I have found a regression [1] that I think is fairly serious and should be
> fixed prior to a release.
>
> If others think we should release anyways I can plan to make a patch
> release shortly with the fix
>
>
> [1] https://github.com/apache/arrow-datafusion/issues/8532
>
> On Wed, Dec 13, 2023 at 2:36 AM Kun Liu  wrote:
>
> > +1 (binding)
> >
> > verified on the M2 mac
> >
> > Thanks
> >
> > Wayne Xia  于2023年12月13日周三 14:27写道:
> >
> > > +1 (non-binding)
> > >
> > > Verified on AMD64 Linux
> > >
> > > Thanks Andy
> > >
> > > On Wed, Dec 13, 2023 at 10:05 AM vin jake 
> wrote:
> > >
> > > > +1 (binding)
> > > >
> > > > Verified on Mac M1
> > > >
> > > > Thanks Andy
> > > >
> > > > Andy Grove  于 2023年12月12日周二 下午10:17写道:
> > > >
> > > > > Hi,
> > > > >
> > > > > I would like to propose a release of Apache Arrow DataFusion
> > > > > Implementation,
> > > > > version 34.0.0.
> > > > >
> > > > > This release candidate is based on commit:
> > > > > 1a02d1456878dcd44159ebaf33e24c28f471aa14 [1]
> > > > > The proposed release tarball and signatures are hosted at [2].
> > > > > The changelog is located at [3].
> > > > >
> > > > > Please download, verify checksums and signatures, run the unit
> tests,
> > > and
> > > > > vote
> > > > > on the release. The vote will be open for at least 72 hours.
> > > > >
> > > > > Only votes from PMC members are binding, but all members of the
> > > community
> > > > > are
> > > > > encouraged to test the release and vote with "(non-binding)".
> > > > >
> > > > > The standard verification procedure is documented at
> > > > >
> > > > >
> > > >
> > >
> >
> https://github.com/apache/arrow-datafusion/blob/main/dev/release/README.md#verifying-release-candidates
> > > > > .
> > > > >
> > > > > [ ] +1 Release this as Apache Arrow DataFusion 34.0.0
> > > > > [ ] +0
> > > > > [ ] -1 Do not release this as Apache Arrow DataFusion 34.0.0
> > because...
> > > > >
> > > > > Here is my vote:
> > > > >
> > > > > +1
> > > > >
> > > > > [1]:
> > > > >
> > > > >
> > > >
> > >
> >
> https://github.com/apache/arrow-datafusion/tree/1a02d1456878dcd44159ebaf33e24c28f471aa14
> > > > > [2]:
> > > > >
> > > > >
> > > >
> > >
> >
> https://dist.apache.org/repos/dist/dev/arrow/apache-arrow-datafusion-34.0.0-rc1
> > > > > [3]:
> > > > >
> > > > >
> > > >
> > >
> >
> https://github.com/apache/arrow-datafusion/blob/1a02d1456878dcd44159ebaf33e24c28f471aa14/CHANGELOG.md
> > > > >
> > > >
> > >
> >
>


[VOTE][RUST][DataFusion] Release Apache Arrow DataFusion 34.0.0 RC1

2023-12-12 Thread Andy Grove
Hi,

I would like to propose a release of Apache Arrow DataFusion Implementation,
version 34.0.0.

This release candidate is based on commit:
1a02d1456878dcd44159ebaf33e24c28f471aa14 [1]
The proposed release tarball and signatures are hosted at [2].
The changelog is located at [3].

Please download, verify checksums and signatures, run the unit tests, and
vote
on the release. The vote will be open for at least 72 hours.

Only votes from PMC members are binding, but all members of the community
are
encouraged to test the release and vote with "(non-binding)".

The standard verification procedure is documented at
https://github.com/apache/arrow-datafusion/blob/main/dev/release/README.md#verifying-release-candidates
.

[ ] +1 Release this as Apache Arrow DataFusion 34.0.0
[ ] +0
[ ] -1 Do not release this as Apache Arrow DataFusion 34.0.0 because...

Here is my vote:

+1

[1]:
https://github.com/apache/arrow-datafusion/tree/1a02d1456878dcd44159ebaf33e24c28f471aa14
[2]:
https://dist.apache.org/repos/dist/dev/arrow/apache-arrow-datafusion-34.0.0-rc1
[3]:
https://github.com/apache/arrow-datafusion/blob/1a02d1456878dcd44159ebaf33e24c28f471aa14/CHANGELOG.md


Re: [ANNOUNCE] New Arrow PMC chair: Andy Grove

2023-11-28 Thread Andy Grove
Thank you all. I look forward to helping the project in this new role over
the next year.

Thanks to Andrew for handling this for the past year.

On Tue, Nov 28, 2023 at 3:20 AM Li Jin  wrote:

>  Congrats Andy!
>
> On Tue, Nov 28, 2023 at 3:25 PM Weston Pace  wrote:
>
> > Congrats Andy!
> >
> > On Mon, Nov 27, 2023, 7:31 PM wish maple  wrote:
> >
> > > Congrats Andy!
> > >
> > > Best,
> > > Xuwei Fu
> > >
> > > Andrew Lamb  于2023年11月27日周一 20:47写道:
> > >
> > > > I am pleased to announce that the Arrow Project has a new PMC chair
> and
> > > VP
> > > > as per our tradition of rotating the chair once a year. I have
> resigned
> > > and
> > > > Andy Grove was duly elected by the PMC and approved unanimously by
> the
> > > > board.
> > > >
> > > > Please join me in congratulating Andy Grove!
> > > >
> > > > Thanks,
> > > > Andrew
> > > >
> > >
> >
>


[RESULT][VOTE][RUST][DataFusion] Release DataFusion Python Bindings 33.0.0 RC1

2023-11-19 Thread Andy Grove
On Sun, Nov 19, 2023 at 12:45 PM Andy Grove  wrote:

> The vote passes with four +1 votes (three binding).
>
> Thanks for verifying the release. I really appreciate it.
>
> Release:
>
>
> https://dist.apache.org/repos/dist/release/arrow/arrow-datafusion-python-33.0.0/
> https://pypi.org/project/datafusion/33.0.0/
>
>
> On Sun, Nov 19, 2023 at 9:27 AM Daniël Heres 
> wrote:
>
>> +1 (binding)
>>
>> Verified on Mac (M1).
>> Thanks Andy!
>>
>> Op zo 19 nov 2023 om 15:05 schreef Jeremy Dyer :
>>
>> > Not PMC but +1 (non-binding) from me. Validated and verified release on
>> > ubuntu 22 x86 machine.
>> >
>> > Thanks,
>> > Jeremy Dyer
>> >
>> > On Sun, Nov 19, 2023 at 8:55 AM Andy Grove 
>> wrote:
>> >
>> > > We need one more PMC vote. If anyone has time to verify the release,
>> it
>> > > would be appreciated.
>> > >
>> > > On Thu, Nov 16, 2023 at 1:15 PM L. C. Hsieh  wrote:
>> > >
>> > > > +1 (binding)
>> > > >
>> > > > Verified on Intel Mac.
>> > > >
>> > > > Thanks Andy.
>> > > >
>> > > > On Thu, Nov 16, 2023 at 11:12 AM Andy Grove 
>> > > wrote:
>> > > > >
>> > > > > Hi,
>> > > > >
>> > > > > I would like to propose a release of Apache Arrow DataFusion
>> Python
>> > > > > Bindings,
>> > > > > version 33.0.0.
>> > > > >
>> > > > > This release candidate is based on commit:
>> > > > > d1a7505a72400d8f69b63dbad6123eccaef58366 [1]
>> > > > > The proposed release tarball and signatures are hosted at [2].
>> > > > > The changelog is located at [3].
>> > > > > The Python wheels are located at [4].
>> > > > >
>> > > > > Please download, verify checksums and signatures, run the unit
>> tests,
>> > > and
>> > > > > vote
>> > > > > on the release. The vote will be open for at least 72 hours.
>> > > > >
>> > > > > Only votes from PMC members are binding, but all members of the
>> > > community
>> > > > > are
>> > > > > encouraged to test the release and vote with "(non-binding)".
>> > > > >
>> > > > > The standard verification procedure is documented at
>> > > > >
>> > > >
>> > >
>> >
>> https://github.com/apache/arrow-datafusion-python/blob/main/dev/release/README.md#verifying-release-candidates
>> > > > > .
>> > > > >
>> > > > > [ ] +1 Release this as Apache Arrow DataFusion Python 33.0.0
>> > > > > [ ] +0
>> > > > > [ ] -1 Do not release this as Apache Arrow DataFusion Python
>> 33.0.0
>> > > > > because...
>> > > > >
>> > > > > Here is my vote:
>> > > > >
>> > > > > +1
>> > > > >
>> > > > > [1]:
>> > > > >
>> > > >
>> > >
>> >
>> https://github.com/apache/arrow-datafusion-python/tree/d1a7505a72400d8f69b63dbad6123eccaef58366
>> > > > > [2]:
>> > > > >
>> > > >
>> > >
>> >
>> https://dist.apache.org/repos/dist/dev/arrow/apache-arrow-datafusion-python-33.0.0-rc1
>> > > > > [3]:
>> > > > >
>> > > >
>> > >
>> >
>> https://github.com/apache/arrow-datafusion-python/blob/d1a7505a72400d8f69b63dbad6123eccaef58366/CHANGELOG.md
>> > > > > [4]: https://test.pypi.org/project/datafusion/33.0.0/
>> > > >
>> > >
>> >
>>
>>
>> --
>> Daniël Heres
>>
>


Re: [VOTE][RUST][DataFusion] Release DataFusion Python Bindings 33.0.0 RC1

2023-11-19 Thread Andy Grove
The vote passes with four +1 votes (three binding).

Thanks for verifying the release. I really appreciate it.

Release:

https://dist.apache.org/repos/dist/release/arrow/arrow-datafusion-python-33.0.0/
https://pypi.org/project/datafusion/33.0.0/


On Sun, Nov 19, 2023 at 9:27 AM Daniël Heres  wrote:

> +1 (binding)
>
> Verified on Mac (M1).
> Thanks Andy!
>
> Op zo 19 nov 2023 om 15:05 schreef Jeremy Dyer :
>
> > Not PMC but +1 (non-binding) from me. Validated and verified release on
> > ubuntu 22 x86 machine.
> >
> > Thanks,
> > Jeremy Dyer
> >
> > On Sun, Nov 19, 2023 at 8:55 AM Andy Grove 
> wrote:
> >
> > > We need one more PMC vote. If anyone has time to verify the release, it
> > > would be appreciated.
> > >
> > > On Thu, Nov 16, 2023 at 1:15 PM L. C. Hsieh  wrote:
> > >
> > > > +1 (binding)
> > > >
> > > > Verified on Intel Mac.
> > > >
> > > > Thanks Andy.
> > > >
> > > > On Thu, Nov 16, 2023 at 11:12 AM Andy Grove 
> > > wrote:
> > > > >
> > > > > Hi,
> > > > >
> > > > > I would like to propose a release of Apache Arrow DataFusion Python
> > > > > Bindings,
> > > > > version 33.0.0.
> > > > >
> > > > > This release candidate is based on commit:
> > > > > d1a7505a72400d8f69b63dbad6123eccaef58366 [1]
> > > > > The proposed release tarball and signatures are hosted at [2].
> > > > > The changelog is located at [3].
> > > > > The Python wheels are located at [4].
> > > > >
> > > > > Please download, verify checksums and signatures, run the unit
> tests,
> > > and
> > > > > vote
> > > > > on the release. The vote will be open for at least 72 hours.
> > > > >
> > > > > Only votes from PMC members are binding, but all members of the
> > > community
> > > > > are
> > > > > encouraged to test the release and vote with "(non-binding)".
> > > > >
> > > > > The standard verification procedure is documented at
> > > > >
> > > >
> > >
> >
> https://github.com/apache/arrow-datafusion-python/blob/main/dev/release/README.md#verifying-release-candidates
> > > > > .
> > > > >
> > > > > [ ] +1 Release this as Apache Arrow DataFusion Python 33.0.0
> > > > > [ ] +0
> > > > > [ ] -1 Do not release this as Apache Arrow DataFusion Python 33.0.0
> > > > > because...
> > > > >
> > > > > Here is my vote:
> > > > >
> > > > > +1
> > > > >
> > > > > [1]:
> > > > >
> > > >
> > >
> >
> https://github.com/apache/arrow-datafusion-python/tree/d1a7505a72400d8f69b63dbad6123eccaef58366
> > > > > [2]:
> > > > >
> > > >
> > >
> >
> https://dist.apache.org/repos/dist/dev/arrow/apache-arrow-datafusion-python-33.0.0-rc1
> > > > > [3]:
> > > > >
> > > >
> > >
> >
> https://github.com/apache/arrow-datafusion-python/blob/d1a7505a72400d8f69b63dbad6123eccaef58366/CHANGELOG.md
> > > > > [4]: https://test.pypi.org/project/datafusion/33.0.0/
> > > >
> > >
> >
>
>
> --
> Daniël Heres
>


Re: [VOTE][RUST][DataFusion] Release DataFusion Python Bindings 33.0.0 RC1

2023-11-19 Thread Andy Grove
We need one more PMC vote. If anyone has time to verify the release, it
would be appreciated.

On Thu, Nov 16, 2023 at 1:15 PM L. C. Hsieh  wrote:

> +1 (binding)
>
> Verified on Intel Mac.
>
> Thanks Andy.
>
> On Thu, Nov 16, 2023 at 11:12 AM Andy Grove  wrote:
> >
> > Hi,
> >
> > I would like to propose a release of Apache Arrow DataFusion Python
> > Bindings,
> > version 33.0.0.
> >
> > This release candidate is based on commit:
> > d1a7505a72400d8f69b63dbad6123eccaef58366 [1]
> > The proposed release tarball and signatures are hosted at [2].
> > The changelog is located at [3].
> > The Python wheels are located at [4].
> >
> > Please download, verify checksums and signatures, run the unit tests, and
> > vote
> > on the release. The vote will be open for at least 72 hours.
> >
> > Only votes from PMC members are binding, but all members of the community
> > are
> > encouraged to test the release and vote with "(non-binding)".
> >
> > The standard verification procedure is documented at
> >
> https://github.com/apache/arrow-datafusion-python/blob/main/dev/release/README.md#verifying-release-candidates
> > .
> >
> > [ ] +1 Release this as Apache Arrow DataFusion Python 33.0.0
> > [ ] +0
> > [ ] -1 Do not release this as Apache Arrow DataFusion Python 33.0.0
> > because...
> >
> > Here is my vote:
> >
> > +1
> >
> > [1]:
> >
> https://github.com/apache/arrow-datafusion-python/tree/d1a7505a72400d8f69b63dbad6123eccaef58366
> > [2]:
> >
> https://dist.apache.org/repos/dist/dev/arrow/apache-arrow-datafusion-python-33.0.0-rc1
> > [3]:
> >
> https://github.com/apache/arrow-datafusion-python/blob/d1a7505a72400d8f69b63dbad6123eccaef58366/CHANGELOG.md
> > [4]: https://test.pypi.org/project/datafusion/33.0.0/
>


[VOTE][RUST][DataFusion] Release DataFusion Python Bindings 33.0.0 RC1

2023-11-16 Thread Andy Grove
Hi,

I would like to propose a release of Apache Arrow DataFusion Python
Bindings,
version 33.0.0.

This release candidate is based on commit:
d1a7505a72400d8f69b63dbad6123eccaef58366 [1]
The proposed release tarball and signatures are hosted at [2].
The changelog is located at [3].
The Python wheels are located at [4].

Please download, verify checksums and signatures, run the unit tests, and
vote
on the release. The vote will be open for at least 72 hours.

Only votes from PMC members are binding, but all members of the community
are
encouraged to test the release and vote with "(non-binding)".

The standard verification procedure is documented at
https://github.com/apache/arrow-datafusion-python/blob/main/dev/release/README.md#verifying-release-candidates
.

[ ] +1 Release this as Apache Arrow DataFusion Python 33.0.0
[ ] +0
[ ] -1 Do not release this as Apache Arrow DataFusion Python 33.0.0
because...

Here is my vote:

+1

[1]:
https://github.com/apache/arrow-datafusion-python/tree/d1a7505a72400d8f69b63dbad6123eccaef58366
[2]:
https://dist.apache.org/repos/dist/dev/arrow/apache-arrow-datafusion-python-33.0.0-rc1
[3]:
https://github.com/apache/arrow-datafusion-python/blob/d1a7505a72400d8f69b63dbad6123eccaef58366/CHANGELOG.md
[4]: https://test.pypi.org/project/datafusion/33.0.0/


Re: [RESULT][VOTE][RUST][DataFusion] Release Apache Arrow DataFusion 33.0.0 RC2

2023-11-16 Thread Andy Grove
Thanks for catching that. It is released now.

On Thu, Nov 16, 2023 at 10:26 AM Daniël Heres  wrote:

> Thank you Andy for the release.
>
> When trying to update Ballista I saw datafusion-cli
> https://crates.io/crates/datafusion-cli version 33 is not yet available.
>
> Kind regards,
>
> Daniël
>
> Op do 16 nov 2023 om 18:05 schreef Andy Grove :
>
> > On Thu, Nov 16, 2023 at 10:02 AM Andy Grove 
> wrote:
> >
> > > The vote passes with six +1 votes (four binding). Thanks, everyone. The
> > > release is now available on crates.io and at
> > >
> >
> https://dist.apache.org/repos/dist/release/arrow/arrow-datafusion-33.0.0/
> > >
> > > On Tue, Nov 14, 2023 at 8:30 AM Chao Sun  wrote:
> > >
> > >> +1 (non-binding). Verified on M1 Mac.
> > >>
> > >> Thanks Andy.
> > >>
> > >> On Mon, Nov 13, 2023 at 11:16 PM Wayne Xia 
> > wrote:
> > >> >
> > >> > +1 (non-binding)
> > >> >
> > >> > Verified on amd64 linux
> > >> >
> > >> > Thanks Andy
> > >> >
> > >> > On Tue, Nov 14, 2023 at 2:25 PM vin jake 
> > wrote:
> > >> >
> > >> > > +1 (binding)
> > >> > >
> > >> > > Verified on M1 macbook
> > >> > >
> > >> > > Thanks Andy
> > >> > >
> > >> > > On Mon, Nov 13, 2023 at 11:39 PM Andy Grove <
> andygrov...@gmail.com>
> > >> wrote:
> > >> > >
> > >> > > > Hi,
> > >> > > >
> > >> > > > I would like to propose a release of Apache Arrow DataFusion
> > >> > > > Implementation,
> > >> > > > version 33.0.0.
> > >> > > >
> > >> > > > This release candidate is based on commit:
> > >> > > > d2efaa965989278fc86291be5048c4b460ed82c7 [1]
> > >> > > > The proposed release tarball and signatures are hosted at [2].
> > >> > > > The changelog is located at [3].
> > >> > > >
> > >> > > > Please download, verify checksums and signatures, run the unit
> > >> tests, and
> > >> > > > vote
> > >> > > > on the release. The vote will be open for at least 72 hours.
> > >> > > >
> > >> > > > Only votes from PMC members are binding, but all members of the
> > >> community
> > >> > > > are
> > >> > > > encouraged to test the release and vote with "(non-binding)".
> > >> > > >
> > >> > > > The standard verification procedure is documented at
> > >> > > >
> > >> > > >
> > >> > >
> > >>
> >
> https://github.com/apache/arrow-datafusion/blob/main/dev/release/README.md#verifying-release-candidates
> > >> > > > .
> > >> > > >
> > >> > > > [ ] +1 Release this as Apache Arrow DataFusion 33.0.0
> > >> > > > [ ] +0
> > >> > > > [ ] -1 Do not release this as Apache Arrow DataFusion 33.0.0
> > >> because...
> > >> > > >
> > >> > > > Here is my vote:
> > >> > > >
> > >> > > > +1
> > >> > > >
> > >> > > > [1]:
> > >> > > >
> > >> > > >
> > >> > >
> > >>
> >
> https://github.com/apache/arrow-datafusion/tree/d2efaa965989278fc86291be5048c4b460ed82c7
> > >> > > > [2]:
> > >> > > >
> > >> > > >
> > >> > >
> > >>
> >
> https://dist.apache.org/repos/dist/dev/arrow/apache-arrow-datafusion-33.0.0-rc2
> > >> > > > [3]:
> > >> > > >
> > >> > > >
> > >> > >
> > >>
> >
> https://github.com/apache/arrow-datafusion/blob/d2efaa965989278fc86291be5048c4b460ed82c7/CHANGELOG.md
> > >> > > >
> > >> > >
> > >>
> > >
> >
>
>
> --
> Daniël Heres
>


[RESULT][VOTE][RUST][DataFusion] Release Apache Arrow DataFusion 33.0.0 RC2

2023-11-16 Thread Andy Grove
On Thu, Nov 16, 2023 at 10:02 AM Andy Grove  wrote:

> The vote passes with six +1 votes (four binding). Thanks, everyone. The
> release is now available on crates.io and at
> https://dist.apache.org/repos/dist/release/arrow/arrow-datafusion-33.0.0/
>
> On Tue, Nov 14, 2023 at 8:30 AM Chao Sun  wrote:
>
>> +1 (non-binding). Verified on M1 Mac.
>>
>> Thanks Andy.
>>
>> On Mon, Nov 13, 2023 at 11:16 PM Wayne Xia  wrote:
>> >
>> > +1 (non-binding)
>> >
>> > Verified on amd64 linux
>> >
>> > Thanks Andy
>> >
>> > On Tue, Nov 14, 2023 at 2:25 PM vin jake  wrote:
>> >
>> > > +1 (binding)
>> > >
>> > > Verified on M1 macbook
>> > >
>> > > Thanks Andy
>> > >
>> > > On Mon, Nov 13, 2023 at 11:39 PM Andy Grove 
>> wrote:
>> > >
>> > > > Hi,
>> > > >
>> > > > I would like to propose a release of Apache Arrow DataFusion
>> > > > Implementation,
>> > > > version 33.0.0.
>> > > >
>> > > > This release candidate is based on commit:
>> > > > d2efaa965989278fc86291be5048c4b460ed82c7 [1]
>> > > > The proposed release tarball and signatures are hosted at [2].
>> > > > The changelog is located at [3].
>> > > >
>> > > > Please download, verify checksums and signatures, run the unit
>> tests, and
>> > > > vote
>> > > > on the release. The vote will be open for at least 72 hours.
>> > > >
>> > > > Only votes from PMC members are binding, but all members of the
>> community
>> > > > are
>> > > > encouraged to test the release and vote with "(non-binding)".
>> > > >
>> > > > The standard verification procedure is documented at
>> > > >
>> > > >
>> > >
>> https://github.com/apache/arrow-datafusion/blob/main/dev/release/README.md#verifying-release-candidates
>> > > > .
>> > > >
>> > > > [ ] +1 Release this as Apache Arrow DataFusion 33.0.0
>> > > > [ ] +0
>> > > > [ ] -1 Do not release this as Apache Arrow DataFusion 33.0.0
>> because...
>> > > >
>> > > > Here is my vote:
>> > > >
>> > > > +1
>> > > >
>> > > > [1]:
>> > > >
>> > > >
>> > >
>> https://github.com/apache/arrow-datafusion/tree/d2efaa965989278fc86291be5048c4b460ed82c7
>> > > > [2]:
>> > > >
>> > > >
>> > >
>> https://dist.apache.org/repos/dist/dev/arrow/apache-arrow-datafusion-33.0.0-rc2
>> > > > [3]:
>> > > >
>> > > >
>> > >
>> https://github.com/apache/arrow-datafusion/blob/d2efaa965989278fc86291be5048c4b460ed82c7/CHANGELOG.md
>> > > >
>> > >
>>
>


Re: [VOTE][RUST][DataFusion] Release Apache Arrow DataFusion 33.0.0 RC2

2023-11-16 Thread Andy Grove
The vote passes with six +1 votes (four binding). Thanks, everyone. The
release is now available on crates.io and at
https://dist.apache.org/repos/dist/release/arrow/arrow-datafusion-33.0.0/

On Tue, Nov 14, 2023 at 8:30 AM Chao Sun  wrote:

> +1 (non-binding). Verified on M1 Mac.
>
> Thanks Andy.
>
> On Mon, Nov 13, 2023 at 11:16 PM Wayne Xia  wrote:
> >
> > +1 (non-binding)
> >
> > Verified on amd64 linux
> >
> > Thanks Andy
> >
> > On Tue, Nov 14, 2023 at 2:25 PM vin jake  wrote:
> >
> > > +1 (binding)
> > >
> > > Verified on M1 macbook
> > >
> > > Thanks Andy
> > >
> > > On Mon, Nov 13, 2023 at 11:39 PM Andy Grove 
> wrote:
> > >
> > > > Hi,
> > > >
> > > > I would like to propose a release of Apache Arrow DataFusion
> > > > Implementation,
> > > > version 33.0.0.
> > > >
> > > > This release candidate is based on commit:
> > > > d2efaa965989278fc86291be5048c4b460ed82c7 [1]
> > > > The proposed release tarball and signatures are hosted at [2].
> > > > The changelog is located at [3].
> > > >
> > > > Please download, verify checksums and signatures, run the unit
> tests, and
> > > > vote
> > > > on the release. The vote will be open for at least 72 hours.
> > > >
> > > > Only votes from PMC members are binding, but all members of the
> community
> > > > are
> > > > encouraged to test the release and vote with "(non-binding)".
> > > >
> > > > The standard verification procedure is documented at
> > > >
> > > >
> > >
> https://github.com/apache/arrow-datafusion/blob/main/dev/release/README.md#verifying-release-candidates
> > > > .
> > > >
> > > > [ ] +1 Release this as Apache Arrow DataFusion 33.0.0
> > > > [ ] +0
> > > > [ ] -1 Do not release this as Apache Arrow DataFusion 33.0.0
> because...
> > > >
> > > > Here is my vote:
> > > >
> > > > +1
> > > >
> > > > [1]:
> > > >
> > > >
> > >
> https://github.com/apache/arrow-datafusion/tree/d2efaa965989278fc86291be5048c4b460ed82c7
> > > > [2]:
> > > >
> > > >
> > >
> https://dist.apache.org/repos/dist/dev/arrow/apache-arrow-datafusion-33.0.0-rc2
> > > > [3]:
> > > >
> > > >
> > >
> https://github.com/apache/arrow-datafusion/blob/d2efaa965989278fc86291be5048c4b460ed82c7/CHANGELOG.md
> > > >
> > >
>


[VOTE][RUST][DataFusion] Release Apache Arrow DataFusion 33.0.0 RC2

2023-11-13 Thread Andy Grove
Hi,

I would like to propose a release of Apache Arrow DataFusion Implementation,
version 33.0.0.

This release candidate is based on commit:
d2efaa965989278fc86291be5048c4b460ed82c7 [1]
The proposed release tarball and signatures are hosted at [2].
The changelog is located at [3].

Please download, verify checksums and signatures, run the unit tests, and
vote
on the release. The vote will be open for at least 72 hours.

Only votes from PMC members are binding, but all members of the community
are
encouraged to test the release and vote with "(non-binding)".

The standard verification procedure is documented at
https://github.com/apache/arrow-datafusion/blob/main/dev/release/README.md#verifying-release-candidates
.

[ ] +1 Release this as Apache Arrow DataFusion 33.0.0
[ ] +0
[ ] -1 Do not release this as Apache Arrow DataFusion 33.0.0 because...

Here is my vote:

+1

[1]:
https://github.com/apache/arrow-datafusion/tree/d2efaa965989278fc86291be5048c4b460ed82c7
[2]:
https://dist.apache.org/repos/dist/dev/arrow/apache-arrow-datafusion-33.0.0-rc2
[3]:
https://github.com/apache/arrow-datafusion/blob/d2efaa965989278fc86291be5048c4b460ed82c7/CHANGELOG.md


Re: [VOTE][RUST] Release Apache Arrow Rust 48.0.1 RC1

2023-11-12 Thread Andy Grove
+1 (binding)

Verified on Ubuntu 22.04.3 LTS.

Thanks, Andrew.

On Thu, Nov 9, 2023 at 2:10 PM Raphael Taylor-Davies
 wrote:

> +1 (binding)
>
> Verified on x86_64 GNU/Linux
>
> On 09/11/2023 20:31, Andrew Lamb wrote:
> > As discussed on [5], I would like to propose a patch release of Apache
> > Arrow Rust Implementation, version 48.0.1 to include two bug fixes.
> >
> > This release candidate is based on commit:
> > b60fc7bb09ada1385d3542b784fff2915fbc9cff [1]
> >
> > The proposed release tarball and signatures are hosted at [2].
> >
> > The changelog is located at [3].
> >
> > Please download, verify checksums and signatures, run the unit tests,
> > and vote on the release. There is a script [4] that automates some of
> > the verification.
> >
> > The vote will be open for at least 72 hours.
> >
> > [ ] +1 Release this as Apache Arrow Rust
> > [ ] +0
> > [ ] -1 Do not release this as Apache Arrow Rust  because...
> >
> > [1]:
> >
> https://github.com/apache/arrow-rs/tree/b60fc7bb09ada1385d3542b784fff2915fbc9cff
> > [2]:
> https://dist.apache.org/repos/dist/dev/arrow/apache-arrow-rs-48.0.1-rc1
> > [3]:
> >
> https://github.com/apache/arrow-rs/blob/b60fc7bb09ada1385d3542b784fff2915fbc9cff/CHANGELOG.md
> > [4]:
> >
> https://github.com/apache/arrow-rs/blob/master/dev/release/verify-release-candidate.sh
> > [5]: https://github.com/apache/arrow-rs/issues/5050
> >
>


Re: [VOTE][RUST] Release Apache Arrow Rust 49.0.0 RC1

2023-11-07 Thread Andy Grove
+1 (binding)

Verified on Ubuntu 22.04.3 LTS.

Thanks, Raphael.

On Tue, Nov 7, 2023 at 2:22 PM Raphael Taylor-Davies
 wrote:

> Hi,
>
> I would like to propose a release of Apache Arrow Rust Implementation,
> version 49.0.0.
>
> This release candidate is based on commit:
> 747dcbf0670aeab2ede474edb3c4f22028d6a7e6 [1]
>
> The proposed release tarball and signatures are hosted at [2].
>
> The changelog is located at [3].
>
> Please download, verify checksums and signatures, run the unit tests,
> and vote on the release. There is a script [4] that automates some of
> the verification.
>
> The vote will be open for at least 72 hours.
>
> [ ] +1 Release this as Apache Arrow Rust
> [ ] +0
> [ ] -1 Do not release this as Apache Arrow Rust  because...
>
> [1]:
>
> https://github.com/apache/arrow-rs/tree/747dcbf0670aeab2ede474edb3c4f22028d6a7e6
> [2]:
> https://dist.apache.org/repos/dist/dev/arrow/apache-arrow-rs-49.0.0-rc1
> [3]:
>
> https://github.com/apache/arrow-rs/blob/747dcbf0670aeab2ede474edb3c4f22028d6a7e6/CHANGELOG.md
> [4]:
>
> https://github.com/apache/arrow-rs/blob/master/dev/release/verify-release-candidate.sh
>
>


Re: [VOTE][RUST][DataFusion] Release Apache Arrow DataFusion 33.0.0 RC1

2023-11-06 Thread Andy Grove
I filed https://github.com/apache/arrow-datafusion/issues/8069

On Mon, Nov 6, 2023 at 11:59 AM Andy Grove  wrote:

> I see the same error when I run on my M1 Macbook Air with 16 GB RAM.
>
>  aggregates::tests::run_first_last_multi_partitions stdout 
> Error: ResourcesExhausted("Failed to allocate additional 632 bytes for
> GroupedHashAggregateStream[0] with 1829 bytes already allocated - maximum
> available is 605")
>
> It worked fine on my workstation with 128 GB RAM.
>
>
>
> On Mon, Nov 6, 2023 at 11:23 AM L. C. Hsieh  wrote:
>
>> Hmm, ran verification script and got one failure:
>>
>> failures:
>>
>>  aggregates::tests::run_first_last_multi_partitions stdout 
>> Error: ResourcesExhausted("Failed to allocate additional 632 bytes for
>> GroupedHashAggregateStream[0] with 1829 bytes already allocated -
>> maximum available is 605")
>>
>> failures:
>> aggregates::tests::run_first_last_multi_partitions
>>
>> test result: FAILED. 557 passed; 1 failed; 1 ignored; 0 measured; 0
>> filtered out; finished in 2.21s
>>
>>
>>
>> On Mon, Nov 6, 2023 at 6:57 AM Andy Grove  wrote:
>> >
>> > Hi,
>> >
>> > I would like to propose a release of Apache Arrow DataFusion
>> Implementation,
>> > version 33.0.0.
>> >
>> > This release candidate is based on commit:
>> > 262f08778b8ec231d96792c01fc3e051640eb5d4 [1]
>> > The proposed release tarball and signatures are hosted at [2].
>> > The changelog is located at [3].
>> >
>> > Please download, verify checksums and signatures, run the unit tests,
>> and
>> > vote
>> > on the release. The vote will be open for at least 72 hours.
>> >
>> > Only votes from PMC members are binding, but all members of the
>> community
>> > are
>> > encouraged to test the release and vote with "(non-binding)".
>> >
>> > The standard verification procedure is documented at
>> >
>> https://github.com/apache/arrow-datafusion/blob/main/dev/release/README.md#verifying-release-candidates
>> > .
>> >
>> > [ ] +1 Release this as Apache Arrow DataFusion 33.0.0
>> > [ ] +0
>> > [ ] -1 Do not release this as Apache Arrow DataFusion 33.0.0 because...
>> >
>> > Here is my vote:
>> >
>> > +1
>> >
>> > [1]:
>> >
>> https://github.com/apache/arrow-datafusion/tree/262f08778b8ec231d96792c01fc3e051640eb5d4
>> > [2]:
>> >
>> https://dist.apache.org/repos/dist/dev/arrow/apache-arrow-datafusion-33.0.0-rc1
>> > [3]:
>> >
>> https://github.com/apache/arrow-datafusion/blob/262f08778b8ec231d96792c01fc3e051640eb5d4/CHANGELOG.md
>>
>


Re: [VOTE][RUST][DataFusion] Release Apache Arrow DataFusion 33.0.0 RC1

2023-11-06 Thread Andy Grove
I see the same error when I run on my M1 Macbook Air with 16 GB RAM.

 aggregates::tests::run_first_last_multi_partitions stdout 
Error: ResourcesExhausted("Failed to allocate additional 632 bytes for
GroupedHashAggregateStream[0] with 1829 bytes already allocated - maximum
available is 605")

It worked fine on my workstation with 128 GB RAM.



On Mon, Nov 6, 2023 at 11:23 AM L. C. Hsieh  wrote:

> Hmm, ran verification script and got one failure:
>
> failures:
>
>  aggregates::tests::run_first_last_multi_partitions stdout 
> Error: ResourcesExhausted("Failed to allocate additional 632 bytes for
> GroupedHashAggregateStream[0] with 1829 bytes already allocated -
> maximum available is 605")
>
> failures:
> aggregates::tests::run_first_last_multi_partitions
>
> test result: FAILED. 557 passed; 1 failed; 1 ignored; 0 measured; 0
> filtered out; finished in 2.21s
>
>
>
> On Mon, Nov 6, 2023 at 6:57 AM Andy Grove  wrote:
> >
> > Hi,
> >
> > I would like to propose a release of Apache Arrow DataFusion
> Implementation,
> > version 33.0.0.
> >
> > This release candidate is based on commit:
> > 262f08778b8ec231d96792c01fc3e051640eb5d4 [1]
> > The proposed release tarball and signatures are hosted at [2].
> > The changelog is located at [3].
> >
> > Please download, verify checksums and signatures, run the unit tests, and
> > vote
> > on the release. The vote will be open for at least 72 hours.
> >
> > Only votes from PMC members are binding, but all members of the community
> > are
> > encouraged to test the release and vote with "(non-binding)".
> >
> > The standard verification procedure is documented at
> >
> https://github.com/apache/arrow-datafusion/blob/main/dev/release/README.md#verifying-release-candidates
> > .
> >
> > [ ] +1 Release this as Apache Arrow DataFusion 33.0.0
> > [ ] +0
> > [ ] -1 Do not release this as Apache Arrow DataFusion 33.0.0 because...
> >
> > Here is my vote:
> >
> > +1
> >
> > [1]:
> >
> https://github.com/apache/arrow-datafusion/tree/262f08778b8ec231d96792c01fc3e051640eb5d4
> > [2]:
> >
> https://dist.apache.org/repos/dist/dev/arrow/apache-arrow-datafusion-33.0.0-rc1
> > [3]:
> >
> https://github.com/apache/arrow-datafusion/blob/262f08778b8ec231d96792c01fc3e051640eb5d4/CHANGELOG.md
>


[VOTE][RUST][DataFusion] Release Apache Arrow DataFusion 33.0.0 RC1

2023-11-06 Thread Andy Grove
Hi,

I would like to propose a release of Apache Arrow DataFusion Implementation,
version 33.0.0.

This release candidate is based on commit:
262f08778b8ec231d96792c01fc3e051640eb5d4 [1]
The proposed release tarball and signatures are hosted at [2].
The changelog is located at [3].

Please download, verify checksums and signatures, run the unit tests, and
vote
on the release. The vote will be open for at least 72 hours.

Only votes from PMC members are binding, but all members of the community
are
encouraged to test the release and vote with "(non-binding)".

The standard verification procedure is documented at
https://github.com/apache/arrow-datafusion/blob/main/dev/release/README.md#verifying-release-candidates
.

[ ] +1 Release this as Apache Arrow DataFusion 33.0.0
[ ] +0
[ ] -1 Do not release this as Apache Arrow DataFusion 33.0.0 because...

Here is my vote:

+1

[1]:
https://github.com/apache/arrow-datafusion/tree/262f08778b8ec231d96792c01fc3e051640eb5d4
[2]:
https://dist.apache.org/repos/dist/dev/arrow/apache-arrow-datafusion-33.0.0-rc1
[3]:
https://github.com/apache/arrow-datafusion/blob/262f08778b8ec231d96792c01fc3e051640eb5d4/CHANGELOG.md


[RESULT][VOTE][RUST][DataFusion] Release DataFusion Python Bindings 32.0.0 RC1

2023-10-25 Thread Andy Grove
On Wed, Oct 25, 2023 at 10:28 AM Andy Grove  wrote:

> The vote passes with three binding votes. Thanks, everyone.
>
> The release is now available:
>
>
> https://dist.apache.org/repos/dist/release/arrow/arrow-datafusion-python-32.0.0/
> https://pypi.org/project/datafusion/32.0.0/
> https://crates.io/crates/datafusion-python/32.0.0
>
>
>
>
>
> On Tue, Oct 24, 2023 at 2:33 PM Andrew Lamb  wrote:
>
>> +1 (binding) on x86 mac
>>
>> Thank you Andy -- I know this takes effort to keep going, but it is really
>> valuable to keep these releases flowing
>>
>> Andrew
>>
>> On Sat, Oct 21, 2023 at 4:48 PM L. C. Hsieh  wrote:
>>
>> > +1 (binding)
>> >
>> > Verified on M1 Mac.
>> >
>> > Thanks Andy.
>> >
>> > On Sat, Oct 21, 2023 at 1:42 PM Andy Grove 
>> wrote:
>> > >
>> > > Hi,
>> > >
>> > > I would like to propose a release of Apache Arrow DataFusion Python
>> > > Bindings,
>> > > version 32.0.0.
>> > >
>> > > This release candidate is based on commit:
>> > > fc3c24b52e8bfa1e170fb9f3708fe014e41b3e9e [1]
>> > > The proposed release tarball and signatures are hosted at [2].
>> > > The changelog is located at [3].
>> > > The Python wheels are located at [4].
>> > >
>> > > Please download, verify checksums and signatures, run the unit tests,
>> and
>> > > vote
>> > > on the release. The vote will be open for at least 72 hours.
>> > >
>> > > Only votes from PMC members are binding, but all members of the
>> community
>> > > are
>> > > encouraged to test the release and vote with "(non-binding)".
>> > >
>> > > The standard verification procedure is documented at
>> > >
>> >
>> https://github.com/apache/arrow-datafusion-python/blob/main/dev/release/README.md#verifying-release-candidates
>> > > .
>> > >
>> > > [ ] +1 Release this as Apache Arrow DataFusion Python 32.0.0
>> > > [ ] +0
>> > > [ ] -1 Do not release this as Apache Arrow DataFusion Python 32.0.0
>> > > because...
>> > >
>> > > Here is my vote:
>> > >
>> > > +1
>> > >
>> > > [1]:
>> > >
>> >
>> https://github.com/apache/arrow-datafusion-python/tree/fc3c24b52e8bfa1e170fb9f3708fe014e41b3e9e
>> > > [2]:
>> > >
>> >
>> https://dist.apache.org/repos/dist/dev/arrow/apache-arrow-datafusion-python-32.0.0-rc1
>> > > [3]:
>> > >
>> >
>> https://github.com/apache/arrow-datafusion-python/blob/fc3c24b52e8bfa1e170fb9f3708fe014e41b3e9e/CHANGELOG.md
>> > > [4]: https://test.pypi.org/project/datafusion/32.0.0/
>> >
>>
>


Re: [VOTE][RUST][DataFusion] Release DataFusion Python Bindings 32.0.0 RC1

2023-10-25 Thread Andy Grove
The vote passes with three binding votes. Thanks, everyone.

The release is now available:

https://dist.apache.org/repos/dist/release/arrow/arrow-datafusion-python-32.0.0/
https://pypi.org/project/datafusion/32.0.0/
https://crates.io/crates/datafusion-python/32.0.0





On Tue, Oct 24, 2023 at 2:33 PM Andrew Lamb  wrote:

> +1 (binding) on x86 mac
>
> Thank you Andy -- I know this takes effort to keep going, but it is really
> valuable to keep these releases flowing
>
> Andrew
>
> On Sat, Oct 21, 2023 at 4:48 PM L. C. Hsieh  wrote:
>
> > +1 (binding)
> >
> > Verified on M1 Mac.
> >
> > Thanks Andy.
> >
> > On Sat, Oct 21, 2023 at 1:42 PM Andy Grove 
> wrote:
> > >
> > > Hi,
> > >
> > > I would like to propose a release of Apache Arrow DataFusion Python
> > > Bindings,
> > > version 32.0.0.
> > >
> > > This release candidate is based on commit:
> > > fc3c24b52e8bfa1e170fb9f3708fe014e41b3e9e [1]
> > > The proposed release tarball and signatures are hosted at [2].
> > > The changelog is located at [3].
> > > The Python wheels are located at [4].
> > >
> > > Please download, verify checksums and signatures, run the unit tests,
> and
> > > vote
> > > on the release. The vote will be open for at least 72 hours.
> > >
> > > Only votes from PMC members are binding, but all members of the
> community
> > > are
> > > encouraged to test the release and vote with "(non-binding)".
> > >
> > > The standard verification procedure is documented at
> > >
> >
> https://github.com/apache/arrow-datafusion-python/blob/main/dev/release/README.md#verifying-release-candidates
> > > .
> > >
> > > [ ] +1 Release this as Apache Arrow DataFusion Python 32.0.0
> > > [ ] +0
> > > [ ] -1 Do not release this as Apache Arrow DataFusion Python 32.0.0
> > > because...
> > >
> > > Here is my vote:
> > >
> > > +1
> > >
> > > [1]:
> > >
> >
> https://github.com/apache/arrow-datafusion-python/tree/fc3c24b52e8bfa1e170fb9f3708fe014e41b3e9e
> > > [2]:
> > >
> >
> https://dist.apache.org/repos/dist/dev/arrow/apache-arrow-datafusion-python-32.0.0-rc1
> > > [3]:
> > >
> >
> https://github.com/apache/arrow-datafusion-python/blob/fc3c24b52e8bfa1e170fb9f3708fe014e41b3e9e/CHANGELOG.md
> > > [4]: https://test.pypi.org/project/datafusion/32.0.0/
> >
>


[VOTE][RUST][DataFusion] Release DataFusion Python Bindings 32.0.0 RC1

2023-10-21 Thread Andy Grove
Hi,

I would like to propose a release of Apache Arrow DataFusion Python
Bindings,
version 32.0.0.

This release candidate is based on commit:
fc3c24b52e8bfa1e170fb9f3708fe014e41b3e9e [1]
The proposed release tarball and signatures are hosted at [2].
The changelog is located at [3].
The Python wheels are located at [4].

Please download, verify checksums and signatures, run the unit tests, and
vote
on the release. The vote will be open for at least 72 hours.

Only votes from PMC members are binding, but all members of the community
are
encouraged to test the release and vote with "(non-binding)".

The standard verification procedure is documented at
https://github.com/apache/arrow-datafusion-python/blob/main/dev/release/README.md#verifying-release-candidates
.

[ ] +1 Release this as Apache Arrow DataFusion Python 32.0.0
[ ] +0
[ ] -1 Do not release this as Apache Arrow DataFusion Python 32.0.0
because...

Here is my vote:

+1

[1]:
https://github.com/apache/arrow-datafusion-python/tree/fc3c24b52e8bfa1e170fb9f3708fe014e41b3e9e
[2]:
https://dist.apache.org/repos/dist/dev/arrow/apache-arrow-datafusion-python-32.0.0-rc1
[3]:
https://github.com/apache/arrow-datafusion-python/blob/fc3c24b52e8bfa1e170fb9f3708fe014e41b3e9e/CHANGELOG.md
[4]: https://test.pypi.org/project/datafusion/32.0.0/


[VOTE][RUST][DataFusion] Release Apache Arrow DataFusion 32.0.0 RC1

2023-10-12 Thread Andy Grove
On Thu, Oct 12, 2023 at 9:17 AM Andy Grove  wrote:

> The vote passes with 7 +1 votes (5 binding). Thanks, everyone.
>
> On Mon, Oct 9, 2023 at 10:56 AM Will Jones 
> wrote:
>
>> +1 (binding)
>> Verified on M1 Mac.
>>
>> On Mon, Oct 9, 2023 at 7:12 AM Andrew Lamb  wrote:
>>
>> > +1 (binding)
>> > Verified on x86 mac
>> >
>> > Thanks Andy
>> >
>> > On Sun, Oct 8, 2023 at 1:22 PM Andy Grove 
>> wrote:
>> >
>> > > Hi,
>> > >
>> > > I would like to propose a release of Apache Arrow DataFusion
>> > > Implementation,
>> > > version 32.0.0.
>> > >
>> > > This release candidate is based on commit:
>> > > eca48dae2447a67fcf30313c956e6c39cf739d48 [1]
>> > > The proposed release tarball and signatures are hosted at [2].
>> > > The changelog is located at [3].
>> > >
>> > > Please download, verify checksums and signatures, run the unit tests,
>> and
>> > > vote
>> > > on the release. The vote will be open for at least 72 hours.
>> > >
>> > > Only votes from PMC members are binding, but all members of the
>> community
>> > > are
>> > > encouraged to test the release and vote with "(non-binding)".
>> > >
>> > > The standard verification procedure is documented at
>> > >
>> > >
>> >
>> https://github.com/apache/arrow-datafusion/blob/main/dev/release/README.md#verifying-release-candidates
>> > > .
>> > >
>> > > [ ] +1 Release this as Apache Arrow DataFusion 32.0.0
>> > > [ ] +0
>> > > [ ] -1 Do not release this as Apache Arrow DataFusion 32.0.0
>> because...
>> > >
>> > > Here is my vote:
>> > >
>> > > +1
>> > >
>> > > [1]:
>> > >
>> > >
>> >
>> https://github.com/apache/arrow-datafusion/tree/eca48dae2447a67fcf30313c956e6c39cf739d48
>> > > [2]:
>> > >
>> > >
>> >
>> https://dist.apache.org/repos/dist/dev/arrow/apache-arrow-datafusion-32.0.0-rc1
>> > > [3]:
>> > >
>> > >
>> >
>> https://github.com/apache/arrow-datafusion/blob/eca48dae2447a67fcf30313c956e6c39cf739d48/CHANGELOG.md
>> > >
>> >
>>
>


Re: [VOTE][RUST][DataFusion] Release Apache Arrow DataFusion 32.0.0 RC1

2023-10-12 Thread Andy Grove
The vote passes with 7 +1 votes (5 binding). Thanks, everyone.

On Mon, Oct 9, 2023 at 10:56 AM Will Jones  wrote:

> +1 (binding)
> Verified on M1 Mac.
>
> On Mon, Oct 9, 2023 at 7:12 AM Andrew Lamb  wrote:
>
> > +1 (binding)
> > Verified on x86 mac
> >
> > Thanks Andy
> >
> > On Sun, Oct 8, 2023 at 1:22 PM Andy Grove  wrote:
> >
> > > Hi,
> > >
> > > I would like to propose a release of Apache Arrow DataFusion
> > > Implementation,
> > > version 32.0.0.
> > >
> > > This release candidate is based on commit:
> > > eca48dae2447a67fcf30313c956e6c39cf739d48 [1]
> > > The proposed release tarball and signatures are hosted at [2].
> > > The changelog is located at [3].
> > >
> > > Please download, verify checksums and signatures, run the unit tests,
> and
> > > vote
> > > on the release. The vote will be open for at least 72 hours.
> > >
> > > Only votes from PMC members are binding, but all members of the
> community
> > > are
> > > encouraged to test the release and vote with "(non-binding)".
> > >
> > > The standard verification procedure is documented at
> > >
> > >
> >
> https://github.com/apache/arrow-datafusion/blob/main/dev/release/README.md#verifying-release-candidates
> > > .
> > >
> > > [ ] +1 Release this as Apache Arrow DataFusion 32.0.0
> > > [ ] +0
> > > [ ] -1 Do not release this as Apache Arrow DataFusion 32.0.0 because...
> > >
> > > Here is my vote:
> > >
> > > +1
> > >
> > > [1]:
> > >
> > >
> >
> https://github.com/apache/arrow-datafusion/tree/eca48dae2447a67fcf30313c956e6c39cf739d48
> > > [2]:
> > >
> > >
> >
> https://dist.apache.org/repos/dist/dev/arrow/apache-arrow-datafusion-32.0.0-rc1
> > > [3]:
> > >
> > >
> >
> https://github.com/apache/arrow-datafusion/blob/eca48dae2447a67fcf30313c956e6c39cf739d48/CHANGELOG.md
> > >
> >
>


[VOTE][RUST][DataFusion] Release Apache Arrow DataFusion 32.0.0 RC1

2023-10-08 Thread Andy Grove
Hi,

I would like to propose a release of Apache Arrow DataFusion Implementation,
version 32.0.0.

This release candidate is based on commit:
eca48dae2447a67fcf30313c956e6c39cf739d48 [1]
The proposed release tarball and signatures are hosted at [2].
The changelog is located at [3].

Please download, verify checksums and signatures, run the unit tests, and
vote
on the release. The vote will be open for at least 72 hours.

Only votes from PMC members are binding, but all members of the community
are
encouraged to test the release and vote with "(non-binding)".

The standard verification procedure is documented at
https://github.com/apache/arrow-datafusion/blob/main/dev/release/README.md#verifying-release-candidates
.

[ ] +1 Release this as Apache Arrow DataFusion 32.0.0
[ ] +0
[ ] -1 Do not release this as Apache Arrow DataFusion 32.0.0 because...

Here is my vote:

+1

[1]:
https://github.com/apache/arrow-datafusion/tree/eca48dae2447a67fcf30313c956e6c39cf739d48
[2]:
https://dist.apache.org/repos/dist/dev/arrow/apache-arrow-datafusion-32.0.0-rc1
[3]:
https://github.com/apache/arrow-datafusion/blob/eca48dae2447a67fcf30313c956e6c39cf739d48/CHANGELOG.md


[RESULT][VOTE][RUST][DataFusion] Release DataFusion Python Bindings 31.0.0 RC1

2023-09-18 Thread Andy Grove
On Mon, Sep 18, 2023 at 8:27 AM Andy Grove  wrote:

> The vote passes with 5 +1 votes (3 binding). Thanks, everyone.
>
> I have published the release (to ASF and PyPi).
>
>
>
> On Sat, Sep 16, 2023 at 3:14 AM Daniel Alejandro Mesejo-León <
> mesejol...@gmail.com> wrote:
>
>> +1 (non-binding) verified using parquet examples on M1 Mac.
>> Thanks, Andy.
>>
>> On Thu, Sep 14, 2023 at 1:46 PM Jeremy Dyer  wrote:
>>
>> > +1 (non-binding)
>> >
>> > Verified with build script and in the context of using it as a 3rd party
>> > dependency for dask-sql.
>> >
>> > Thanks Andy
>> >
>> > On Thu, Sep 14, 2023 at 7:26 AM Andrew Lamb 
>> wrote:
>> >
>> > > +1 (binding)
>> > >
>> > > thank you Andy
>> > >
>> > > On Wed, Sep 13, 2023 at 4:28 PM L. C. Hsieh  wrote:
>> > >
>> > > > +1 (binding)
>> > > >
>> > > > Verified on Intel Mac.
>> > > >
>> > > > Thanks Andy.
>> > > >
>> > > > On Wed, Sep 13, 2023 at 12:26 PM Andy Grove 
>> > > wrote:
>> > > > >
>> > > > > Hi,
>> > > > >
>> > > > > I would like to propose a release of Apache Arrow DataFusion
>> Python
>> > > > > Bindings,
>> > > > > version 31.0.0.
>> > > > >
>> > > > > This release candidate is based on commit:
>> > > > > 54d17771fc2814339f94e0871401ee946d7c913b [1]
>> > > > > The proposed release tarball and signatures are hosted at [2].
>> > > > > The changelog is located at [3].
>> > > > > The Python wheels are located at [4].
>> > > > >
>> > > > > Please download, verify checksums and signatures, run the unit
>> tests,
>> > > and
>> > > > > vote
>> > > > > on the release. The vote will be open for at least 72 hours.
>> > > > >
>> > > > > Only votes from PMC members are binding, but all members of the
>> > > community
>> > > > > are
>> > > > > encouraged to test the release and vote with "(non-binding)".
>> > > > >
>> > > > > The standard verification procedure is documented at
>> > > > >
>> > > >
>> > >
>> >
>> https://github.com/apache/arrow-datafusion-python/blob/main/dev/release/README.md#verifying-release-candidates
>> > > > > .
>> > > > >
>> > > > > [ ] +1 Release this as Apache Arrow DataFusion Python 31.0.0
>> > > > > [ ] +0
>> > > > > [ ] -1 Do not release this as Apache Arrow DataFusion Python
>> 31.0.0
>> > > > > because...
>> > > > >
>> > > > > Here is my vote:
>> > > > >
>> > > > > +1
>> > > > >
>> > > > > [1]:
>> > > > >
>> > > >
>> > >
>> >
>> https://github.com/apache/arrow-datafusion-python/tree/54d17771fc2814339f94e0871401ee946d7c913b
>> > > > > [2]:
>> > > > >
>> > > >
>> > >
>> >
>> https://dist.apache.org/repos/dist/dev/arrow/apache-arrow-datafusion-python-31.0.0-rc1
>> > > > > [3]:
>> > > > >
>> > > >
>> > >
>> >
>> https://github.com/apache/arrow-datafusion-python/blob/54d17771fc2814339f94e0871401ee946d7c913b/CHANGELOG.md
>> > > > > [4]: https://test.pypi.org/project/datafusion/31.0.0/
>> > > >
>> > >
>> >
>>
>


Re: [VOTE][RUST][DataFusion] Release DataFusion Python Bindings 31.0.0 RC1

2023-09-18 Thread Andy Grove
The vote passes with 5 +1 votes (3 binding). Thanks, everyone.

I have published the release (to ASF and PyPi).



On Sat, Sep 16, 2023 at 3:14 AM Daniel Alejandro Mesejo-León <
mesejol...@gmail.com> wrote:

> +1 (non-binding) verified using parquet examples on M1 Mac.
> Thanks, Andy.
>
> On Thu, Sep 14, 2023 at 1:46 PM Jeremy Dyer  wrote:
>
> > +1 (non-binding)
> >
> > Verified with build script and in the context of using it as a 3rd party
> > dependency for dask-sql.
> >
> > Thanks Andy
> >
> > On Thu, Sep 14, 2023 at 7:26 AM Andrew Lamb 
> wrote:
> >
> > > +1 (binding)
> > >
> > > thank you Andy
> > >
> > > On Wed, Sep 13, 2023 at 4:28 PM L. C. Hsieh  wrote:
> > >
> > > > +1 (binding)
> > > >
> > > > Verified on Intel Mac.
> > > >
> > > > Thanks Andy.
> > > >
> > > > On Wed, Sep 13, 2023 at 12:26 PM Andy Grove 
> > > wrote:
> > > > >
> > > > > Hi,
> > > > >
> > > > > I would like to propose a release of Apache Arrow DataFusion Python
> > > > > Bindings,
> > > > > version 31.0.0.
> > > > >
> > > > > This release candidate is based on commit:
> > > > > 54d17771fc2814339f94e0871401ee946d7c913b [1]
> > > > > The proposed release tarball and signatures are hosted at [2].
> > > > > The changelog is located at [3].
> > > > > The Python wheels are located at [4].
> > > > >
> > > > > Please download, verify checksums and signatures, run the unit
> tests,
> > > and
> > > > > vote
> > > > > on the release. The vote will be open for at least 72 hours.
> > > > >
> > > > > Only votes from PMC members are binding, but all members of the
> > > community
> > > > > are
> > > > > encouraged to test the release and vote with "(non-binding)".
> > > > >
> > > > > The standard verification procedure is documented at
> > > > >
> > > >
> > >
> >
> https://github.com/apache/arrow-datafusion-python/blob/main/dev/release/README.md#verifying-release-candidates
> > > > > .
> > > > >
> > > > > [ ] +1 Release this as Apache Arrow DataFusion Python 31.0.0
> > > > > [ ] +0
> > > > > [ ] -1 Do not release this as Apache Arrow DataFusion Python 31.0.0
> > > > > because...
> > > > >
> > > > > Here is my vote:
> > > > >
> > > > > +1
> > > > >
> > > > > [1]:
> > > > >
> > > >
> > >
> >
> https://github.com/apache/arrow-datafusion-python/tree/54d17771fc2814339f94e0871401ee946d7c913b
> > > > > [2]:
> > > > >
> > > >
> > >
> >
> https://dist.apache.org/repos/dist/dev/arrow/apache-arrow-datafusion-python-31.0.0-rc1
> > > > > [3]:
> > > > >
> > > >
> > >
> >
> https://github.com/apache/arrow-datafusion-python/blob/54d17771fc2814339f94e0871401ee946d7c913b/CHANGELOG.md
> > > > > [4]: https://test.pypi.org/project/datafusion/31.0.0/
> > > >
> > >
> >
>


[VOTE][RUST][DataFusion] Release DataFusion Python Bindings 31.0.0 RC1

2023-09-13 Thread Andy Grove
Hi,

I would like to propose a release of Apache Arrow DataFusion Python
Bindings,
version 31.0.0.

This release candidate is based on commit:
54d17771fc2814339f94e0871401ee946d7c913b [1]
The proposed release tarball and signatures are hosted at [2].
The changelog is located at [3].
The Python wheels are located at [4].

Please download, verify checksums and signatures, run the unit tests, and
vote
on the release. The vote will be open for at least 72 hours.

Only votes from PMC members are binding, but all members of the community
are
encouraged to test the release and vote with "(non-binding)".

The standard verification procedure is documented at
https://github.com/apache/arrow-datafusion-python/blob/main/dev/release/README.md#verifying-release-candidates
.

[ ] +1 Release this as Apache Arrow DataFusion Python 31.0.0
[ ] +0
[ ] -1 Do not release this as Apache Arrow DataFusion Python 31.0.0
because...

Here is my vote:

+1

[1]:
https://github.com/apache/arrow-datafusion-python/tree/54d17771fc2814339f94e0871401ee946d7c913b
[2]:
https://dist.apache.org/repos/dist/dev/arrow/apache-arrow-datafusion-python-31.0.0-rc1
[3]:
https://github.com/apache/arrow-datafusion-python/blob/54d17771fc2814339f94e0871401ee946d7c913b/CHANGELOG.md
[4]: https://test.pypi.org/project/datafusion/31.0.0/


[RESULT][VOTE][RUST][DataFusion] Release Apache Arrow DataFusion 31.0.0 RC1

2023-09-11 Thread Andy Grove
On Mon, Sep 11, 2023 at 2:02 PM Andy Grove  wrote:

> The vote passes with 6 +1 votes (5 binding). Thanks, everyone. Crates are
> published.
>
> On Sun, Sep 10, 2023 at 8:04 AM vin jake  wrote:
>
>> +1 (binding)
>>
>> Verified on my M1 Mac
>>
>> Thanks Andy
>>
>> On Sat, Sep 9, 2023 at 12:01 AM Andy Grove  wrote:
>>
>> > Hi,
>> >
>> > I would like to propose a release of Apache Arrow DataFusion
>> > Implementation,
>> > version 31.0.0.
>> >
>> > This release candidate is based on commit:
>> > 44cf6f127ddfba7cda0c243b22f7e0fce70f16ec [1]
>> > The proposed release tarball and signatures are hosted at [2].
>> > The changelog is located at [3].
>> >
>> > Please download, verify checksums and signatures, run the unit tests,
>> and
>> > vote
>> > on the release. The vote will be open for at least 72 hours.
>> >
>> > Only votes from PMC members are binding, but all members of the
>> community
>> > are
>> > encouraged to test the release and vote with "(non-binding)".
>> >
>> > The standard verification procedure is documented at
>> >
>> >
>> https://github.com/apache/arrow-datafusion/blob/main/dev/release/README.md#verifying-release-candidates
>> > .
>> >
>> > [ ] +1 Release this as Apache Arrow DataFusion 31.0.0
>> > [ ] +0
>> > [ ] -1 Do not release this as Apache Arrow DataFusion 31.0.0 because...
>> >
>> > Here is my vote:
>> >
>> > +1
>> >
>> > [1]:
>> >
>> >
>> https://github.com/apache/arrow-datafusion/tree/44cf6f127ddfba7cda0c243b22f7e0fce70f16ec
>> > [2]:
>> >
>> >
>> https://dist.apache.org/repos/dist/dev/arrow/apache-arrow-datafusion-31.0.0-rc1
>> > [3]:
>> >
>> >
>> https://github.com/apache/arrow-datafusion/blob/44cf6f127ddfba7cda0c243b22f7e0fce70f16ec/CHANGELOG.md
>> >
>>
>


Re: [VOTE][RUST][DataFusion] Release Apache Arrow DataFusion 31.0.0 RC1

2023-09-11 Thread Andy Grove
The vote passes with 6 +1 votes (5 binding). Thanks, everyone. Crates are
published.

On Sun, Sep 10, 2023 at 8:04 AM vin jake  wrote:

> +1 (binding)
>
> Verified on my M1 Mac
>
> Thanks Andy
>
> On Sat, Sep 9, 2023 at 12:01 AM Andy Grove  wrote:
>
> > Hi,
> >
> > I would like to propose a release of Apache Arrow DataFusion
> > Implementation,
> > version 31.0.0.
> >
> > This release candidate is based on commit:
> > 44cf6f127ddfba7cda0c243b22f7e0fce70f16ec [1]
> > The proposed release tarball and signatures are hosted at [2].
> > The changelog is located at [3].
> >
> > Please download, verify checksums and signatures, run the unit tests, and
> > vote
> > on the release. The vote will be open for at least 72 hours.
> >
> > Only votes from PMC members are binding, but all members of the community
> > are
> > encouraged to test the release and vote with "(non-binding)".
> >
> > The standard verification procedure is documented at
> >
> >
> https://github.com/apache/arrow-datafusion/blob/main/dev/release/README.md#verifying-release-candidates
> > .
> >
> > [ ] +1 Release this as Apache Arrow DataFusion 31.0.0
> > [ ] +0
> > [ ] -1 Do not release this as Apache Arrow DataFusion 31.0.0 because...
> >
> > Here is my vote:
> >
> > +1
> >
> > [1]:
> >
> >
> https://github.com/apache/arrow-datafusion/tree/44cf6f127ddfba7cda0c243b22f7e0fce70f16ec
> > [2]:
> >
> >
> https://dist.apache.org/repos/dist/dev/arrow/apache-arrow-datafusion-31.0.0-rc1
> > [3]:
> >
> >
> https://github.com/apache/arrow-datafusion/blob/44cf6f127ddfba7cda0c243b22f7e0fce70f16ec/CHANGELOG.md
> >
>


[VOTE][RUST][DataFusion] Release Apache Arrow DataFusion 31.0.0 RC1

2023-09-08 Thread Andy Grove
Hi,

I would like to propose a release of Apache Arrow DataFusion Implementation,
version 31.0.0.

This release candidate is based on commit:
44cf6f127ddfba7cda0c243b22f7e0fce70f16ec [1]
The proposed release tarball and signatures are hosted at [2].
The changelog is located at [3].

Please download, verify checksums and signatures, run the unit tests, and
vote
on the release. The vote will be open for at least 72 hours.

Only votes from PMC members are binding, but all members of the community
are
encouraged to test the release and vote with "(non-binding)".

The standard verification procedure is documented at
https://github.com/apache/arrow-datafusion/blob/main/dev/release/README.md#verifying-release-candidates
.

[ ] +1 Release this as Apache Arrow DataFusion 31.0.0
[ ] +0
[ ] -1 Do not release this as Apache Arrow DataFusion 31.0.0 because...

Here is my vote:

+1

[1]:
https://github.com/apache/arrow-datafusion/tree/44cf6f127ddfba7cda0c243b22f7e0fce70f16ec
[2]:
https://dist.apache.org/repos/dist/dev/arrow/apache-arrow-datafusion-31.0.0-rc1
[3]:
https://github.com/apache/arrow-datafusion/blob/44cf6f127ddfba7cda0c243b22f7e0fce70f16ec/CHANGELOG.md


[RESULT][VOTE][RUST][DataFusion] Release Apache Arrow DataFusion 30.0.0 RC1

2023-08-25 Thread Andy Grove
On Fri, Aug 25, 2023 at 9:58 AM Andy Grove  wrote:

> The vote passes with 5 +1 votes (4 binding). Thanks. everyone! The crates
> were released without issue this time.
>
> On Wed, Aug 23, 2023 at 1:28 AM vin jake  wrote:
>
>> +1 (binding)
>>
>> Verified on M1 macbook.
>>
>> Thanks Andy!
>>
>> On Tue, Aug 22, 2023 at 10:48 PM Andy Grove 
>> wrote:
>>
>> > Hi,
>> >
>> > I would like to propose a release of Apache Arrow DataFusion
>> > Implementation,
>> > version 30.0.0.
>> >
>> > This release candidate is based on commit:
>> > c703526596c8602f24d470d98c469c985a99b4b5 [1]
>> > The proposed release tarball and signatures are hosted at [2].
>> > The changelog is located at [3].
>> >
>> > Please download, verify checksums and signatures, run the unit tests,
>> and
>> > vote
>> > on the release. The vote will be open for at least 72 hours.
>> >
>> > Only votes from PMC members are binding, but all members of the
>> community
>> > are
>> > encouraged to test the release and vote with "(non-binding)".
>> >
>> > The standard verification procedure is documented at
>> >
>> >
>> https://github.com/apache/arrow-datafusion/blob/main/dev/release/README.md#verifying-release-candidates
>> > .
>> >
>> > [ ] +1 Release this as Apache Arrow DataFusion 30.0.0
>> > [ ] +0
>> > [ ] -1 Do not release this as Apache Arrow DataFusion 30.0.0 because...
>> >
>> > Here is my vote:
>> >
>> > +1
>> >
>> > [1]:
>> >
>> >
>> https://github.com/apache/arrow-datafusion/tree/c703526596c8602f24d470d98c469c985a99b4b5
>> > [2]:
>> >
>> >
>> https://dist.apache.org/repos/dist/dev/arrow/apache-arrow-datafusion-30.0.0-rc1
>> > [3]:
>> >
>> >
>> https://github.com/apache/arrow-datafusion/blob/c703526596c8602f24d470d98c469c985a99b4b5/CHANGELOG.md
>> >
>>
>


Re: [VOTE][RUST][DataFusion] Release Apache Arrow DataFusion 30.0.0 RC1

2023-08-25 Thread Andy Grove
The vote passes with 5 +1 votes (4 binding). Thanks. everyone! The crates
were released without issue this time.

On Wed, Aug 23, 2023 at 1:28 AM vin jake  wrote:

> +1 (binding)
>
> Verified on M1 macbook.
>
> Thanks Andy!
>
> On Tue, Aug 22, 2023 at 10:48 PM Andy Grove  wrote:
>
> > Hi,
> >
> > I would like to propose a release of Apache Arrow DataFusion
> > Implementation,
> > version 30.0.0.
> >
> > This release candidate is based on commit:
> > c703526596c8602f24d470d98c469c985a99b4b5 [1]
> > The proposed release tarball and signatures are hosted at [2].
> > The changelog is located at [3].
> >
> > Please download, verify checksums and signatures, run the unit tests, and
> > vote
> > on the release. The vote will be open for at least 72 hours.
> >
> > Only votes from PMC members are binding, but all members of the community
> > are
> > encouraged to test the release and vote with "(non-binding)".
> >
> > The standard verification procedure is documented at
> >
> >
> https://github.com/apache/arrow-datafusion/blob/main/dev/release/README.md#verifying-release-candidates
> > .
> >
> > [ ] +1 Release this as Apache Arrow DataFusion 30.0.0
> > [ ] +0
> > [ ] -1 Do not release this as Apache Arrow DataFusion 30.0.0 because...
> >
> > Here is my vote:
> >
> > +1
> >
> > [1]:
> >
> >
> https://github.com/apache/arrow-datafusion/tree/c703526596c8602f24d470d98c469c985a99b4b5
> > [2]:
> >
> >
> https://dist.apache.org/repos/dist/dev/arrow/apache-arrow-datafusion-30.0.0-rc1
> > [3]:
> >
> >
> https://github.com/apache/arrow-datafusion/blob/c703526596c8602f24d470d98c469c985a99b4b5/CHANGELOG.md
> >
>


[VOTE][RUST][DataFusion] Release Apache Arrow DataFusion 30.0.0 RC1

2023-08-22 Thread Andy Grove
Hi,

I would like to propose a release of Apache Arrow DataFusion Implementation,
version 30.0.0.

This release candidate is based on commit:
c703526596c8602f24d470d98c469c985a99b4b5 [1]
The proposed release tarball and signatures are hosted at [2].
The changelog is located at [3].

Please download, verify checksums and signatures, run the unit tests, and
vote
on the release. The vote will be open for at least 72 hours.

Only votes from PMC members are binding, but all members of the community
are
encouraged to test the release and vote with "(non-binding)".

The standard verification procedure is documented at
https://github.com/apache/arrow-datafusion/blob/main/dev/release/README.md#verifying-release-candidates
.

[ ] +1 Release this as Apache Arrow DataFusion 30.0.0
[ ] +0
[ ] -1 Do not release this as Apache Arrow DataFusion 30.0.0 because...

Here is my vote:

+1

[1]:
https://github.com/apache/arrow-datafusion/tree/c703526596c8602f24d470d98c469c985a99b4b5
[2]:
https://dist.apache.org/repos/dist/dev/arrow/apache-arrow-datafusion-30.0.0-rc1
[3]:
https://github.com/apache/arrow-datafusion/blob/c703526596c8602f24d470d98c469c985a99b4b5/CHANGELOG.md


[RESULT][VOTE][RUST][DataFusion] Release Apache Arrow DataFusion 29.0.0 RC1

2023-08-15 Thread Andy Grove
Note that the release was not published to crates.io. See below for details.

On Mon, Aug 14, 2023 at 1:45 PM Andy Grove  wrote:

> The vote passes with 8 +1 votes (5 binding).
>
> I published the source release, but unfortunately, I could not publish the
> crates to crates.io due to a circular dependency. I have filed an issue
> for this:
>
> https://github.com/apache/arrow-datafusion/issues/7281
>
> I don't think I can unpublish the source release, so I guess we'll have to
> fix the issue and then release as version 30.
>
> Thanks,
>
> Andy.
>
>
>
> On Mon, Aug 14, 2023 at 2:16 AM Daniël Heres 
> wrote:
>
>> +1 (binding)
>>
>> Tested on M1 Mac
>>
>> On first try I got this failing test (increased open file limit to pass)
>>
>> failures:
>>
>>
>>  fuzz_cases::order_spill_fuzz::test_sort_1k_mem stdout 
>> thread 'fuzz_cases::order_spill_fuzz::test_sort_1k_mem' panicked at
>> 'called
>> `Result::unwrap()` on an `Err` value: Execution("Failed to create
>> partition
>> file at
>>
>> \"/var/folders/3b/xk_bhzc565q0mz5j4yw92fjcgn/T/.tmpk6OSKK/.tmpH8pwU4\":
>> Os { code: 24, kind: Uncategorized, message: \"Too many open files\" }")',
>> datafusion/core/tests/fuzz_cases/order_spill_fuzz.rs:95:63
>>
>> And the following warning during compilation:
>>
>> *warning**: function `decimal_to_str` is never used*
>>
>>   *--> *datafusion/sqllogictest/src/engines/conversion.rs:85:15
>>
>>*|*
>>
>> *85* *|* pub(crate) fn decimal_to_str(value: Decimal) -> String {
>>
>>*| *  *^^*
>>
>>*|*
>>
>>*= **note*: `#[warn(dead_code)]` on by default
>>
>> Op zo 13 aug 2023 om 13:36 schreef Andrew Lamb :
>>
>> > +1 (binding)
>> >
>> > Tested on mac x86_64
>> >
>> > Thank you for keeping the code flowing Andy
>> >
>> > Andrew
>> >
>> > On Sun, Aug 13, 2023 at 1:48 AM vin jake  wrote:
>> >
>> > > +1 (binding)
>> > >
>> > > Verified on M1 Mac.
>> > >
>> > > Thanks Andy!
>> > >
>> > > On Sat, Aug 12, 2023, 01:59 Andy Grove  wrote:
>> > >
>> > > > Hi,
>> > > >
>> > > > I would like to propose a release of Apache Arrow DataFusion
>> > > > Implementation,
>> > > > version 29.0.0.
>> > > >
>> > > > This release candidate is based on commit:
>> > > > 8265e99d05382fca57cc7399f8ee241966f4a1f5 [1]
>> > > > The proposed release tarball and signatures are hosted at [2].
>> > > > The changelog is located at [3].
>> > > >
>> > > > Please download, verify checksums and signatures, run the unit
>> tests,
>> > and
>> > > > vote
>> > > > on the release. The vote will be open for at least 72 hours.
>> > > >
>> > > > Only votes from PMC members are binding, but all members of the
>> > community
>> > > > are
>> > > > encouraged to test the release and vote with "(non-binding)".
>> > > >
>> > > > The standard verification procedure is documented at
>> > > >
>> > > >
>> > >
>> >
>> https://github.com/apache/arrow-datafusion/blob/main/dev/release/README.md#verifying-release-candidates
>> > > > .
>> > > >
>> > > > [ ] +1 Release this as Apache Arrow DataFusion 29.0.0
>> > > > [ ] +0
>> > > > [ ] -1 Do not release this as Apache Arrow DataFusion 29.0.0
>> because...
>> > > >
>> > > > Here is my vote:
>> > > >
>> > > > +1
>> > > >
>> > > > [1]:
>> > > >
>> > > >
>> > >
>> >
>> https://github.com/apache/arrow-datafusion/tree/8265e99d05382fca57cc7399f8ee241966f4a1f5
>> > > > [2]:
>> > > >
>> > > >
>> > >
>> >
>> https://dist.apache.org/repos/dist/dev/arrow/apache-arrow-datafusion-29.0.0-rc1
>> > > > [3]:
>> > > >
>> > > >
>> > >
>> >
>> https://github.com/apache/arrow-datafusion/blob/8265e99d05382fca57cc7399f8ee241966f4a1f5/CHANGELOG.md
>> > > >
>> > >
>> >
>>
>>
>> --
>> Daniël Heres
>>
>


Re: [VOTE][RUST][DataFusion] Release Apache Arrow DataFusion 29.0.0 RC1

2023-08-14 Thread Andy Grove
The vote passes with 8 +1 votes (5 binding).

I published the source release, but unfortunately, I could not publish the
crates to crates.io due to a circular dependency. I have filed an issue for
this:

https://github.com/apache/arrow-datafusion/issues/7281

I don't think I can unpublish the source release, so I guess we'll have to
fix the issue and then release as version 30.

Thanks,

Andy.



On Mon, Aug 14, 2023 at 2:16 AM Daniël Heres  wrote:

> +1 (binding)
>
> Tested on M1 Mac
>
> On first try I got this failing test (increased open file limit to pass)
>
> failures:
>
>
>  fuzz_cases::order_spill_fuzz::test_sort_1k_mem stdout 
> thread 'fuzz_cases::order_spill_fuzz::test_sort_1k_mem' panicked at 'called
> `Result::unwrap()` on an `Err` value: Execution("Failed to create partition
> file at
> \"/var/folders/3b/xk_bhzc565q0mz5j4yw92fjcgn/T/.tmpk6OSKK/.tmpH8pwU4\":
> Os { code: 24, kind: Uncategorized, message: \"Too many open files\" }")',
> datafusion/core/tests/fuzz_cases/order_spill_fuzz.rs:95:63
>
> And the following warning during compilation:
>
> *warning**: function `decimal_to_str` is never used*
>
>   *--> *datafusion/sqllogictest/src/engines/conversion.rs:85:15
>
>*|*
>
> *85* *|* pub(crate) fn decimal_to_str(value: Decimal) -> String {
>
>*| *  *^^*
>
>*|*
>
>*= **note*: `#[warn(dead_code)]` on by default
>
> Op zo 13 aug 2023 om 13:36 schreef Andrew Lamb :
>
> > +1 (binding)
> >
> > Tested on mac x86_64
> >
> > Thank you for keeping the code flowing Andy
> >
> > Andrew
> >
> > On Sun, Aug 13, 2023 at 1:48 AM vin jake  wrote:
> >
> > > +1 (binding)
> > >
> > > Verified on M1 Mac.
> > >
> > > Thanks Andy!
> > >
> > > On Sat, Aug 12, 2023, 01:59 Andy Grove  wrote:
> > >
> > > > Hi,
> > > >
> > > > I would like to propose a release of Apache Arrow DataFusion
> > > > Implementation,
> > > > version 29.0.0.
> > > >
> > > > This release candidate is based on commit:
> > > > 8265e99d05382fca57cc7399f8ee241966f4a1f5 [1]
> > > > The proposed release tarball and signatures are hosted at [2].
> > > > The changelog is located at [3].
> > > >
> > > > Please download, verify checksums and signatures, run the unit tests,
> > and
> > > > vote
> > > > on the release. The vote will be open for at least 72 hours.
> > > >
> > > > Only votes from PMC members are binding, but all members of the
> > community
> > > > are
> > > > encouraged to test the release and vote with "(non-binding)".
> > > >
> > > > The standard verification procedure is documented at
> > > >
> > > >
> > >
> >
> https://github.com/apache/arrow-datafusion/blob/main/dev/release/README.md#verifying-release-candidates
> > > > .
> > > >
> > > > [ ] +1 Release this as Apache Arrow DataFusion 29.0.0
> > > > [ ] +0
> > > > [ ] -1 Do not release this as Apache Arrow DataFusion 29.0.0
> because...
> > > >
> > > > Here is my vote:
> > > >
> > > > +1
> > > >
> > > > [1]:
> > > >
> > > >
> > >
> >
> https://github.com/apache/arrow-datafusion/tree/8265e99d05382fca57cc7399f8ee241966f4a1f5
> > > > [2]:
> > > >
> > > >
> > >
> >
> https://dist.apache.org/repos/dist/dev/arrow/apache-arrow-datafusion-29.0.0-rc1
> > > > [3]:
> > > >
> > > >
> > >
> >
> https://github.com/apache/arrow-datafusion/blob/8265e99d05382fca57cc7399f8ee241966f4a1f5/CHANGELOG.md
> > > >
> > >
> >
>
>
> --
> Daniël Heres
>


[VOTE][RUST][DataFusion] Release Apache Arrow DataFusion 29.0.0 RC1

2023-08-11 Thread Andy Grove
Hi,

I would like to propose a release of Apache Arrow DataFusion Implementation,
version 29.0.0.

This release candidate is based on commit:
8265e99d05382fca57cc7399f8ee241966f4a1f5 [1]
The proposed release tarball and signatures are hosted at [2].
The changelog is located at [3].

Please download, verify checksums and signatures, run the unit tests, and
vote
on the release. The vote will be open for at least 72 hours.

Only votes from PMC members are binding, but all members of the community
are
encouraged to test the release and vote with "(non-binding)".

The standard verification procedure is documented at
https://github.com/apache/arrow-datafusion/blob/main/dev/release/README.md#verifying-release-candidates
.

[ ] +1 Release this as Apache Arrow DataFusion 29.0.0
[ ] +0
[ ] -1 Do not release this as Apache Arrow DataFusion 29.0.0 because...

Here is my vote:

+1

[1]:
https://github.com/apache/arrow-datafusion/tree/8265e99d05382fca57cc7399f8ee241966f4a1f5
[2]:
https://dist.apache.org/repos/dist/dev/arrow/apache-arrow-datafusion-29.0.0-rc1
[3]:
https://github.com/apache/arrow-datafusion/blob/8265e99d05382fca57cc7399f8ee241966f4a1f5/CHANGELOG.md


Re: [VOTE][RUST][DataFusion] Release DataFusion Python Bindings 28.0.0 RC1

2023-08-06 Thread Andy Grove
The vote passes with 4 +1 votes (3 binding). Thanks, everyone. The release
has been published.

On Sat, Aug 5, 2023 at 11:38 AM vin jake  wrote:

> +1 (binding)
>
> Verified on my M1 Mac
>
> thanks andy
>
> On Tue, Aug 1, 2023, 23:06 Andy Grove  wrote:
>
> > Hi,
> >
> > I would like to propose a release of Apache Arrow DataFusion Python
> > Bindings,
> > version 28.0.0.
> >
> > This release candidate is based on commit:
> > ffd15410c01868f5ed62c5fb2db2a460b42e06b3 [1]
> > The proposed release tarball and signatures are hosted at [2].
> > The changelog is located at [3].
> > The Python wheels are located at [4].
> >
> > Please download, verify checksums and signatures, run the unit tests, and
> > vote
> > on the release. The vote will be open for at least 72 hours.
> >
> > Only votes from PMC members are binding, but all members of the community
> > are
> > encouraged to test the release and vote with "(non-binding)".
> >
> > The standard verification procedure is documented at
> >
> >
> https://github.com/apache/arrow-datafusion-python/blob/main/dev/release/README.md#verifying-release-candidates
> > .
> >
> > [ ] +1 Release this as Apache Arrow DataFusion Python 28.0.0
> > [ ] +0
> > [ ] -1 Do not release this as Apache Arrow DataFusion Python 28.0.0
> > because...
> >
> > Here is my vote:
> >
> > +1
> >
> > [1]:
> >
> >
> https://github.com/apache/arrow-datafusion-python/tree/ffd15410c01868f5ed62c5fb2db2a460b42e06b3
> > [2]:
> >
> >
> https://dist.apache.org/repos/dist/dev/arrow/apache-arrow-datafusion-python-28.0.0-rc1
> > [3]:
> >
> >
> https://github.com/apache/arrow-datafusion-python/blob/ffd15410c01868f5ed62c5fb2db2a460b42e06b3/CHANGELOG.md
> > [4]: https://test.pypi.org/project/datafusion/28.0.0/
> >
>


[RESULT][VOTE][RUST][DataFusion] Release DataFusion Python Bindings 28.0.0 RC1

2023-08-06 Thread Andy Grove
On Sun, Aug 6, 2023 at 9:57 AM Andy Grove  wrote:

> The vote passes with 4 +1 votes (3 binding). Thanks, everyone. The release
> has been published.
>
> On Sat, Aug 5, 2023 at 11:38 AM vin jake  wrote:
>
>> +1 (binding)
>>
>> Verified on my M1 Mac
>>
>> thanks andy
>>
>> On Tue, Aug 1, 2023, 23:06 Andy Grove  wrote:
>>
>> > Hi,
>> >
>> > I would like to propose a release of Apache Arrow DataFusion Python
>> > Bindings,
>> > version 28.0.0.
>> >
>> > This release candidate is based on commit:
>> > ffd15410c01868f5ed62c5fb2db2a460b42e06b3 [1]
>> > The proposed release tarball and signatures are hosted at [2].
>> > The changelog is located at [3].
>> > The Python wheels are located at [4].
>> >
>> > Please download, verify checksums and signatures, run the unit tests,
>> and
>> > vote
>> > on the release. The vote will be open for at least 72 hours.
>> >
>> > Only votes from PMC members are binding, but all members of the
>> community
>> > are
>> > encouraged to test the release and vote with "(non-binding)".
>> >
>> > The standard verification procedure is documented at
>> >
>> >
>> https://github.com/apache/arrow-datafusion-python/blob/main/dev/release/README.md#verifying-release-candidates
>> > .
>> >
>> > [ ] +1 Release this as Apache Arrow DataFusion Python 28.0.0
>> > [ ] +0
>> > [ ] -1 Do not release this as Apache Arrow DataFusion Python 28.0.0
>> > because...
>> >
>> > Here is my vote:
>> >
>> > +1
>> >
>> > [1]:
>> >
>> >
>> https://github.com/apache/arrow-datafusion-python/tree/ffd15410c01868f5ed62c5fb2db2a460b42e06b3
>> > [2]:
>> >
>> >
>> https://dist.apache.org/repos/dist/dev/arrow/apache-arrow-datafusion-python-28.0.0-rc1
>> > [3]:
>> >
>> >
>> https://github.com/apache/arrow-datafusion-python/blob/ffd15410c01868f5ed62c5fb2db2a460b42e06b3/CHANGELOG.md
>> > [4]: https://test.pypi.org/project/datafusion/28.0.0/
>> >
>>
>


Re: [VOTE][RUST][DataFusion] Release DataFusion Python Bindings 28.0.0 RC1

2023-08-04 Thread Andy Grove
We need one more binding PMC vote. Is anyone available to do that?

Thanks!

On Thu, Aug 3, 2023 at 8:27 AM Jeremy Dyer  wrote:

> +1 (non-binding)
>
> Verified with x86 ubuntu machine and also M1 OSX.
>
> Thanks for getting the release together Andy
>
> - Jeremy Dyer
>
> On Tue, Aug 1, 2023 at 12:30 PM L. C. Hsieh  wrote:
>
> > +1 (binding)
> >
> > Verified on M1 Mac.
> >
> > Thanks Andy.
> >
> > On Tue, Aug 1, 2023 at 8:06 AM Andy Grove  wrote:
> > >
> > > Hi,
> > >
> > > I would like to propose a release of Apache Arrow DataFusion Python
> > > Bindings,
> > > version 28.0.0.
> > >
> > > This release candidate is based on commit:
> > > ffd15410c01868f5ed62c5fb2db2a460b42e06b3 [1]
> > > The proposed release tarball and signatures are hosted at [2].
> > > The changelog is located at [3].
> > > The Python wheels are located at [4].
> > >
> > > Please download, verify checksums and signatures, run the unit tests,
> and
> > > vote
> > > on the release. The vote will be open for at least 72 hours.
> > >
> > > Only votes from PMC members are binding, but all members of the
> community
> > > are
> > > encouraged to test the release and vote with "(non-binding)".
> > >
> > > The standard verification procedure is documented at
> > >
> >
> https://github.com/apache/arrow-datafusion-python/blob/main/dev/release/README.md#verifying-release-candidates
> > > .
> > >
> > > [ ] +1 Release this as Apache Arrow DataFusion Python 28.0.0
> > > [ ] +0
> > > [ ] -1 Do not release this as Apache Arrow DataFusion Python 28.0.0
> > > because...
> > >
> > > Here is my vote:
> > >
> > > +1
> > >
> > > [1]:
> > >
> >
> https://github.com/apache/arrow-datafusion-python/tree/ffd15410c01868f5ed62c5fb2db2a460b42e06b3
> > > [2]:
> > >
> >
> https://dist.apache.org/repos/dist/dev/arrow/apache-arrow-datafusion-python-28.0.0-rc1
> > > [3]:
> > >
> >
> https://github.com/apache/arrow-datafusion-python/blob/ffd15410c01868f5ed62c5fb2db2a460b42e06b3/CHANGELOG.md
> > > [4]: https://test.pypi.org/project/datafusion/28.0.0/
> >
>


Re: [VOTE][RUST][DataFusion] Release DataFusion Python Bindings 28.0.0 RC1

2023-08-01 Thread Andy Grove
Wheels are now uploaded.

On Tue, Aug 1, 2023 at 9:17 AM Andy Grove  wrote:

> I sent this email prematurely ... the wheels are not yet uploaded to
> test.pypi. I will send an update once they are.
>
> Andy.
>
> On Tue, Aug 1, 2023 at 9:06 AM Andy Grove  wrote:
>
>> Hi,
>>
>> I would like to propose a release of Apache Arrow DataFusion Python
>> Bindings,
>> version 28.0.0.
>>
>> This release candidate is based on commit:
>> ffd15410c01868f5ed62c5fb2db2a460b42e06b3 [1]
>> The proposed release tarball and signatures are hosted at [2].
>> The changelog is located at [3].
>> The Python wheels are located at [4].
>>
>> Please download, verify checksums and signatures, run the unit tests, and
>> vote
>> on the release. The vote will be open for at least 72 hours.
>>
>> Only votes from PMC members are binding, but all members of the community
>> are
>> encouraged to test the release and vote with "(non-binding)".
>>
>> The standard verification procedure is documented at
>> https://github.com/apache/arrow-datafusion-python/blob/main/dev/release/README.md#verifying-release-candidates
>> .
>>
>> [ ] +1 Release this as Apache Arrow DataFusion Python 28.0.0
>> [ ] +0
>> [ ] -1 Do not release this as Apache Arrow DataFusion Python 28.0.0
>> because...
>>
>> Here is my vote:
>>
>> +1
>>
>> [1]:
>> https://github.com/apache/arrow-datafusion-python/tree/ffd15410c01868f5ed62c5fb2db2a460b42e06b3
>> [2]:
>> https://dist.apache.org/repos/dist/dev/arrow/apache-arrow-datafusion-python-28.0.0-rc1
>> [3]:
>> https://github.com/apache/arrow-datafusion-python/blob/ffd15410c01868f5ed62c5fb2db2a460b42e06b3/CHANGELOG.md
>> [4]: https://test.pypi.org/project/datafusion/28.0.0/
>>
>


Re: [VOTE][RUST][DataFusion] Release DataFusion Python Bindings 28.0.0 RC1

2023-08-01 Thread Andy Grove
I sent this email prematurely ... the wheels are not yet uploaded to
test.pypi. I will send an update once they are.

Andy.

On Tue, Aug 1, 2023 at 9:06 AM Andy Grove  wrote:

> Hi,
>
> I would like to propose a release of Apache Arrow DataFusion Python
> Bindings,
> version 28.0.0.
>
> This release candidate is based on commit:
> ffd15410c01868f5ed62c5fb2db2a460b42e06b3 [1]
> The proposed release tarball and signatures are hosted at [2].
> The changelog is located at [3].
> The Python wheels are located at [4].
>
> Please download, verify checksums and signatures, run the unit tests, and
> vote
> on the release. The vote will be open for at least 72 hours.
>
> Only votes from PMC members are binding, but all members of the community
> are
> encouraged to test the release and vote with "(non-binding)".
>
> The standard verification procedure is documented at
> https://github.com/apache/arrow-datafusion-python/blob/main/dev/release/README.md#verifying-release-candidates
> .
>
> [ ] +1 Release this as Apache Arrow DataFusion Python 28.0.0
> [ ] +0
> [ ] -1 Do not release this as Apache Arrow DataFusion Python 28.0.0
> because...
>
> Here is my vote:
>
> +1
>
> [1]:
> https://github.com/apache/arrow-datafusion-python/tree/ffd15410c01868f5ed62c5fb2db2a460b42e06b3
> [2]:
> https://dist.apache.org/repos/dist/dev/arrow/apache-arrow-datafusion-python-28.0.0-rc1
> [3]:
> https://github.com/apache/arrow-datafusion-python/blob/ffd15410c01868f5ed62c5fb2db2a460b42e06b3/CHANGELOG.md
> [4]: https://test.pypi.org/project/datafusion/28.0.0/
>


[VOTE][RUST][DataFusion] Release DataFusion Python Bindings 28.0.0 RC1

2023-08-01 Thread Andy Grove
Hi,

I would like to propose a release of Apache Arrow DataFusion Python
Bindings,
version 28.0.0.

This release candidate is based on commit:
ffd15410c01868f5ed62c5fb2db2a460b42e06b3 [1]
The proposed release tarball and signatures are hosted at [2].
The changelog is located at [3].
The Python wheels are located at [4].

Please download, verify checksums and signatures, run the unit tests, and
vote
on the release. The vote will be open for at least 72 hours.

Only votes from PMC members are binding, but all members of the community
are
encouraged to test the release and vote with "(non-binding)".

The standard verification procedure is documented at
https://github.com/apache/arrow-datafusion-python/blob/main/dev/release/README.md#verifying-release-candidates
.

[ ] +1 Release this as Apache Arrow DataFusion Python 28.0.0
[ ] +0
[ ] -1 Do not release this as Apache Arrow DataFusion Python 28.0.0
because...

Here is my vote:

+1

[1]:
https://github.com/apache/arrow-datafusion-python/tree/ffd15410c01868f5ed62c5fb2db2a460b42e06b3
[2]:
https://dist.apache.org/repos/dist/dev/arrow/apache-arrow-datafusion-python-28.0.0-rc1
[3]:
https://github.com/apache/arrow-datafusion-python/blob/ffd15410c01868f5ed62c5fb2db2a460b42e06b3/CHANGELOG.md
[4]: https://test.pypi.org/project/datafusion/28.0.0/


[RESULT][VOTE][RUST][DataFusion] Release Apache Arrow DataFusion 28.0.0 RC1

2023-07-25 Thread Andy Grove
On Tue, Jul 25, 2023 at 8:42 AM Andy Grove  wrote:

> The vote passes with 5 +1 votes (4 binding). Thanks for verifying the
> release.
>
> I have published the release.
>
> On Mon, Jul 24, 2023 at 7:19 AM Andrew Lamb  wrote:
>
>> +1 (binding)
>>
>> Verified in x86_64 mac
>>
>> Thank you very much Andy.
>> Andrew
>>
>> On Sun, Jul 23, 2023 at 9:31 AM vin jake  wrote:
>>
>> > +1 (binding)
>> >
>> > Verified on my M1 macbook.
>> >
>> > Thanks Andy
>> >
>>
>


Re: [VOTE][RUST][DataFusion] Release Apache Arrow DataFusion 28.0.0 RC1

2023-07-25 Thread Andy Grove
The vote passes with 5 +1 votes (4 binding). Thanks for verifying the
release.

I have published the release.

On Mon, Jul 24, 2023 at 7:19 AM Andrew Lamb  wrote:

> +1 (binding)
>
> Verified in x86_64 mac
>
> Thank you very much Andy.
> Andrew
>
> On Sun, Jul 23, 2023 at 9:31 AM vin jake  wrote:
>
> > +1 (binding)
> >
> > Verified on my M1 macbook.
> >
> > Thanks Andy
> >
>


[VOTE][RUST][DataFusion] Release Apache Arrow DataFusion 28.0.0 RC1

2023-07-22 Thread Andy Grove
Hi,

I would like to propose a release of Apache Arrow DataFusion Implementation,
version 28.0.0.

This release candidate is based on commit:
51b4392577554becf637a8adcefa0e7fdc79e41f [1]
The proposed release tarball and signatures are hosted at [2].
The changelog is located at [3].

Please download, verify checksums and signatures, run the unit tests, and
vote
on the release. The vote will be open for at least 72 hours.

Only votes from PMC members are binding, but all members of the community
are
encouraged to test the release and vote with "(non-binding)".

The standard verification procedure is documented at
https://github.com/apache/arrow-datafusion/blob/main/dev/release/README.md#verifying-release-candidates
.

[ ] +1 Release this as Apache Arrow DataFusion 28.0.0
[ ] +0
[ ] -1 Do not release this as Apache Arrow DataFusion 28.0.0 because...

Here is my vote:

+1

[1]:
https://github.com/apache/arrow-datafusion/tree/51b4392577554becf637a8adcefa0e7fdc79e41f
[2]:
https://dist.apache.org/repos/dist/dev/arrow/apache-arrow-datafusion-28.0.0-rc1
[3]:
https://github.com/apache/arrow-datafusion/blob/51b4392577554becf637a8adcefa0e7fdc79e41f/CHANGELOG.md


[RESULT][VOTE][RUST][DataFusion] Release DataFusion Python Bindings 27.0.0 RC1

2023-07-08 Thread Andy Grove
On Sat, Jul 8, 2023 at 10:55 AM Andy Grove  wrote:

> The vote passes with 5 +1 votes (4 binding). Thanks, everyone. The release
> has been published.
>
> On Fri, Jul 7, 2023 at 10:22 AM vin jake  wrote:
>
>> +1 (binding)
>>
>> Verified on M1 Mac.
>>
>> Thanks Andy
>>
>> On Thu, Jul 6, 2023, 01:13 Andy Grove  wrote:
>>
>> > Hi,
>> >
>> > I would like to propose a release of Apache Arrow DataFusion Python
>> > Bindings,
>> > version 27.0.0.
>> >
>> > This release candidate is based on commit:
>> > 3f81513d6c5fd109bdf8c509f81c0a587924d354 [1]
>> > The proposed release tarball and signatures are hosted at [2].
>> > The changelog is located at [3].
>> > The Python wheels are located at [4].
>> >
>> > Please download, verify checksums and signatures, run the unit tests,
>> and
>> > vote
>> > on the release. The vote will be open for at least 72 hours.
>> >
>> > Only votes from PMC members are binding, but all members of the
>> community
>> > are
>> > encouraged to test the release and vote with "(non-binding)".
>> >
>> > The standard verification procedure is documented at
>> >
>> >
>> https://github.com/apache/arrow-datafusion-python/blob/main/dev/release/README.md#verifying-release-candidates
>> > .
>> >
>> > [ ] +1 Release this as Apache Arrow DataFusion Python 27.0.0
>> > [ ] +0
>> > [ ] -1 Do not release this as Apache Arrow DataFusion Python 27.0.0
>> > because...
>> >
>> > Here is my vote:
>> >
>> > +1
>> >
>> > [1]:
>> >
>> >
>> https://github.com/apache/arrow-datafusion-python/tree/3f81513d6c5fd109bdf8c509f81c0a587924d354
>> > [2]:
>> >
>> >
>> https://dist.apache.org/repos/dist/dev/arrow/apache-arrow-datafusion-python-27.0.0-rc1
>> > [3]:
>> >
>> >
>> https://github.com/apache/arrow-datafusion-python/blob/3f81513d6c5fd109bdf8c509f81c0a587924d354/CHANGELOG.md
>> > [4]: https://test.pypi.org/project/datafusion/27.0.0/
>> >
>>
>


Re: [VOTE][RUST][DataFusion] Release DataFusion Python Bindings 27.0.0 RC1

2023-07-08 Thread Andy Grove
The vote passes with 5 +1 votes (4 binding). Thanks, everyone. The release
has been published.

On Fri, Jul 7, 2023 at 10:22 AM vin jake  wrote:

> +1 (binding)
>
> Verified on M1 Mac.
>
> Thanks Andy
>
> On Thu, Jul 6, 2023, 01:13 Andy Grove  wrote:
>
> > Hi,
> >
> > I would like to propose a release of Apache Arrow DataFusion Python
> > Bindings,
> > version 27.0.0.
> >
> > This release candidate is based on commit:
> > 3f81513d6c5fd109bdf8c509f81c0a587924d354 [1]
> > The proposed release tarball and signatures are hosted at [2].
> > The changelog is located at [3].
> > The Python wheels are located at [4].
> >
> > Please download, verify checksums and signatures, run the unit tests, and
> > vote
> > on the release. The vote will be open for at least 72 hours.
> >
> > Only votes from PMC members are binding, but all members of the community
> > are
> > encouraged to test the release and vote with "(non-binding)".
> >
> > The standard verification procedure is documented at
> >
> >
> https://github.com/apache/arrow-datafusion-python/blob/main/dev/release/README.md#verifying-release-candidates
> > .
> >
> > [ ] +1 Release this as Apache Arrow DataFusion Python 27.0.0
> > [ ] +0
> > [ ] -1 Do not release this as Apache Arrow DataFusion Python 27.0.0
> > because...
> >
> > Here is my vote:
> >
> > +1
> >
> > [1]:
> >
> >
> https://github.com/apache/arrow-datafusion-python/tree/3f81513d6c5fd109bdf8c509f81c0a587924d354
> > [2]:
> >
> >
> https://dist.apache.org/repos/dist/dev/arrow/apache-arrow-datafusion-python-27.0.0-rc1
> > [3]:
> >
> >
> https://github.com/apache/arrow-datafusion-python/blob/3f81513d6c5fd109bdf8c509f81c0a587924d354/CHANGELOG.md
> > [4]: https://test.pypi.org/project/datafusion/27.0.0/
> >
>


  1   2   3   4   5   6   7   8   9   10   >