The VOTE carries with 4 binding +1 votes, 3 non-binding +1 votes and
one binding +0 vote.
I'm starting the post-release tasks, if anyone wants to help please let me know.
On Fri, Feb 7, 2020 at 12:25 AM Krisztián Szűcs
wrote:
>
> So far we have the following votes:
>
> +0 (binding)
> +1 (binding
Micah Kornfield created ARROW-7788:
--
Summary: Add schema conversion support for map type
Key: ARROW-7788
URL: https://issues.apache.org/jira/browse/ARROW-7788
Project: Apache Arrow
Issue Typ
So far we have the following votes:
+0 (binding)
+1 (binding)
+1 (non-binding)
+1 (binding)
+1 (non-binding)
+1 (binding)
+1 (non-binding)
+1 (binding)
4 +1 (binding)
3 +1 (non-binding)
I'm waiting for votes until tomorrow morning (UTC), then I'm closing the VOTE.
Thanks everyone!
Testing on macOS Catalina
Binaries: OK
Wheels: OK
Verified on macOS and on Linux.
On linux the verification script has failed for python 3.5 and manylinux2010
and manylinux2014 with unsupported platform tag. I've manually checked
these wheels in the python:3.5 docker image, and the wheels were go
Catching up on questions here...
> Typically you can solve this by having enough IO concurrency at once :-)
> I'm not sure having sophisticated global coordination (based on which
> algorithms) would bring anything. Would you care to elaborate?
We aren't proposing *sophisticated* global coordina
Jorge created ARROW-7787:
Summary: Add collect to Table API
Key: ARROW-7787
URL: https://issues.apache.org/jira/browse/ARROW-7787
Project: Apache Arrow
Issue Type: Improvement
Components: R
On Thu, Feb 6, 2020 at 1:30 PM Antoine Pitrou wrote:
>
>
> Le 06/02/2020 à 20:20, Wes McKinney a écrit :
> >> Actually, on a more high-level basis, is the goal to prefetch for
> >> sequential consumption of row groups?
> >>
> >
> > Essentially yes. One "easy" optimization is to prefetch the entire
Le 06/02/2020 à 20:20, Wes McKinney a écrit :
>> Actually, on a more high-level basis, is the goal to prefetch for
>> sequential consumption of row groups?
>>
>
> Essentially yes. One "easy" optimization is to prefetch the entire
> serialized row group. This is an evolution of that idea where we
On Thu, Feb 6, 2020, 12:42 PM Antoine Pitrou wrote:
>
> Le 06/02/2020 à 19:40, Antoine Pitrou a écrit :
> >
> > Le 06/02/2020 à 19:37, Wes McKinney a écrit :
> >> On Thu, Feb 6, 2020, 12:12 PM Antoine Pitrou
> wrote:
> >>
> >>> Le 06/02/2020 à 16:26, Wes McKinney a écrit :
>
> This see
On Thu, Feb 6, 2020, 12:41 PM Antoine Pitrou wrote:
>
> Le 06/02/2020 à 19:37, Wes McKinney a écrit :
> > On Thu, Feb 6, 2020, 12:12 PM Antoine Pitrou wrote:
> >
> >> Le 06/02/2020 à 16:26, Wes McKinney a écrit :
> >>>
> >>> This seems useful, too. It becomes a question of where do you want to
>
Le 06/02/2020 à 19:40, Antoine Pitrou a écrit :
>
> Le 06/02/2020 à 19:37, Wes McKinney a écrit :
>> On Thu, Feb 6, 2020, 12:12 PM Antoine Pitrou wrote:
>>
>>> Le 06/02/2020 à 16:26, Wes McKinney a écrit :
This seems useful, too. It becomes a question of where do you want to
mana
I re-verified the macOS wheels and they worked but I had to hard-code
`MACOSX_DEPLOYMENT_TARGET="10.14"` to get past the cython error I reported
previously. I tried to set that env var dynamically based on your current
OS version but didn't succeed in getting it passed through to pytest,
despite ma
Le 06/02/2020 à 19:37, Wes McKinney a écrit :
> On Thu, Feb 6, 2020, 12:12 PM Antoine Pitrou wrote:
>
>> Le 06/02/2020 à 16:26, Wes McKinney a écrit :
>>>
>>> This seems useful, too. It becomes a question of where do you want to
>>> manage the cached memory segments, however you obtain them. I'
On Thu, Feb 6, 2020, 12:12 PM Antoine Pitrou wrote:
>
> Le 06/02/2020 à 16:26, Wes McKinney a écrit :
> >
> > This seems useful, too. It becomes a question of where do you want to
> > manage the cached memory segments, however you obtain them. I'm
> > arguing that we should not have much custom c
Le 06/02/2020 à 16:26, Wes McKinney a écrit :
>
> This seems useful, too. It becomes a question of where do you want to
> manage the cached memory segments, however you obtain them. I'm
> arguing that we should not have much custom code in the Parquet
> library to manage the prefetched segments
Neal Richardson created ARROW-7786:
--
Summary: [R] Wire up check_metadata in Table.Equals method
Key: ARROW-7786
URL: https://issues.apache.org/jira/browse/ARROW-7786
Project: Apache Arrow
Is
Le 06/02/2020 à 17:07, Wes McKinney a écrit :
> In case folks are interested in how some other systems deal with IO
> management / scheduling, the comments in
>
> https://github.com/apache/impala/blob/master/be/src/runtime/io/disk-io-mgr.h
>
> and related files might be interesting
Thanks. Th
Antoine Pitrou created ARROW-7785:
-
Summary: [C++] sparse_tensor.cc is extremely slow to compile
Key: ARROW-7785
URL: https://issues.apache.org/jira/browse/ARROW-7785
Project: Apache Arrow
Is
In case folks are interested in how some other systems deal with IO
management / scheduling, the comments in
https://github.com/apache/impala/blob/master/be/src/runtime/io/disk-io-mgr.h
and related files might be interesting
On Thu, Feb 6, 2020 at 9:26 AM Wes McKinney wrote:
>
> On Thu, Feb 6,
Antoine Pitrou created ARROW-7784:
-
Summary: [C++] diff.cc is extremely slow to compile
Key: ARROW-7784
URL: https://issues.apache.org/jira/browse/ARROW-7784
Project: Apache Arrow
Issue Type:
On Thu, Feb 6, 2020 at 2:46 AM Antoine Pitrou wrote:
>
> On Wed, 5 Feb 2020 15:46:15 -0600
> Wes McKinney wrote:
> >
> > I'll comment in more detail on some of the other items in due course,
> > but I think this should be handled by an implementation of
> > RandomAccessFile (that wraps a naked Ra
Antoine Pitrou created ARROW-7783:
-
Summary: [C++] ARROW_DATASET should enable ARROW_COMPUTE
Key: ARROW-7783
URL: https://issues.apache.org/jira/browse/ARROW-7783
Project: Apache Arrow
Issue
Ludwik Bielczynski created ARROW-7782:
-
Summary: Losing index information when using write_to_dataset with
partition_cols
Key: ARROW-7782
URL: https://issues.apache.org/jira/browse/ARROW-7782
Proj
Joris Van den Bossche created ARROW-7781:
Summary: [C++][Dataset] Filtering on a non-existent column gives a
segfault
Key: ARROW-7781
URL: https://issues.apache.org/jira/browse/ARROW-7781
Proj
Krisztian Szucs created ARROW-7780:
--
Summary: [Release] Fix Windows wheel RC verification script given
lack of "m" ABI tag in Python 3.8
Key: ARROW-7780
URL: https://issues.apache.org/jira/browse/ARROW-7780
Hello,
A quick update: since Arrow C++ started being fuzzed in OSS-Fuzz, 41
issues (usually crashes) on invalid input have been found, 35 of which
have already been corrected.
We plan to expand the fuzzed areas to cover Parquet files, as well as
serialized Tensor and SparseTensor data.
Regards
On Wed, 5 Feb 2020 16:37:17 -0500
David Li wrote:
>
> As a separate step, prefetching/caching should also make use of a
> global (or otherwise shared) IO thread pool, so that parallel reads of
> different files implicitly coordinate work with each other as well.
> Then, you could queue up reads o
On Wed, 5 Feb 2020 15:46:15 -0600
Wes McKinney wrote:
>
> I'll comment in more detail on some of the other items in due course,
> but I think this should be handled by an implementation of
> RandomAccessFile (that wraps a naked RandomAccessFile) with some
> additional methods, rather than adding
28 matches
Mail list logo