[jira] [Created] (ARROW-7542) [CI][C++] nproc isn't available on macOS

2020-01-09 Thread Kouhei Sutou (Jira)
Kouhei Sutou created ARROW-7542: --- Summary: [CI][C++] nproc isn't available on macOS Key: ARROW-7542 URL: https://issues.apache.org/jira/browse/ARROW-7542 Project: Apache Arrow Issue Type:

Re: Coordinating / scheduling C++ Parquet-Arrow nested data work (ARROW-1644 and others)

2020-01-09 Thread Micah Kornfield
Hi Wes, I'm still interested in doing the work. But I don't want to hold anybody up if they have bandwidth. In order to actually make progress on this, my plan will be to: 1. Help with the current Java review backlog through early next week or so (this has been taking the majority of my time allocated

[jira] [Created] (ARROW-7541) [GLib] Install license files

2020-01-09 Thread Kouhei Sutou (Jira)
Kouhei Sutou created ARROW-7541: --- Summary: [GLib] Install license files Key: ARROW-7541 URL: https://issues.apache.org/jira/browse/ARROW-7541 Project: Apache Arrow Issue Type: Improvement

[jira] [Created] (ARROW-7540) [C++] License files aren't installed

2020-01-09 Thread Kouhei Sutou (Jira)
Kouhei Sutou created ARROW-7540: --- Summary: [C++] License files aren't installed Key: ARROW-7540 URL: https://issues.apache.org/jira/browse/ARROW-7540 Project: Apache Arrow Issue Type:

[jira] [Created] (ARROW-7539) [Java] FieldVector getFieldBuffers API should not set reader/writer indices

2020-01-09 Thread Ji Liu (Jira)
Ji Liu created ARROW-7539: - Summary: [Java] FieldVector getFieldBuffers API should not set reader/writer indices Key: ARROW-7539 URL: https://issues.apache.org/jira/browse/ARROW-7539 Project: Apache Arrow

[jira] [Created] (ARROW-7538) Clarify actual and desired size in AllocationManager

2020-01-09 Thread David Li (Jira)
David Li created ARROW-7538: --- Summary: Clarify actual and desired size in AllocationManager Key: ARROW-7538 URL: https://issues.apache.org/jira/browse/ARROW-7538 Project: Apache Arrow Issue Type:

[Discuss][Rust] Policy regarding "unsafe"

2020-01-09 Thread paddy horan
Hi All, This time last year there was a brief discussion on the usage of unsafe in Rust (a user on GitHub raised the issue and I created the JIRA). [1] So far we mostly avoid unsafe in the public APIs. The thinking here is that Arrow is a "development platform", i.e. lower level than most

Re: Timeline for next major release [was Re: Looking to 1.0]

2020-01-09 Thread Jacques Nadeau
Understood and appreciated. Yeah, it can become a bit of a mess. On Thu, Jan 9, 2020 at 12:22 PM Wes McKinney wrote: > Will do -- there were many C++ and Python-related issues that I think > were put in 1.0.0 / 0.16.0 overly optimistically and so I removed the > Fix Version entirely (some of

[jira] [Created] (ARROW-7537) [CI][R] Nightly macOS autobrew job should be more verbose if it fails

2020-01-09 Thread Neal Richardson (Jira)
Neal Richardson created ARROW-7537: -- Summary: [CI][R] Nightly macOS autobrew job should be more verbose if it fails Key: ARROW-7537 URL: https://issues.apache.org/jira/browse/ARROW-7537 Project:

[jira] [Created] (ARROW-7536) [Java] [Dev] `docker-compose pull debian-java` fails

2020-01-09 Thread Antoine Pitrou (Jira)
Antoine Pitrou created ARROW-7536: - Summary: [Java] [Dev] `docker-compose pull debian-java` fails Key: ARROW-7536 URL: https://issues.apache.org/jira/browse/ARROW-7536 Project: Apache Arrow

Re: Timeline for next major release [was Re: Looking to 1.0]

2020-01-09 Thread Wes McKinney
Will do -- there were many C++ and Python-related issues that I think were put in 1.0.0 / 0.16.0 overly optimistically and so I removed the Fix Version entirely (some of these had been pushed off 3-4 major releases ago). I may have removed some Fix Versions from other components that should have

Re: Timeline for next major release [was Re: Looking to 1.0]

2020-01-09 Thread Jacques Nadeau
It would be helpful if, when something is assigned to a release and you want to push it out, you push it to the next release as opposed to removing a fix version entirely. Thanks! On Tue, Jan 7, 2020 at 10:26 AM Wes McKinney wrote: > I just renamed the 1.0.0 release version in JIRA to 0.16.0

[jira] [Created] (ARROW-7534) Create a new java/contrib module

2020-01-09 Thread Jacques Nadeau (Jira)
Jacques Nadeau created ARROW-7534: - Summary: Create a new java/contrib module Key: ARROW-7534 URL: https://issues.apache.org/jira/browse/ARROW-7534 Project: Apache Arrow Issue Type: Task

[jira] [Created] (ARROW-7533) [Java] Move ArrowBufPointer out of the Java memory package

2020-01-09 Thread Jacques Nadeau (Jira)
Jacques Nadeau created ARROW-7533: - Summary: [Java] Move ArrowBufPointer out of the Java memory package Key: ARROW-7533 URL: https://issues.apache.org/jira/browse/ARROW-7533 Project: Apache Arrow

[jira] [Created] (ARROW-7532) [CI] Unskip brew test after Homebrew fixes it upstream

2020-01-09 Thread Neal Richardson (Jira)
Neal Richardson created ARROW-7532: -- Summary: [CI] Unskip brew test after Homebrew fixes it upstream Key: ARROW-7532 URL: https://issues.apache.org/jira/browse/ARROW-7532 Project: Apache Arrow

Re: Pending Java pull requests

2020-01-09 Thread Micah Kornfield
My time has been more limited lately, but I'll try to work through some of these as well over the next couple of days. On Thu, Jan 9, 2020 at 8:44 AM Jacques Nadeau wrote: > I think there are a decent chunk that are of questionable value. We need to > be more willing to simply reject requests

Re: [DRAFT] Apache Arrow Board Report January 2020

2020-01-09 Thread Jacques Nadeau
Posted with correction. Thanks to Wes, Antoine and Todd! On Wed, Jan 8, 2020 at 10:15 AM Wes McKinney wrote: > Not sure what happened there. The two words after "grow" can be removed > > ## Description: > > The mission of Apache Arrow is the creation and maintenance of software > related > to

Re: Pending Java pull requests

2020-01-09 Thread Jacques Nadeau
I think there are a decent chunk that are of questionable value. We need to be more willing to simply reject requests rather than leave them in no-man's land. I'll try to do a pass through and help dispatch, etc. On Thu, Jan 9, 2020 at 5:25 AM Krisztián Szűcs wrote: > Hi, > > Roughly 40% of the

[jira] [Created] (ARROW-7531) [C++] Investigate header cost reduction

2020-01-09 Thread Antoine Pitrou (Jira)
Antoine Pitrou created ARROW-7531: - Summary: [C++] Investigate header cost reduction Key: ARROW-7531 URL: https://issues.apache.org/jira/browse/ARROW-7531 Project: Apache Arrow Issue Type:

Coordinating / scheduling C++ Parquet-Arrow nested data work (ARROW-1644 and others)

2020-01-09 Thread Wes McKinney
Hi folks, I think we have reached a point where the incomplete C++ Parquet nested data assembly/disassembly is harming the value of several other parts of the project, for example the Datasets API. As another example, it's possible to ingest nested data from JSON but not write it to Parquet in

Re: Human-readable version of Arrow Schema?

2020-01-09 Thread Francois Saint-Jacques
The desired goal for this feature is trivial modifications, e.g. within an editor, by data scientists and researchers. I'd go for the Flatbuffers JSON representation as it is stable and has native support in almost any language or editor due to the ubiquity of JSON. The C interface schema string

[NIGHTLY] Arrow Build Report for Job nightly-2020-01-09-0

2020-01-09 Thread Crossbow
Arrow Build Report for Job nightly-2020-01-09-0 All tasks: https://github.com/ursa-labs/crossbow/branches/all?query=nightly-2020-01-09-0 Failed Tasks: - gandiva-jar-osx: URL: https://github.com/ursa-labs/crossbow/branches/all?query=nightly-2020-01-09-0-travis-gandiva-jar-osx -

[jira] [Created] (ARROW-7530) [Developer] Do not include list of commits from PR in squashed summary message

2020-01-09 Thread Wes McKinney (Jira)
Wes McKinney created ARROW-7530: --- Summary: [Developer] Do not include list of commits from PR in squashed summary message Key: ARROW-7530 URL: https://issues.apache.org/jira/browse/ARROW-7530 Project:

Pending Java pull requests

2020-01-09 Thread Krisztián Szűcs
Hi, Roughly 40% of the pending pull requests are tagged as Java [1]. Some of those have long threads and some have not been reviewed yet. Considering the upcoming release, it would be great to close or proceed with them. So any additional help from Java developers would be appreciated!

[jira] [Created] (ARROW-7529) [C++][Gandiva] Handle utf8 characters for castVARCHAR(string, int) function

2020-01-09 Thread Projjal Chanda (Jira)
Projjal Chanda created ARROW-7529: - Summary: [C++][Gandiva] Handle utf8 characters for castVARCHAR(string, int) function Key: ARROW-7529 URL: https://issues.apache.org/jira/browse/ARROW-7529 Project:

[jira] [Created] (ARROW-7528) [Python] The pandas.datetime class (import of datetime.datetime) is deprecated

2020-01-09 Thread Joris Van den Bossche (Jira)
Joris Van den Bossche created ARROW-7528: Summary: [Python] The pandas.datetime class (import of datetime.datetime) is deprecated Key: ARROW-7528 URL: https://issues.apache.org/jira/browse/ARROW-7528

[jira] [Created] (ARROW-7527) [Python] pandas/feather tests failing on pandas master

2020-01-09 Thread Joris Van den Bossche (Jira)
Joris Van den Bossche created ARROW-7527: Summary: [Python] pandas/feather tests failing on pandas master Key: ARROW-7527 URL: https://issues.apache.org/jira/browse/ARROW-7527 Project: Apache

[jira] [Created] (ARROW-7526) [C++][Compute]: Optimize small integer sorting

2020-01-09 Thread Yibo Cai (Jira)
Yibo Cai created ARROW-7526: --- Summary: [C++][Compute]: Optimize small integer sorting Key: ARROW-7526 URL: https://issues.apache.org/jira/browse/ARROW-7526 Project: Apache Arrow Issue Type: