[jira] [Created] (ARROW-6452) [Java] Overrite ValueVector toString() method

2019-09-03 Thread Ji Liu (Jira)
Ji Liu created ARROW-6452: - Summary: [Java] Overrite ValueVector toString() method Key: ARROW-6452 URL: https://issues.apache.org/jira/browse/ARROW-6452 Project: Apache Arrow Issue Type: Improvement

Re: [DISCUSS][FORMAT] Concerning about character encoding of binary string data

2019-09-03 Thread Micah Kornfield
This might be bike-shedding but I agree we should attempt to use extension types for this use-cases. I would expect something like: ARROW:extension:name=NonUtf8String ARROW:extension:metadata = "{\"iso-charset\": "ISO-8859-10"}" The latter's value being a json encoded string, which captures the

Re: [DISCUSS][Java] Should null values in VariableWidthVector/ListVector always takes 0 space?

2019-09-03 Thread Fan Liya
Hi Wes, Thanks for the effort. I will add clarifications. Best, Liya Fan On Wed, Sep 4, 2019 at 11:06 AM Wes McKinney wrote: > I opened https://issues.apache.org/jira/browse/ARROW-6451 > > On Sun, Sep 1, 2019 at 9:59 PM Fan Liya wrote: > > > > Hi Wes, > > > > Thanks for the information. > >

Re: [RESULT] [VOTE] Alter Arrow binary protocol to address 8-byte Flatbuffer alignment requirements (2nd vote)

2019-09-03 Thread Wes McKinney
hi folks, We now have patches up for Java, JS, and Go. How are we doing on the code reviews for getting these in? Since C# implements the binary protocol, the C# developers might want to look at this before the 0.15.0 release also. Absent integration tests it's difficult to verify the C#

Re: [DISCUSS][Java] Should null values in VariableWidthVector/ListVector always takes 0 space?

2019-09-03 Thread Wes McKinney
I opened https://issues.apache.org/jira/browse/ARROW-6451 On Sun, Sep 1, 2019 at 9:59 PM Fan Liya wrote: > > Hi Wes, > > Thanks for the information. > I agree with you that we had better make this clear in the document, to > help users avoid unexpected behaviors. > > Best, > Liya Fan > > On Sun,

[jira] [Created] (ARROW-6451) [Format] Add clarifications to Columnar.rst about the contents of "null" slots in Varbinary or List arrays

2019-09-03 Thread Wes McKinney (Jira)
Wes McKinney created ARROW-6451: --- Summary: [Format] Add clarifications to Columnar.rst about the contents of "null" slots in Varbinary or List arrays Key: ARROW-6451 URL:

[jira] [Created] (ARROW-6450) [C++] Use 2x reallocation strategy in arrow::BufferBuilder instead of 1.5x

2019-09-03 Thread Wes McKinney (Jira)
Wes McKinney created ARROW-6450: --- Summary: [C++] Use 2x reallocation strategy in arrow::BufferBuilder instead of 1.5x Key: ARROW-6450 URL: https://issues.apache.org/jira/browse/ARROW-6450 Project:

Re: KeyValue metadata for column

2019-09-03 Thread Wes McKinney
I opened https://issues.apache.org/jira/browse/PARQUET-1648 so we don't forget about it On Mon, Sep 2, 2019 at 10:25 AM wrote: > > Thanks for the feedback, I'll try to work on this if I find some time. > > Roman > > -Ursprüngliche Nachricht- > Von: Wes McKinney > Gesendet: Montag, 2.

[jira] [Created] (ARROW-6449) [R] io "tell()" methods are inconsistently named and untested

2019-09-03 Thread Neal Richardson (Jira)
Neal Richardson created ARROW-6449: -- Summary: [R] io "tell()" methods are inconsistently named and untested Key: ARROW-6449 URL: https://issues.apache.org/jira/browse/ARROW-6449 Project: Apache

[jira] [Created] (ARROW-6448) [CI] Add crossbow notifications

2019-09-03 Thread Francois Saint-Jacques (Jira)
Francois Saint-Jacques created ARROW-6448: - Summary: [CI] Add crossbow notifications Key: ARROW-6448 URL: https://issues.apache.org/jira/browse/ARROW-6448 Project: Apache Arrow Issue

Re: [DISCUSS][FORMAT] Concerning about character encoding of binary string data

2019-09-03 Thread Sutou Kouhei
Hi, > If people can constrain to use UTF-8 for all the string data, > StringArray is enough for them. But if they cannot unify the character > encoding of string data in UTF-8, should Apache Arrow provides the > standard way of the character encoding management? I think that Apache Arrow users

[jira] [Created] (ARROW-6447) [C++] Builds with ARROW_JEMALLOC=ON wait until jemalloc_ep is complete before building any libarrow .cc files

2019-09-03 Thread Wes McKinney (Jira)
Wes McKinney created ARROW-6447: --- Summary: [C++] Builds with ARROW_JEMALLOC=ON wait until jemalloc_ep is complete before building any libarrow .cc files Key: ARROW-6447 URL:

Arrow sync call tomorrow (September 4) at 12:00 US/Eastern, 16:00 UTC

2019-09-03 Thread Neal Richardson
Hi all, Reminder that the biweekly Arrow call is tomorrow (or today, depending on your time zone ;) at https://meet.google.com/vtm-teks-phx. All are welcome to join. Notes will be sent out to the mailing list afterwards. Neal

Broken nightly builds needing attention before 0.15

2019-09-03 Thread Neal Richardson
Hi all, As there is currently not great visibility into the nightly builds and tests that crossbow runs, it's not surprising that many of them are failing. See https://github.com/ursa-labs/crossbow/branches/all?page=2=nightly-624=%E2%9C%93 for the builds and

[jira] [Created] (ARROW-6446) [CI][Crossbow][Python] Nightly osx wheel jobs fail

2019-09-03 Thread Neal Richardson (Jira)
Neal Richardson created ARROW-6446: -- Summary: [CI][Crossbow][Python] Nightly osx wheel jobs fail Key: ARROW-6446 URL: https://issues.apache.org/jira/browse/ARROW-6446 Project: Apache Arrow

[jira] [Created] (ARROW-6445) [CI][Crossbow] Nightly Gandiva jar trusty job fails

2019-09-03 Thread Neal Richardson (Jira)
Neal Richardson created ARROW-6445: -- Summary: [CI][Crossbow] Nightly Gandiva jar trusty job fails Key: ARROW-6445 URL: https://issues.apache.org/jira/browse/ARROW-6445 Project: Apache Arrow

[jira] [Created] (ARROW-6444) [CI][Crossbow] Nightly conda Windows builds fail (time out)

2019-09-03 Thread Neal Richardson (Jira)
Neal Richardson created ARROW-6444: -- Summary: [CI][Crossbow] Nightly conda Windows builds fail (time out) Key: ARROW-6444 URL: https://issues.apache.org/jira/browse/ARROW-6444 Project: Apache Arrow

[jira] [Created] (ARROW-6443) [CI][Crossbow] Nightly conda osx builds fail

2019-09-03 Thread Neal Richardson (Jira)
Neal Richardson created ARROW-6443: -- Summary: [CI][Crossbow] Nightly conda osx builds fail Key: ARROW-6443 URL: https://issues.apache.org/jira/browse/ARROW-6443 Project: Apache Arrow Issue

[jira] [Created] (ARROW-6442) [CI][Crossbow] Nightly gandiva jar osx build fails

2019-09-03 Thread Neal Richardson (Jira)
Neal Richardson created ARROW-6442: -- Summary: [CI][Crossbow] Nightly gandiva jar osx build fails Key: ARROW-6442 URL: https://issues.apache.org/jira/browse/ARROW-6442 Project: Apache Arrow

[jira] [Created] (ARROW-6441) [CI][Crossbow] Nightly Centos 6 job fails

2019-09-03 Thread Neal Richardson (Jira)
Neal Richardson created ARROW-6441: -- Summary: [CI][Crossbow] Nightly Centos 6 job fails Key: ARROW-6441 URL: https://issues.apache.org/jira/browse/ARROW-6441 Project: Apache Arrow Issue

[jira] [Created] (ARROW-6440) [CI][Crossbow] Nightly ubuntu package builds fail

2019-09-03 Thread Neal Richardson (Jira)
Neal Richardson created ARROW-6440: -- Summary: [CI][Crossbow] Nightly ubuntu package builds fail Key: ARROW-6440 URL: https://issues.apache.org/jira/browse/ARROW-6440 Project: Apache Arrow

[jira] [Created] (ARROW-6439) [R] Implement S3 file-system interface in R

2019-09-03 Thread Neal Richardson (Jira)
Neal Richardson created ARROW-6439: -- Summary: [R] Implement S3 file-system interface in R Key: ARROW-6439 URL: https://issues.apache.org/jira/browse/ARROW-6439 Project: Apache Arrow Issue

[jira] [Created] (ARROW-6438) [R] Add bindings for filesystem API

2019-09-03 Thread Neal Richardson (Jira)
Neal Richardson created ARROW-6438: -- Summary: [R] Add bindings for filesystem API Key: ARROW-6438 URL: https://issues.apache.org/jira/browse/ARROW-6438 Project: Apache Arrow Issue Type: New

[jira] [Created] (ARROW-6437) [R] Add AWS SDK to system dependencies for macOS and Windows

2019-09-03 Thread Neal Richardson (Jira)
Neal Richardson created ARROW-6437: -- Summary: [R] Add AWS SDK to system dependencies for macOS and Windows Key: ARROW-6437 URL: https://issues.apache.org/jira/browse/ARROW-6437 Project: Apache Arrow

[jira] [Created] (ARROW-6436) [C++] vendor a half precision floating point library

2019-09-03 Thread Benjamin Kietzman (Jira)
Benjamin Kietzman created ARROW-6436: Summary: [C++] vendor a half precision floating point library Key: ARROW-6436 URL: https://issues.apache.org/jira/browse/ARROW-6436 Project: Apache Arrow

[jira] [Created] (ARROW-6435) [CI][Crossbow] Nightly dask integration job fails

2019-09-03 Thread Neal Richardson (Jira)
Neal Richardson created ARROW-6435: -- Summary: [CI][Crossbow] Nightly dask integration job fails Key: ARROW-6435 URL: https://issues.apache.org/jira/browse/ARROW-6435 Project: Apache Arrow

[jira] [Created] (ARROW-6434) [CI][Crossbow] Nightly HDFS integration job fails

2019-09-03 Thread Neal Richardson (Jira)
Neal Richardson created ARROW-6434: -- Summary: [CI][Crossbow] Nightly HDFS integration job fails Key: ARROW-6434 URL: https://issues.apache.org/jira/browse/ARROW-6434 Project: Apache Arrow

[jira] [Created] (ARROW-6433) [CI][Crossbow] Nightly java docker job fails

2019-09-03 Thread Neal Richardson (Jira)
Neal Richardson created ARROW-6433: -- Summary: [CI][Crossbow] Nightly java docker job fails Key: ARROW-6433 URL: https://issues.apache.org/jira/browse/ARROW-6433 Project: Apache Arrow Issue

[jira] [Created] (ARROW-6432) [CI][Crossbow] Remove alpine crossbow jobs

2019-09-03 Thread Neal Richardson (Jira)
Neal Richardson created ARROW-6432: -- Summary: [CI][Crossbow] Remove alpine crossbow jobs Key: ARROW-6432 URL: https://issues.apache.org/jira/browse/ARROW-6432 Project: Apache Arrow Issue

[jira] [Created] (ARROW-6431) [CI][Crossbow] Nightly nopandas jobs fail

2019-09-03 Thread Neal Richardson (Jira)
Neal Richardson created ARROW-6431: -- Summary: [CI][Crossbow] Nightly nopandas jobs fail Key: ARROW-6431 URL: https://issues.apache.org/jira/browse/ARROW-6431 Project: Apache Arrow Issue

[jira] [Created] (ARROW-6430) [CI][Crossbow] Nightly R docker job fails

2019-09-03 Thread Neal Richardson (Jira)
Neal Richardson created ARROW-6430: -- Summary: [CI][Crossbow] Nightly R docker job fails Key: ARROW-6430 URL: https://issues.apache.org/jira/browse/ARROW-6430 Project: Apache Arrow Issue

[jira] [Created] (ARROW-6429) [CI][Crossbow] Nightly spark integration job fails

2019-09-03 Thread Neal Richardson (Jira)
Neal Richardson created ARROW-6429: -- Summary: [CI][Crossbow] Nightly spark integration job fails Key: ARROW-6429 URL: https://issues.apache.org/jira/browse/ARROW-6429 Project: Apache Arrow

[jira] [Created] (ARROW-6428) [CI][Crossbow] Nightly turbodbc job fails

2019-09-03 Thread Neal Richardson (Jira)
Neal Richardson created ARROW-6428: -- Summary: [CI][Crossbow] Nightly turbodbc job fails Key: ARROW-6428 URL: https://issues.apache.org/jira/browse/ARROW-6428 Project: Apache Arrow Issue

[jira] [Created] (ARROW-6427) [GLib] Add support for column names autogeneration CSV read option

2019-09-03 Thread Yosuke Shiro (Jira)
Yosuke Shiro created ARROW-6427: --- Summary: [GLib] Add support for column names autogeneration CSV read option Key: ARROW-6427 URL: https://issues.apache.org/jira/browse/ARROW-6427 Project: Apache Arrow

[jira] [Created] (ARROW-6426) [FlightRPC] Expose gRPC configuration knobs in Flight

2019-09-03 Thread lidavidm (Jira)
lidavidm created ARROW-6426: --- Summary: [FlightRPC] Expose gRPC configuration knobs in Flight Key: ARROW-6426 URL: https://issues.apache.org/jira/browse/ARROW-6426 Project: Apache Arrow Issue Type:

[jira] [Created] (ARROW-6425) ValidateArray fail for slice of list array

2019-09-03 Thread Yan Zhou (Jira)
Yan Zhou created ARROW-6425: --- Summary: ValidateArray fail for slice of list array Key: ARROW-6425 URL: https://issues.apache.org/jira/browse/ARROW-6425 Project: Apache Arrow Issue Type: Bug

[jira] [Created] (ARROW-6424) [C++][Fuzzing] Fuzzit nightly is broken

2019-09-03 Thread Marco Neumann (Jira)
Marco Neumann created ARROW-6424: Summary: [C++][Fuzzing] Fuzzit nightly is broken Key: ARROW-6424 URL: https://issues.apache.org/jira/browse/ARROW-6424 Project: Apache Arrow Issue Type: Bug

[jira] [Created] (ARROW-6423) [Python] pyarrow.CompressedOutputStream() never completes with compression='snappy'

2019-09-03 Thread Arik Funke (Jira)
Arik Funke created ARROW-6423: - Summary: [Python] pyarrow.CompressedOutputStream() never completes with compression='snappy' Key: ARROW-6423 URL: https://issues.apache.org/jira/browse/ARROW-6423 Project:

[jira] [Created] (ARROW-6422) [Gandiva][Crossbow] Fix double-conversion linker issue

2019-09-03 Thread Prudhvi Porandla (Jira)
Prudhvi Porandla created ARROW-6422: --- Summary: [Gandiva][Crossbow] Fix double-conversion linker issue Key: ARROW-6422 URL: https://issues.apache.org/jira/browse/ARROW-6422 Project: Apache Arrow

[jira] [Created] (ARROW-6421) Headers of system Boost not found in non-standard location

2019-09-03 Thread Jira
Ingo Müller created ARROW-6421: -- Summary: Headers of system Boost not found in non-standard location Key: ARROW-6421 URL: https://issues.apache.org/jira/browse/ARROW-6421 Project: Apache Arrow