[jira] [Created] (ARROW-8457) [C++] bridge test does not take care of endianness

2020-04-14 Thread Kazuaki Ishizaki (Jira)
Kazuaki Ishizaki created ARROW-8457: --- Summary: [C++] bridge test does not take care of endianness Key: ARROW-8457 URL: https://issues.apache.org/jira/browse/ARROW-8457 Project: Apache Arrow

[jira] [Created] (ARROW-8462) Crash in lib.concat_tables on Windows

2020-04-14 Thread Tom Augspurger (Jira)
Tom Augspurger created ARROW-8462: - Summary: Crash in lib.concat_tables on Windows Key: ARROW-8462 URL: https://issues.apache.org/jira/browse/ARROW-8462 Project: Apache Arrow Issue Type: Bug

[jira] [Created] (ARROW-8456) [Release] Add python script to help curating JIRA

2020-04-14 Thread Krisztian Szucs (Jira)
Krisztian Szucs created ARROW-8456: -- Summary: [Release] Add python script to help curating JIRA Key: ARROW-8456 URL: https://issues.apache.org/jira/browse/ARROW-8456 Project: Apache Arrow

[jira] [Created] (ARROW-8461) [Packaging][deb] Use zstd package for Ubuntu Xenial

2020-04-14 Thread Kouhei Sutou (Jira)
Kouhei Sutou created ARROW-8461: --- Summary: [Packaging][deb] Use zstd package for Ubuntu Xenial Key: ARROW-8461 URL: https://issues.apache.org/jira/browse/ARROW-8461 Project: Apache Arrow Issue

Follow up on ARROW-8451, datafusion part of Arrow

2020-04-14 Thread RĂ©mi Dettai
This is a follow up on https://issues.apache.org/jira/browse/ARROW-8451. First thanks for your answer! It's true that I was also surprised to see all implementations of Arrow mixed up in a single repository! I was really considering the separation of the repositories as a mean to separate

[jira] [Created] (ARROW-8460) [Packaging][deb] Ubuntu Focal build is failed

2020-04-14 Thread Kouhei Sutou (Jira)
Kouhei Sutou created ARROW-8460: --- Summary: [Packaging][deb] Ubuntu Focal build is failed Key: ARROW-8460 URL: https://issues.apache.org/jira/browse/ARROW-8460 Project: Apache Arrow Issue Type:

Re: Follow up on ARROW-8451, datafusion part of Arrow

2020-04-14 Thread Wes McKinney
hi Remi, It's no problem, it's a common question we get. Some developers believe as a matter of principle that large projects should be broken up into many smaller repositories. Arrow is a different than many open source projects. Maintaining protocol-level interoperability (although note that

[jira] [Created] (ARROW-8459) [Dev][Archery] Use a more recent cmake-format

2020-04-14 Thread Krisztian Szucs (Jira)
Krisztian Szucs created ARROW-8459: -- Summary: [Dev][Archery] Use a more recent cmake-format Key: ARROW-8459 URL: https://issues.apache.org/jira/browse/ARROW-8459 Project: Apache Arrow Issue

[jira] [Created] (ARROW-8458) [C++] Prefer the original mirrors for the bundled thirdparty dependencies

2020-04-14 Thread Krisztian Szucs (Jira)
Krisztian Szucs created ARROW-8458: -- Summary: [C++] Prefer the original mirrors for the bundled thirdparty dependencies Key: ARROW-8458 URL: https://issues.apache.org/jira/browse/ARROW-8458 Project:

[NIGHTLY] Arrow Build Report for Job nightly-2020-04-14-3

2020-04-14 Thread Crossbow
Arrow Build Report for Job nightly-2020-04-14-3 All tasks: https://github.com/ursa-labs/crossbow/branches/all?query=nightly-2020-04-14-3 Failed Tasks: - centos-6-amd64: URL: https://github.com/ursa-labs/crossbow/branches/all?query=nightly-2020-04-14-3-github-centos-6-amd64 -

[jira] [Created] (ARROW-8465) [Packaging][Python] Windows py35 wheel build fails because of boost

2020-04-14 Thread Krisztian Szucs (Jira)
Krisztian Szucs created ARROW-8465: -- Summary: [Packaging][Python] Windows py35 wheel build fails because of boost Key: ARROW-8465 URL: https://issues.apache.org/jira/browse/ARROW-8465 Project:

[jira] [Created] (ARROW-8464) [Rust] [DataFusion] Add support for dictionary types

2020-04-14 Thread Andy Grove (Jira)
Andy Grove created ARROW-8464: - Summary: [Rust] [DataFusion] Add support for dictionary types Key: ARROW-8464 URL: https://issues.apache.org/jira/browse/ARROW-8464 Project: Apache Arrow Issue

[jira] [Created] (ARROW-8463) [CI] Balance the nightly test builds between CircleCI, Azure and Github

2020-04-14 Thread Krisztian Szucs (Jira)
Krisztian Szucs created ARROW-8463: -- Summary: [CI] Balance the nightly test builds between CircleCI, Azure and Github Key: ARROW-8463 URL: https://issues.apache.org/jira/browse/ARROW-8463 Project:

[jira] [Created] (ARROW-8466) [Packaging] The python unittests are not running in the windows wheel builds

2020-04-14 Thread Krisztian Szucs (Jira)
Krisztian Szucs created ARROW-8466: -- Summary: [Packaging] The python unittests are not running in the windows wheel builds Key: ARROW-8466 URL: https://issues.apache.org/jira/browse/ARROW-8466

Re: Coordinating / scheduling C++ Parquet-Arrow nested data work (ARROW-1644 and others)

2020-04-14 Thread Micah Kornfield
Hi Wes, Yes, I'm making progress and at this point I anticipate being able to finish it off by next release, possibly without support for round tripping fixed size lists. I've been spending some time thinking about different approaches and have started coding some of the building blocks, which I

[jira] [Created] (ARROW-8441) [C++] Fix crashes on invalid input (OSS-Fuzz)

2020-04-14 Thread Antoine Pitrou (Jira)
Antoine Pitrou created ARROW-8441: - Summary: [C++] Fix crashes on invalid input (OSS-Fuzz) Key: ARROW-8441 URL: https://issues.apache.org/jira/browse/ARROW-8441 Project: Apache Arrow Issue

[NIGHTLY] Arrow Build Report for Job nightly-2020-04-14-1

2020-04-14 Thread Crossbow
Arrow Build Report for Job nightly-2020-04-14-1 All tasks: https://github.com/ursa-labs/crossbow/branches/all?query=nightly-2020-04-14-1 Failed Tasks: - centos-6-amd64: URL: https://github.com/ursa-labs/crossbow/branches/all?query=nightly-2020-04-14-1-github-centos-6-amd64 -

[jira] [Created] (ARROW-8439) [Python] Filesystem docs are outdated

2020-04-14 Thread Joris Van den Bossche (Jira)
Joris Van den Bossche created ARROW-8439: Summary: [Python] Filesystem docs are outdated Key: ARROW-8439 URL: https://issues.apache.org/jira/browse/ARROW-8439 Project: Apache Arrow

[jira] [Created] (ARROW-8440) Refine simd header files

2020-04-14 Thread Yibo Cai (Jira)
Yibo Cai created ARROW-8440: --- Summary: Refine simd header files Key: ARROW-8440 URL: https://issues.apache.org/jira/browse/ARROW-8440 Project: Apache Arrow Issue Type: Improvement

[jira] [Created] (ARROW-8438) [C++] arrow-io-memory-benchmark crashes

2020-04-14 Thread Yibo Cai (Jira)
Yibo Cai created ARROW-8438: --- Summary: [C++] arrow-io-memory-benchmark crashes Key: ARROW-8438 URL: https://issues.apache.org/jira/browse/ARROW-8438 Project: Apache Arrow Issue Type: Bug

[jira] [Created] (ARROW-8442) [Python] NullType.to_pandas_dtype inconsisent with dtype returned in to_pandas/to_numpy

2020-04-14 Thread Joris Van den Bossche (Jira)
Joris Van den Bossche created ARROW-8442: Summary: [Python] NullType.to_pandas_dtype inconsisent with dtype returned in to_pandas/to_numpy Key: ARROW-8442 URL:

[jira] [Created] (ARROW-8443) [Gandiva][C++] Fix round/truncate to no-op for special cases

2020-04-14 Thread Praveen Kumar (Jira)
Praveen Kumar created ARROW-8443: Summary: [Gandiva][C++] Fix round/truncate to no-op for special cases Key: ARROW-8443 URL: https://issues.apache.org/jira/browse/ARROW-8443 Project: Apache Arrow

[jira] [Created] (ARROW-8447) [C++][Dataset] Ensure Scanner::ToTable preserve ordering

2020-04-14 Thread Francois Saint-Jacques (Jira)
Francois Saint-Jacques created ARROW-8447: - Summary: [C++][Dataset] Ensure Scanner::ToTable preserve ordering Key: ARROW-8447 URL: https://issues.apache.org/jira/browse/ARROW-8447 Project:

[jira] [Created] (ARROW-8448) [Package] Can't build apt packages with ubuntu-focal

2020-04-14 Thread Francois Saint-Jacques (Jira)
Francois Saint-Jacques created ARROW-8448: - Summary: [Package] Can't build apt packages with ubuntu-focal Key: ARROW-8448 URL: https://issues.apache.org/jira/browse/ARROW-8448 Project: Apache

Re: Coordinating / scheduling C++ Parquet-Arrow nested data work (ARROW-1644 and others)

2020-04-14 Thread Wes McKinney
hi Micah, I'm glad that we have the write side of nested completed for 0.17.0. As far as completing the read side and then implementing sufficient testing to exercise corner cases in end-to-end reads/writes, do you anticipate being able to work on this in the next 4-6 weeks (obviously the state

[jira] [Created] (ARROW-8445) [Gandiva][UDF] Add a udf for gandiva to extract the first capture in regex.

2020-04-14 Thread ZMZ91 (Jira)
ZMZ91 created ARROW-8445: Summary: [Gandiva][UDF] Add a udf for gandiva to extract the first capture in regex. Key: ARROW-8445 URL: https://issues.apache.org/jira/browse/ARROW-8445 Project: Apache Arrow

[jira] [Created] (ARROW-8446) [Python][Dataset] Detect and use _metadata file in a list of file paths

2020-04-14 Thread Joris Van den Bossche (Jira)
Joris Van den Bossche created ARROW-8446: Summary: [Python][Dataset] Detect and use _metadata file in a list of file paths Key: ARROW-8446 URL: https://issues.apache.org/jira/browse/ARROW-8446

[jira] [Created] (ARROW-8444) [Documentation] Fix spelling errors across the codebase

2020-04-14 Thread Krisztian Szucs (Jira)
Krisztian Szucs created ARROW-8444: -- Summary: [Documentation] Fix spelling errors across the codebase Key: ARROW-8444 URL: https://issues.apache.org/jira/browse/ARROW-8444 Project: Apache Arrow

[jira] [Created] (ARROW-8453) [Integration][Go] Recursive nested types unsupported

2020-04-14 Thread Antoine Pitrou (Jira)
Antoine Pitrou created ARROW-8453: - Summary: [Integration][Go] Recursive nested types unsupported Key: ARROW-8453 URL: https://issues.apache.org/jira/browse/ARROW-8453 Project: Apache Arrow

[NIGHTLY] Arrow Build Report for Job nightly-2020-04-14-2

2020-04-14 Thread Crossbow
Arrow Build Report for Job nightly-2020-04-14-2 All tasks: https://github.com/ursa-labs/crossbow/branches/all?query=nightly-2020-04-14-2 Failed Tasks: - centos-6-amd64: URL: https://github.com/ursa-labs/crossbow/branches/all?query=nightly-2020-04-14-2-github-centos-6-amd64 -

[jira] [Created] (ARROW-8454) [CI] Add 3rdparty Apache dependency tarballs to github

2020-04-14 Thread Krisztian Szucs (Jira)
Krisztian Szucs created ARROW-8454: -- Summary: [CI] Add 3rdparty Apache dependency tarballs to github Key: ARROW-8454 URL: https://issues.apache.org/jira/browse/ARROW-8454 Project: Apache Arrow

[jira] [Created] (ARROW-8455) [Rust] Parquet Arrow column read on partially compatible files

2020-04-14 Thread Remi Dettai (Jira)
Remi Dettai created ARROW-8455: -- Summary: [Rust] Parquet Arrow column read on partially compatible files Key: ARROW-8455 URL: https://issues.apache.org/jira/browse/ARROW-8455 Project: Apache Arrow

[jira] [Created] (ARROW-8449) [R] Use CMAKE_UNITY_BUILD everywhere

2020-04-14 Thread Neal Richardson (Jira)
Neal Richardson created ARROW-8449: -- Summary: [R] Use CMAKE_UNITY_BUILD everywhere Key: ARROW-8449 URL: https://issues.apache.org/jira/browse/ARROW-8449 Project: Apache Arrow Issue Type:

[jira] [Created] (ARROW-8450) [Integration][C++] Implement large list/binary/utf8 integration

2020-04-14 Thread Antoine Pitrou (Jira)
Antoine Pitrou created ARROW-8450: - Summary: [Integration][C++] Implement large list/binary/utf8 integration Key: ARROW-8450 URL: https://issues.apache.org/jira/browse/ARROW-8450 Project: Apache

[jira] [Created] (ARROW-8452) [Go][Integration] Go JSON producer generates incorrect nullable flag for nested types

2020-04-14 Thread Antoine Pitrou (Jira)
Antoine Pitrou created ARROW-8452: - Summary: [Go][Integration] Go JSON producer generates incorrect nullable flag for nested types Key: ARROW-8452 URL: https://issues.apache.org/jira/browse/ARROW-8452

[jira] [Created] (ARROW-8451) [Rust] [Datafusion]

2020-04-14 Thread Remi Dettai (Jira)
Remi Dettai created ARROW-8451: -- Summary: [Rust] [Datafusion] Key: ARROW-8451 URL: https://issues.apache.org/jira/browse/ARROW-8451 Project: Apache Arrow Issue Type: Wish Components: