[jira] [Resolved] (ARROW-6898) [Java] Fix potential memory leak in ArrowWriter and several test classes

2019-10-16 Thread Micah Kornfield (Jira)
[ https://issues.apache.org/jira/browse/ARROW-6898?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Micah Kornfield resolved ARROW-6898. Fix Version/s: 0.15.1 1.0.0 Resolution: Fixed Issue resolved by

[jira] [Updated] (ARROW-6650) [Rust] [Integration] Create methods to test Arrow files against Integration JSON

2019-10-16 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/ARROW-6650?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated ARROW-6650: -- Labels: pull-request-available (was: ) > [Rust] [Integration] Create methods to test Arrow

[jira] [Updated] (ARROW-6911) [Java] Provide composite comparator

2019-10-16 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/ARROW-6911?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated ARROW-6911: -- Labels: pull-request-available (was: ) > [Java] Provide composite comparator >

[jira] [Created] (ARROW-6911) [Java] Provide composite comparator

2019-10-16 Thread Liya Fan (Jira)
Liya Fan created ARROW-6911: --- Summary: [Java] Provide composite comparator Key: ARROW-6911 URL: https://issues.apache.org/jira/browse/ARROW-6911 Project: Apache Arrow Issue Type: New Feature

[jira] [Commented] (ARROW-6893) [Packaging] Missing APT package metadata for versions prior to 0.15.0

2019-10-16 Thread Kouhei Sutou (Jira)
[ https://issues.apache.org/jira/browse/ARROW-6893?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16953359#comment-16953359 ] Kouhei Sutou commented on ARROW-6893: - It's a reasonable use case. I'll work on this later. >

[jira] [Updated] (ARROW-6893) [Packaging] Missing APT package metadata for versions prior to 0.15.0

2019-10-16 Thread Kouhei Sutou (Jira)
[ https://issues.apache.org/jira/browse/ARROW-6893?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Kouhei Sutou updated ARROW-6893: Summary: [Packaging] Missing APT package metadata for versions prior to 0.15.0 (was: Missing APT

[jira] [Commented] (ARROW-6910) pyarrow.parquet.read_table(...) takes up lots of memory which is not released until program exits

2019-10-16 Thread V Luong (Jira)
[ https://issues.apache.org/jira/browse/ARROW-6910?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16953357#comment-16953357 ] V Luong commented on ARROW-6910: [~wesm] [~apitrou] ARROW-6874's title states that Table.to_pandas()

[jira] [Assigned] (ARROW-6893) Missing APT package metadata for versions prior to 0.15.0

2019-10-16 Thread Kouhei Sutou (Jira)
[ https://issues.apache.org/jira/browse/ARROW-6893?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Kouhei Sutou reassigned ARROW-6893: --- Assignee: Kouhei Sutou > Missing APT package metadata for versions prior to 0.15.0 >

[jira] [Resolved] (ARROW-6671) [C++] Sparse tensor naming

2019-10-16 Thread Antoine Pitrou (Jira)
[ https://issues.apache.org/jira/browse/ARROW-6671?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Antoine Pitrou resolved ARROW-6671. --- Fix Version/s: 1.0.0 Resolution: Fixed Issue resolved by pull request 5605

[jira] [Assigned] (ARROW-6886) [C++] arrow::io header nvcc compiler warnings

2019-10-16 Thread Antoine Pitrou (Jira)
[ https://issues.apache.org/jira/browse/ARROW-6886?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Antoine Pitrou reassigned ARROW-6886: - Assignee: Antoine Pitrou > [C++] arrow::io header nvcc compiler warnings >

[jira] [Assigned] (ARROW-6111) [Java] Support LargeVarChar and LargeBinary types and add integration test with C++

2019-10-16 Thread Micah Kornfield (Jira)
[ https://issues.apache.org/jira/browse/ARROW-6111?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Micah Kornfield reassigned ARROW-6111: -- Assignee: (was: Micah Kornfield) > [Java] Support LargeVarChar and LargeBinary

[jira] [Commented] (ARROW-6908) Add support for Bazel

2019-10-16 Thread Micah Kornfield (Jira)
[ https://issues.apache.org/jira/browse/ARROW-6908?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16953332#comment-16953332 ] Micah Kornfield commented on ARROW-6908: Going to take a quick look to see what this would take. 

[jira] [Commented] (ARROW-6910) pyarrow.parquet.read_table(...) takes up lots of memory which is not released until program exits

2019-10-16 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-6910?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16953328#comment-16953328 ] Wes McKinney commented on ARROW-6910: - Likely duplicate of ARROW-6874 >

[jira] [Commented] (ARROW-6910) pyarrow.parquet.read_table(...) takes up lots of memory which is not released until program exits

2019-10-16 Thread Antoine Pitrou (Jira)
[ https://issues.apache.org/jira/browse/ARROW-6910?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16953324#comment-16953324 ] Antoine Pitrou commented on ARROW-6910: --- How do you measure memory usage? "RSS"? It may be very

[jira] [Commented] (ARROW-6738) [Java] Fix problems with current union comparison logic

2019-10-16 Thread Liya Fan (Jira)
[ https://issues.apache.org/jira/browse/ARROW-6738?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16953323#comment-16953323 ] Liya Fan commented on ARROW-6738: - [~apitrou] Thanks for your comments. IMO, the only behavioral change

[jira] [Commented] (ARROW-6896) [Java] Vector schema root should not share vectors

2019-10-16 Thread Liya Fan (Jira)
[ https://issues.apache.org/jira/browse/ARROW-6896?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16953322#comment-16953322 ] Liya Fan commented on ARROW-6896: - [~jnadeau] I am sorry. Maybe my description is confusing. I mean the

[jira] [Resolved] (ARROW-6865) [Java] Improve the performance of comparing an ArrowBuf against a byte array

2019-10-16 Thread Liya Fan (Jira)
[ https://issues.apache.org/jira/browse/ARROW-6865?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Liya Fan resolved ARROW-6865. - Resolution: Fixed Resolved by https://github.com/apache/arrow/pull/5632 > [Java] Improve the

[jira] [Updated] (ARROW-6886) [C++] arrow::io header nvcc compiler warnings

2019-10-16 Thread Paul Taylor (Jira)
[ https://issues.apache.org/jira/browse/ARROW-6886?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Paul Taylor updated ARROW-6886: --- Fix Version/s: 0.15.1 > [C++] arrow::io header nvcc compiler warnings >

[jira] [Resolved] (ARROW-6862) [Developer] Check pull request title

2019-10-16 Thread Kouhei Sutou (Jira)
[ https://issues.apache.org/jira/browse/ARROW-6862?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Kouhei Sutou resolved ARROW-6862. - Fix Version/s: 1.0.0 Resolution: Fixed Issue resolved by pull request 5628

[jira] [Updated] (ARROW-6910) pyarrow.parquet.read_table(...) takes up lots of memory which is not released until program exits

2019-10-16 Thread V Luong (Jira)
[ https://issues.apache.org/jira/browse/ARROW-6910?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] V Luong updated ARROW-6910: --- Description: I realize that when I read up a lot of Parquet files using pyarrow.parquet.read_table(...), my

[jira] [Created] (ARROW-6910) pyarrow.parquet.read_table(...) takes up lots of memory which is not released until program exits

2019-10-16 Thread V Luong (Jira)
V Luong created ARROW-6910: -- Summary: pyarrow.parquet.read_table(...) takes up lots of memory which is not released until program exits Key: ARROW-6910 URL: https://issues.apache.org/jira/browse/ARROW-6910

[jira] [Updated] (ARROW-6910) pyarrow.parquet.read_table(...) takes up lots of memory which is not released until program exits

2019-10-16 Thread V Luong (Jira)
[ https://issues.apache.org/jira/browse/ARROW-6910?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] V Luong updated ARROW-6910: --- Description: I realize that when I read up a lot of Parquet files using pyarrow.parquet.read_table(...), my

[jira] [Resolved] (ARROW-6901) [Rust][Parquet] SerializedFileWriter writes total_num_rows as zero

2019-10-16 Thread Paddy Horan (Jira)
[ https://issues.apache.org/jira/browse/ARROW-6901?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Paddy Horan resolved ARROW-6901. Fix Version/s: 1.0.0 Resolution: Fixed Issue resolved by pull request 5672

[jira] [Assigned] (ARROW-6901) [Rust][Parquet] SerializedFileWriter writes total_num_rows as zero

2019-10-16 Thread Paddy Horan (Jira)
[ https://issues.apache.org/jira/browse/ARROW-6901?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Paddy Horan reassigned ARROW-6901: -- Assignee: Matthew Franglen > [Rust][Parquet] SerializedFileWriter writes total_num_rows as

[jira] [Assigned] (ARROW-6704) [C++] Cast from timestamp to higher resolution does not check out of bounds timestamps

2019-10-16 Thread Joris Van den Bossche (Jira)
[ https://issues.apache.org/jira/browse/ARROW-6704?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joris Van den Bossche reassigned ARROW-6704: Assignee: Joris Van den Bossche (was: Zherui Cao) > [C++] Cast from

[jira] [Commented] (ARROW-6704) [C++] Cast from timestamp to higher resolution does not check out of bounds timestamps

2019-10-16 Thread Joris Van den Bossche (Jira)
[ https://issues.apache.org/jira/browse/ARROW-6704?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16953251#comment-16953251 ] Joris Van den Bossche commented on ARROW-6704: -- [~czxrrr] there is already an open PR >

[jira] [Assigned] (ARROW-6704) [C++] Cast from timestamp to higher resolution does not check out of bounds timestamps

2019-10-16 Thread Zherui Cao (Jira)
[ https://issues.apache.org/jira/browse/ARROW-6704?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Zherui Cao reassigned ARROW-6704: - Assignee: Zherui Cao > [C++] Cast from timestamp to higher resolution does not check out of

[jira] [Updated] (ARROW-6886) [C++] arrow::io header nvcc compiler warnings

2019-10-16 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/ARROW-6886?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated ARROW-6886: -- Labels: pull-request-available (was: ) > [C++] arrow::io header nvcc compiler warnings >

[jira] [Closed] (ARROW-6713) [Python] Getting "ArrowIOError: Corrupted file, smaller than file footer" when reading large number of parquet files to ParquetDataset()

2019-10-16 Thread Joris Van den Bossche (Jira)
[ https://issues.apache.org/jira/browse/ARROW-6713?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joris Van den Bossche closed ARROW-6713. Resolution: Not A Problem > [Python] Getting "ArrowIOError: Corrupted file,

[jira] [Commented] (ARROW-6886) [C++] arrow::io header nvcc compiler warnings

2019-10-16 Thread Paul Taylor (Jira)
[ https://issues.apache.org/jira/browse/ARROW-6886?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16953183#comment-16953183 ] Paul Taylor commented on ARROW-6886: [~apitrou] Yeah this warning is benign, but the team is moving

[jira] [Commented] (ARROW-6905) [Packaging][OSX] Nightly builds on MacOS are failing because of brew compile timeouts

2019-10-16 Thread Neal Richardson (Jira)
or: module 'numpy' has no attribute '__version__' Call Stack (most recent call first): src/arrow/python/CMakeLists.txt:23 (find_package) -- Configuring incomplete, errors occurred! See also "/tmp/apache-arrow-20191016-96519-1lgk95h/build/CMakeFiles/CMakeOutput.log". {code} > [Packagi

[jira] [Commented] (ARROW-6905) [Packaging][OSX] Nightly builds on MacOS are failing because of brew compile timeouts

2019-10-16 Thread Neal Richardson (Jira)
[ https://issues.apache.org/jira/browse/ARROW-6905?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16953172#comment-16953172 ] Neal Richardson commented on ARROW-6905: I upgraded to 9.3 and got this message  {code} Error:

[jira] [Updated] (ARROW-6869) [C++] Dictionary "delta" building logic in builder_dict.h produces invalid arrays

2019-10-16 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-6869?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney updated ARROW-6869: Fix Version/s: (was: 0.15.1) > [C++] Dictionary "delta" building logic in builder_dict.h

[jira] [Commented] (ARROW-6738) [Java] Fix problems with current union comparison logic

2019-10-16 Thread Antoine Pitrou (Jira)
[ https://issues.apache.org/jira/browse/ARROW-6738?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16953150#comment-16953150 ] Antoine Pitrou commented on ARROW-6738: --- Is this actually a good idea to include this in a bugfix

[jira] [Updated] (ARROW-6823) [C++][Python][R] Support metadata in the feather format?

2019-10-16 Thread Joris Van den Bossche (Jira)
[ https://issues.apache.org/jira/browse/ARROW-6823?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joris Van den Bossche updated ARROW-6823: - Component/s: R Python C++ > [C++][Python][R]

[jira] [Updated] (ARROW-6057) [Python] Parquet files v2.0 created by spark can't be read by pyarrow

2019-10-16 Thread Joris Van den Bossche (Jira)
[ https://issues.apache.org/jira/browse/ARROW-6057?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joris Van den Bossche updated ARROW-6057: - Labels: parquet (was: ) > [Python] Parquet files v2.0 created by spark can't be

[jira] [Updated] (ARROW-6057) [Python] Parquet files v2.0 created by spark can't be read by pyarrow

2019-10-16 Thread Joris Van den Bossche (Jira)
[ https://issues.apache.org/jira/browse/ARROW-6057?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joris Van den Bossche updated ARROW-6057: - Component/s: C++ > [Python] Parquet files v2.0 created by spark can't be read by

[jira] [Updated] (ARROW-6222) [Python] Serialising numpy array yields `pyarrow.lib.ArrowNotImplementedError: list`

2019-10-16 Thread Joris Van den Bossche (Jira)
[ https://issues.apache.org/jira/browse/ARROW-6222?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joris Van den Bossche updated ARROW-6222: - Component/s: Python > [Python] Serialising numpy array yields >

[jira] [Updated] (ARROW-6698) [Python] Deserialize Python objects using __slots__ in pyarrow.deserialize

2019-10-16 Thread Joris Van den Bossche (Jira)
[ https://issues.apache.org/jira/browse/ARROW-6698?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joris Van den Bossche updated ARROW-6698: - Component/s: Python > [Python] Deserialize Python objects using __slots__ in

[jira] [Updated] (ARROW-6869) [C++] Dictionary "delta" building logic in builder_dict.h produces invalid arrays

2019-10-16 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-6869?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney updated ARROW-6869: Fix Version/s: 0.15.1 > [C++] Dictionary "delta" building logic in builder_dict.h produces invalid

[jira] [Assigned] (ARROW-6869) [C++] Dictionary "delta" building logic in builder_dict.h produces invalid arrays

2019-10-16 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-6869?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney reassigned ARROW-6869: --- Assignee: Wes McKinney > [C++] Dictionary "delta" building logic in builder_dict.h produces

[jira] [Updated] (ARROW-6905) [Packaging][OSX] Nightly builds on MacOS are failing because of brew compile timeouts

2019-10-16 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/ARROW-6905?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated ARROW-6905: -- Labels: pull-request-available (was: ) > [Packaging][OSX] Nightly builds on MacOS are failing

[jira] [Assigned] (ARROW-6905) [Packaging][OSX] Nightly builds on MacOS are failing because of brew compile timeouts

2019-10-16 Thread Neal Richardson (Jira)
[ https://issues.apache.org/jira/browse/ARROW-6905?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Neal Richardson reassigned ARROW-6905: -- Assignee: Neal Richardson > [Packaging][OSX] Nightly builds on MacOS are failing

[jira] [Commented] (ARROW-6905) [Packaging][OSX] Nightly builds on MacOS are failing because of brew compile timeouts

2019-10-16 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-6905?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16953132#comment-16953132 ] Wes McKinney commented on ARROW-6905: - from logs, possibly related {code} Warning: Your Xcode

[jira] [Commented] (ARROW-6909) [Python] Define PyObjectBuffer with Py_XDECREF logic in destructor for object array memory

2019-10-16 Thread Antoine Pitrou (Jira)
[ https://issues.apache.org/jira/browse/ARROW-6909?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16953118#comment-16953118 ] Antoine Pitrou commented on ARROW-6909: --- Note there may be complications with mixed arrays, though

[jira] [Commented] (ARROW-6908) Add support for Bazel

2019-10-16 Thread Brian Hulette (Jira)
[ https://issues.apache.org/jira/browse/ARROW-6908?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16953119#comment-16953119 ] Brian Hulette commented on ARROW-6908: -- Could

[jira] [Created] (ARROW-6909) [Python] Define PyObjectBuffer with Py_XDECREF logic in destructor for object array memory

2019-10-16 Thread Wes McKinney (Jira)
Wes McKinney created ARROW-6909: --- Summary: [Python] Define PyObjectBuffer with Py_XDECREF logic in destructor for object array memory Key: ARROW-6909 URL: https://issues.apache.org/jira/browse/ARROW-6909

[jira] [Assigned] (ARROW-6738) [Java] Fix problems with current union comparison logic

2019-10-16 Thread Antoine Pitrou (Jira)
[ https://issues.apache.org/jira/browse/ARROW-6738?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Antoine Pitrou reassigned ARROW-6738: - Assignee: Antoine Pitrou (was: Liya Fan) > [Java] Fix problems with current union

[jira] [Resolved] (ARROW-6874) [Python] Memory leak in Table.to_pandas() when nested columns are present

2019-10-16 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-6874?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney resolved ARROW-6874. - Resolution: Fixed Issue resolved by pull request 5674

[jira] [Commented] (ARROW-6906) [C++] Use re2 instead of std::regex in Dataset partitionschemes implementation

2019-10-16 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-6906?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16953071#comment-16953071 ] Wes McKinney commented on ARROW-6906: - I think they're using devtoolset-2. The fix is probably to

[jira] [Created] (ARROW-6908) Add support for Bazel

2019-10-16 Thread Aryan Naraghi (Jira)
Aryan Naraghi created ARROW-6908: Summary: Add support for Bazel Key: ARROW-6908 URL: https://issues.apache.org/jira/browse/ARROW-6908 Project: Apache Arrow Issue Type: New Feature

[jira] [Commented] (ARROW-6906) [C++] Use re2 instead of std::regex in Dataset partitionschemes implementation

2019-10-16 Thread Antoine Pitrou (Jira)
[ https://issues.apache.org/jira/browse/ARROW-6906?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16953067#comment-16953067 ] Antoine Pitrou commented on ARROW-6906: --- I don't think we should add new dependencies just to

[jira] [Commented] (ARROW-6874) [Python] Memory leak in Table.to_pandas() when nested columns are present

2019-10-16 Thread Antoine Pitrou (Jira)
[ https://issues.apache.org/jira/browse/ARROW-6874?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16953065#comment-16953065 ] Antoine Pitrou commented on ARROW-6874: --- [~jorisvandenbossche] You were right. Attached PR is a bit

[jira] [Resolved] (ARROW-6903) [Python] Wheels broken after ARROW-6860 changes

2019-10-16 Thread Antoine Pitrou (Jira)
[ https://issues.apache.org/jira/browse/ARROW-6903?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Antoine Pitrou resolved ARROW-6903. --- Fix Version/s: (was: 0.15.1) Resolution: Fixed Issue resolved by pull request

[jira] [Resolved] (ARROW-6876) [Python] Reading parquet file becomes really slow for 0.15.0

2019-10-16 Thread Antoine Pitrou (Jira)
[ https://issues.apache.org/jira/browse/ARROW-6876?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Antoine Pitrou resolved ARROW-6876. --- Fix Version/s: (was: 0.15.1) Resolution: Fixed Issue resolved by pull request

[jira] [Commented] (ARROW-6906) [C++] Use re2 instead of std::regex in Dataset partitionschemes implementation

2019-10-16 Thread Neal Richardson (Jira)
[ https://issues.apache.org/jira/browse/ARROW-6906?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16953056#comment-16953056 ] Neal Richardson commented on ARROW-6906: According to 

[jira] [Commented] (ARROW-6886) [C++] arrow::io header nvcc compiler warnings

2019-10-16 Thread Antoine Pitrou (Jira)
[ https://issues.apache.org/jira/browse/ARROW-6886?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16953055#comment-16953055 ] Antoine Pitrou commented on ARROW-6886: --- Note this is a benign warning, still you can submit a PR

[jira] [Commented] (ARROW-6886) [C++] arrow::io header nvcc compiler warnings

2019-10-16 Thread Antoine Pitrou (Jira)
[ https://issues.apache.org/jira/browse/ARROW-6886?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16953054#comment-16953054 ] Antoine Pitrou commented on ARROW-6886: --- Are you using your own compiler options? > [C++]

[jira] [Created] (ARROW-6907) Allow Plasma store to batch notifications to clients

2019-10-16 Thread Danyang (Jira)
Danyang created ARROW-6907: -- Summary: Allow Plasma store to batch notifications to clients Key: ARROW-6907 URL: https://issues.apache.org/jira/browse/ARROW-6907 Project: Apache Arrow Issue Type:

[jira] [Updated] (ARROW-6886) [C++] arrow::io header nvcc compiler warnings

2019-10-16 Thread Antoine Pitrou (Jira)
[ https://issues.apache.org/jira/browse/ARROW-6886?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Antoine Pitrou updated ARROW-6886: -- Priority: Trivial (was: Major) > [C++] arrow::io header nvcc compiler warnings >

[jira] [Updated] (ARROW-6886) [C++] arrow::io header nvcc compiler warnings

2019-10-16 Thread Antoine Pitrou (Jira)
[ https://issues.apache.org/jira/browse/ARROW-6886?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Antoine Pitrou updated ARROW-6886: -- Issue Type: Bug (was: New Feature) > [C++] arrow::io header nvcc compiler warnings >

[jira] [Updated] (ARROW-6769) [C++][Dataset] End to End dataset integration test case

2019-10-16 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/ARROW-6769?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated ARROW-6769: -- Labels: dataset pull-request-available (was: dataset) > [C++][Dataset] End to End dataset

[jira] [Commented] (ARROW-6906) [C++] Use re2 instead of std::regex in Dataset partitionschemes implementation

2019-10-16 Thread Antoine Pitrou (Jira)
[ https://issues.apache.org/jira/browse/ARROW-6906?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16953034#comment-16953034 ] Antoine Pitrou commented on ARROW-6906: --- What builds / environments does this affect exactly?

[jira] [Updated] (ARROW-6906) Use re2 instead of std::regex in Dataset partitionschemes implementation

2019-10-16 Thread Antoine Pitrou (Jira)
[ https://issues.apache.org/jira/browse/ARROW-6906?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Antoine Pitrou updated ARROW-6906: -- Component/s: C++ > Use re2 instead of std::regex in Dataset partitionschemes implementation >

[jira] [Updated] (ARROW-6906) [C++] Use re2 instead of std::regex in Dataset partitionschemes implementation

2019-10-16 Thread Antoine Pitrou (Jira)
[ https://issues.apache.org/jira/browse/ARROW-6906?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Antoine Pitrou updated ARROW-6906: -- Summary: [C++] Use re2 instead of std::regex in Dataset partitionschemes implementation (was:

[jira] [Created] (ARROW-6906) Use re2 instead of std::regex in Dataset partitionschemes implementation

2019-10-16 Thread Prudhvi Porandla (Jira)
Prudhvi Porandla created ARROW-6906: --- Summary: Use re2 instead of std::regex in Dataset partitionschemes implementation Key: ARROW-6906 URL: https://issues.apache.org/jira/browse/ARROW-6906

[jira] [Updated] (ARROW-6445) [CI][Crossbow] Nightly Gandiva jar trusty job fails

2019-10-16 Thread Ben Kietzman (Jira)
[ https://issues.apache.org/jira/browse/ARROW-6445?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ben Kietzman updated ARROW-6445: Description: https://travis-ci.org/ursa-labs/crossbow/builds/580192384. Error is due to use of

[jira] [Assigned] (ARROW-6445) [CI][Crossbow] Nightly Gandiva jar trusty job fails

2019-10-16 Thread Ben Kietzman (Jira)
[ https://issues.apache.org/jira/browse/ARROW-6445?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ben Kietzman reassigned ARROW-6445: --- Assignee: Ben Kietzman (was: Prudhvi Porandla) > [CI][Crossbow] Nightly Gandiva jar trusty

[jira] [Resolved] (ARROW-6847) [C++] Add a range_expression interface to Iterator<>

2019-10-16 Thread Francois Saint-Jacques (Jira)
[ https://issues.apache.org/jira/browse/ARROW-6847?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Francois Saint-Jacques resolved ARROW-6847. --- Fix Version/s: 1.0.0 Resolution: Fixed Issue resolved by pull request

[jira] [Commented] (ARROW-3850) [Python] Support MapType and StructType for enhanced PySpark integration

2019-10-16 Thread Bryan Cutler (Jira)
[ https://issues.apache.org/jira/browse/ARROW-3850?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16952988#comment-16952988 ] Bryan Cutler commented on ARROW-3850: - I made ARROW-6904 to add MapArray to Arrow Python, once that

[jira] [Created] (ARROW-6905) [Packaging][OSX] Nightly builds on MacOS are failing because of brew compile timeouts

2019-10-16 Thread Krisztian Szucs (Jira)
Krisztian Szucs created ARROW-6905: -- Summary: [Packaging][OSX] Nightly builds on MacOS are failing because of brew compile timeouts Key: ARROW-6905 URL: https://issues.apache.org/jira/browse/ARROW-6905

[jira] [Updated] (ARROW-6874) [Python] Memory leak in Table.to_pandas() when nested columns are present

2019-10-16 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/ARROW-6874?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated ARROW-6874: -- Labels: pull-request-available (was: ) > [Python] Memory leak in Table.to_pandas() when

[jira] [Assigned] (ARROW-6876) [Python] Reading parquet file becomes really slow for 0.15.0

2019-10-16 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-6876?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney reassigned ARROW-6876: --- Assignee: Wes McKinney (was: Antoine Pitrou) > [Python] Reading parquet file becomes

[jira] [Updated] (ARROW-6872) [C++][Python] Empty table with dictionary-columns raises ArrowNotImplementedError

2019-10-16 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-6872?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney updated ARROW-6872: Fix Version/s: 1.0.0 > [C++][Python] Empty table with dictionary-columns raises >

[jira] [Updated] (ARROW-6738) [Java] Fix problems with current union comparison logic

2019-10-16 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-6738?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney updated ARROW-6738: Fix Version/s: 1.0.0 > [Java] Fix problems with current union comparison logic >

[jira] [Commented] (ARROW-6808) [ruby] Doesn't build on windows msys2

2019-10-16 Thread Dominic Sisneros (Jira)
[ https://issues.apache.org/jira/browse/ARROW-6808?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16952960#comment-16952960 ] Dominic Sisneros commented on ARROW-6808: - Thanks it works when I updated my pacman files - yes

[jira] [Commented] (ARROW-6904) [Python] Implement MapArray and MapType

2019-10-16 Thread Bryan Cutler (Jira)
[ https://issues.apache.org/jira/browse/ARROW-6904?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16952955#comment-16952955 ] Bryan Cutler commented on ARROW-6904: - I can work on this > [Python] Implement MapArray and MapType

[jira] [Created] (ARROW-6904) [Python] Implement MapArray and MapType

2019-10-16 Thread Bryan Cutler (Jira)
Bryan Cutler created ARROW-6904: --- Summary: [Python] Implement MapArray and MapType Key: ARROW-6904 URL: https://issues.apache.org/jira/browse/ARROW-6904 Project: Apache Arrow Issue Type:

[jira] [Assigned] (ARROW-6874) [Python] Memory leak in Table.to_pandas() when nested columns are present

2019-10-16 Thread Antoine Pitrou (Jira)
[ https://issues.apache.org/jira/browse/ARROW-6874?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Antoine Pitrou reassigned ARROW-6874: - Assignee: Antoine Pitrou > [Python] Memory leak in Table.to_pandas() when nested

[jira] [Commented] (ARROW-6896) [Java] Vector schema root should not share vectors

2019-10-16 Thread Jacques Nadeau (Jira)
[ https://issues.apache.org/jira/browse/ARROW-6896?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16952941#comment-16952941 ] Jacques Nadeau commented on ARROW-6896: --- I don't understand your comments. Since sharing vectors

[jira] [Assigned] (ARROW-6876) [Python] Reading parquet file becomes really slow for 0.15.0

2019-10-16 Thread Antoine Pitrou (Jira)
[ https://issues.apache.org/jira/browse/ARROW-6876?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Antoine Pitrou reassigned ARROW-6876: - Assignee: Antoine Pitrou > [Python] Reading parquet file becomes really slow for 0.15.0

[jira] [Commented] (ARROW-6900) [Python] PyArrow cant serialize pandas IntegerArray

2019-10-16 Thread Sayed Mohammad Hossein Torabi (Jira)
[ https://issues.apache.org/jira/browse/ARROW-6900?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16952896#comment-16952896 ] Sayed Mohammad Hossein Torabi commented on ARROW-6900: -- [~wesm] Okay, I will do it!

[jira] [Updated] (ARROW-6903) [Python] Wheels broken after ARROW-6860 changes

2019-10-16 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/ARROW-6903?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated ARROW-6903: -- Labels: pull-request-available (was: ) > [Python] Wheels broken after ARROW-6860 changes >

[jira] [Commented] (ARROW-6772) [C++] Add operator== for interfaces with an Equals() method

2019-10-16 Thread Ben Kietzman (Jira)
[ https://issues.apache.org/jira/browse/ARROW-6772?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16952850#comment-16952850 ] Ben Kietzman commented on ARROW-6772: - Use the `util::EqualityComparable` mixin defined in

[jira] [Updated] (ARROW-6900) [Python] PyArrow cant serialize pandas IntegerArray

2019-10-16 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-6900?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney updated ARROW-6900: Summary: [Python] PyArrow cant serialize pandas IntegerArray (was: PyArrow cant serialize pandas

[jira] [Commented] (ARROW-6900) [Python] PyArrow cant serialize pandas IntegerArray

2019-10-16 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-6900?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16952842#comment-16952842 ] Wes McKinney commented on ARROW-6900: - This is not a priority for us to fix right now, but you're

[jira] [Assigned] (ARROW-6903) [Python] Wheels broken after ARROW-6860 changes

2019-10-16 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-6903?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney reassigned ARROW-6903: --- Assignee: Wes McKinney > [Python] Wheels broken after ARROW-6860 changes >

[jira] [Created] (ARROW-6903) [Python] Wheels broken after ARROW-6860 changes

2019-10-16 Thread Wes McKinney (Jira)
Wes McKinney created ARROW-6903: --- Summary: [Python] Wheels broken after ARROW-6860 changes Key: ARROW-6903 URL: https://issues.apache.org/jira/browse/ARROW-6903 Project: Apache Arrow Issue

[jira] [Updated] (ARROW-6899) [Python] to_pandas() not implemented on list

2019-10-16 Thread Joris Van den Bossche (Jira)
[ https://issues.apache.org/jira/browse/ARROW-6899?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joris Van den Bossche updated ARROW-6899: - Summary: [Python] to_pandas() not implemented on list (was: to_pandas() not

[jira] [Created] (ARROW-6902) [C++] Add String*/Binary* support for Compare kernels

2019-10-16 Thread Francois Saint-Jacques (Jira)
Francois Saint-Jacques created ARROW-6902: - Summary: [C++] Add String*/Binary* support for Compare kernels Key: ARROW-6902 URL: https://issues.apache.org/jira/browse/ARROW-6902 Project: Apache

[jira] [Resolved] (ARROW-6814) [C++] Resolve compiler warnings occurred on release build

2019-10-16 Thread Antoine Pitrou (Jira)
[ https://issues.apache.org/jira/browse/ARROW-6814?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Antoine Pitrou resolved ARROW-6814. --- Fix Version/s: 1.0.0 Resolution: Fixed Issue resolved by pull request 5603

[jira] [Commented] (ARROW-6784) [C++][R] Move filter, take, select C++ code from Rcpp to C++ library

2019-10-16 Thread Neal Richardson (Jira)
[ https://issues.apache.org/jira/browse/ARROW-6784?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16952824#comment-16952824 ] Neal Richardson commented on ARROW-6784: See also ARROW-5454 > [C++][R] Move filter, take,

[jira] [Commented] (ARROW-5454) [C++] Implement Take on ChunkedArray for DataFrame use

2019-10-16 Thread Neal Richardson (Jira)
[ https://issues.apache.org/jira/browse/ARROW-5454?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16952823#comment-16952823 ] Neal Richardson commented on ARROW-5454: See also ARROW-6784 > [C++] Implement Take on

[jira] [Updated] (ARROW-6901) [Rust][Parquet] SerializedFileWriter writes total_num_rows as zero

2019-10-16 Thread Matthew Franglen (Jira)
[ https://issues.apache.org/jira/browse/ARROW-6901?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Matthew Franglen updated ARROW-6901: Summary: [Rust][Parquet] SerializedFileWriter writes total_num_rows as zero (was:

[jira] [Commented] (ARROW-6901) [Rust][Parquet] Rust Parquet SerializedFileWriter writes total_num_rows as zero

2019-10-16 Thread Matthew Franglen (Jira)
[ https://issues.apache.org/jira/browse/ARROW-6901?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16952686#comment-16952686 ] Matthew Franglen commented on ARROW-6901: - Opened a

[jira] [Updated] (ARROW-6901) [Rust][Parquet] Rust Parquet SerializedFileWriter writes total_num_rows as zero

2019-10-16 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/ARROW-6901?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated ARROW-6901: -- Labels: pull-request-available (was: ) > [Rust][Parquet] Rust Parquet SerializedFileWriter

[jira] [Updated] (ARROW-6901) [Rust][Parquet] Rust Parquet SerializedFileWriter writes total_num_rows as zero

2019-10-16 Thread Matthew Franglen (Jira)
[ https://issues.apache.org/jira/browse/ARROW-6901?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Matthew Franglen updated ARROW-6901: Description: The SerializedFileWriter does not update total_num_rows at any point. This

[jira] [Updated] (ARROW-6901) [Rust][Parquet] Rust Parquet SerializedFileWriter writes total_num_rows as zero

2019-10-16 Thread Matthew Franglen (Jira)
[ https://issues.apache.org/jira/browse/ARROW-6901?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Matthew Franglen updated ARROW-6901: Description: The SerializedFileWriter does not update total_num_rows at any point. This

[jira] [Updated] (ARROW-6901) [Rust][Parquet] Rust Parquet SerializedFileWriter writes total_num_rows as zero

2019-10-16 Thread Matthew Franglen (Jira)
[ https://issues.apache.org/jira/browse/ARROW-6901?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Matthew Franglen updated ARROW-6901: Description: The SerializedFileWriter does not update total_num_rows at any point. This

[jira] [Created] (ARROW-6901) [Rust][Parquet] Rust Parquet SerializedFileWriter writes total_num_rows as zero

2019-10-16 Thread Matthew Franglen (Jira)
Matthew Franglen created ARROW-6901: --- Summary: [Rust][Parquet] Rust Parquet SerializedFileWriter writes total_num_rows as zero Key: ARROW-6901 URL: https://issues.apache.org/jira/browse/ARROW-6901

  1   2   >