[jira] [Assigned] (ARROW-6887) [Java] Create prose documentation for using ValueVectors

2019-10-14 Thread Ji Liu (Jira)
[ https://issues.apache.org/jira/browse/ARROW-6887?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ji Liu reassigned ARROW-6887: - Assignee: Ji Liu > [Java] Create prose documentation for using ValueVectors >

[jira] [Resolved] (ARROW-6452) [Java] Override ValueVector toString() method

2019-10-14 Thread Micah Kornfield (Jira)
[ https://issues.apache.org/jira/browse/ARROW-6452?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Micah Kornfield resolved ARROW-6452. Fix Version/s: 1.0.0 Resolution: Fixed Issue resolved by pull request 5271

[jira] [Resolved] (ARROW-6184) [Java] Provide hash table based dictionary encoder

2019-10-14 Thread Micah Kornfield (Jira)
[ https://issues.apache.org/jira/browse/ARROW-6184?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Micah Kornfield resolved ARROW-6184. Fix Version/s: 1.0.0 Resolution: Fixed Issue resolved by pull request 5058

[jira] [Commented] (ARROW-6799) [C++] Plasma JNI component links to flatbuffers::flatbuffers (unnecessarily?)

2019-10-14 Thread Micah Kornfield (Jira)
[ https://issues.apache.org/jira/browse/ARROW-6799?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16951627#comment-16951627 ] Micah Kornfield commented on ARROW-6799: As part of this we should make sure we have this

[jira] [Updated] (ARROW-2892) [Plasma] Implement interface to get Java arrow objects from Plasma

2019-10-14 Thread Micah Kornfield (Jira)
[ https://issues.apache.org/jira/browse/ARROW-2892?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Micah Kornfield updated ARROW-2892: --- Component/s: Java > [Plasma] Implement interface to get Java arrow objects from Plasma >

[jira] [Reopened] (ARROW-2892) [Plasma] Implement interface to get Java arrow objects from Plasma

2019-10-14 Thread Micah Kornfield (Jira)
[ https://issues.apache.org/jira/browse/ARROW-2892?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Micah Kornfield reopened ARROW-2892: > [Plasma] Implement interface to get Java arrow objects from Plasma >

[jira] [Created] (ARROW-6887) [Java] Create prose documentation for using ValueVectors

2019-10-14 Thread Micah Kornfield (Jira)
Micah Kornfield created ARROW-6887: -- Summary: [Java] Create prose documentation for using ValueVectors Key: ARROW-6887 URL: https://issues.apache.org/jira/browse/ARROW-6887 Project: Apache Arrow

[jira] [Resolved] (ARROW-6877) [C++] Boost not found from the correct environment

2019-10-14 Thread Kouhei Sutou (Jira)
[ https://issues.apache.org/jira/browse/ARROW-6877?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Kouhei Sutou resolved ARROW-6877. - Fix Version/s: (was: 0.15.1) Resolution: Fixed Issue resolved by pull request 5654

[jira] [Updated] (ARROW-6882) cannot create a chunked_array from dictionary_encoding result

2019-10-14 Thread Antoine Pitrou (Jira)
[ https://issues.apache.org/jira/browse/ARROW-6882?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Antoine Pitrou updated ARROW-6882: -- Fix Version/s: 0.15.1 > cannot create a chunked_array from dictionary_encoding result >

[jira] [Resolved] (ARROW-6882) cannot create a chunked_array from dictionary_encoding result

2019-10-14 Thread Antoine Pitrou (Jira)
[ https://issues.apache.org/jira/browse/ARROW-6882?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Antoine Pitrou resolved ARROW-6882. --- Fix Version/s: (was: 0.15.1) 1.0.0 Resolution: Fixed Issue

[jira] [Resolved] (ARROW-6283) [Rust] [DataFusion] Implement operator to write query results to partitioned CSV

2019-10-14 Thread Andy Grove (Jira)
[ https://issues.apache.org/jira/browse/ARROW-6283?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andy Grove resolved ARROW-6283. --- Fix Version/s: 1.0.0 Resolution: Fixed Issue resolved by pull request 5640

[jira] [Resolved] (ARROW-4219) [Rust] [Parquet] Implement ArrowReader

2019-10-14 Thread Andy Grove (Jira)
[ https://issues.apache.org/jira/browse/ARROW-4219?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andy Grove resolved ARROW-4219. --- Fix Version/s: 1.0.0 Resolution: Fixed Issue resolved by pull request 5523

[jira] [Commented] (ARROW-6884) [Python][Flight] Make server-side RPC exceptions more friendly?

2019-10-14 Thread David Li (Jira)
[ https://issues.apache.org/jira/browse/ARROW-6884?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16951421#comment-16951421 ] David Li commented on ARROW-6884: - Ah, yeah, that would be a good improvement. (Especially the gRPC bits

[jira] [Commented] (ARROW-6884) [Python][Flight] Make server-side RPC exceptions more friendly?

2019-10-14 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-6884?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16951414#comment-16951414 ] Wes McKinney commented on ARROW-6884: - It might be as simple as showing {code}

[jira] [Updated] (ARROW-6844) [C++][Parquet][Python] List columns read broken with 0.15.0

2019-10-14 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/ARROW-6844?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated ARROW-6844: -- Labels: parquet pull-request-available (was: parquet) > [C++][Parquet][Python] List columns

[jira] [Commented] (ARROW-6884) [Python][Flight] Make server-side RPC exceptions more friendly?

2019-10-14 Thread David Li (Jira)
[ https://issues.apache.org/jira/browse/ARROW-6884?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16951408#comment-16951408 ] David Li commented on ARROW-6884: - I'm a little wary of automatically mirroring server-side exceptions as

[jira] [Created] (ARROW-6886) [C++] arrow::io header nvcc compiler warnings

2019-10-14 Thread Paul Taylor (Jira)
Paul Taylor created ARROW-6886: -- Summary: [C++] arrow::io header nvcc compiler warnings Key: ARROW-6886 URL: https://issues.apache.org/jira/browse/ARROW-6886 Project: Apache Arrow Issue Type:

[jira] [Updated] (ARROW-6885) [Python] Remove superfluous skipped timedelta test

2019-10-14 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/ARROW-6885?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated ARROW-6885: -- Labels: pull-request-available (was: ) > [Python] Remove superfluous skipped timedelta test >

[jira] [Created] (ARROW-6885) [Python] Remove superfluous skipped timedelta test

2019-10-14 Thread Joris Van den Bossche (Jira)
Joris Van den Bossche created ARROW-6885: Summary: [Python] Remove superfluous skipped timedelta test Key: ARROW-6885 URL: https://issues.apache.org/jira/browse/ARROW-6885 Project: Apache

[jira] [Updated] (ARROW-6882) cannot create a chunked_array from dictionary_encoding result

2019-10-14 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/ARROW-6882?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated ARROW-6882: -- Labels: pull-request-available (was: ) > cannot create a chunked_array from

[jira] [Assigned] (ARROW-6882) cannot create a chunked_array from dictionary_encoding result

2019-10-14 Thread Joris Van den Bossche (Jira)
[ https://issues.apache.org/jira/browse/ARROW-6882?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joris Van den Bossche reassigned ARROW-6882: Assignee: Joris Van den Bossche > cannot create a chunked_array from

[jira] [Updated] (ARROW-6837) [C++/Python] access File Footer custom_metadata

2019-10-14 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/ARROW-6837?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated ARROW-6837: -- Labels: pull-request-available (was: ) > [C++/Python] access File Footer custom_metadata >

[jira] [Created] (ARROW-6884) [Python][Flight] Make server-side RPC exceptions more friendly?

2019-10-14 Thread Wes McKinney (Jira)
Wes McKinney created ARROW-6884: --- Summary: [Python][Flight] Make server-side RPC exceptions more friendly? Key: ARROW-6884 URL: https://issues.apache.org/jira/browse/ARROW-6884 Project: Apache Arrow

[jira] [Resolved] (ARROW-6857) [Python][C++] Segfault for dictionary_encode on empty chunked_array (edge case)

2019-10-14 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-6857?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney resolved ARROW-6857. - Resolution: Fixed Issue resolved by pull request 5650

[jira] [Comment Edited] (ARROW-6882) cannot create a chunked_array from dictionary_encoding result

2019-10-14 Thread Joris Van den Bossche (Jira)
[ https://issues.apache.org/jira/browse/ARROW-6882?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16951364#comment-16951364 ] Joris Van den Bossche edited comment on ARROW-6882 at 10/14/19 9:12 PM:

[jira] [Updated] (ARROW-6857) [Python][C++] Segfault for dictionary_encode on empty chunked_array (edge case)

2019-10-14 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-6857?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney updated ARROW-6857: Summary: [Python][C++] Segfault for dictionary_encode on empty chunked_array (edge case) (was:

[jira] [Commented] (ARROW-6882) cannot create a chunked_array from dictionary_encoding result

2019-10-14 Thread Joris Van den Bossche (Jira)
[ https://issues.apache.org/jira/browse/ARROW-6882?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16951364#comment-16951364 ] Joris Van den Bossche commented on ARROW-6882: -- Although, it is only a regression because we

[jira] [Commented] (ARROW-6882) cannot create a chunked_array from dictionary_encoding result

2019-10-14 Thread Joris Van den Bossche (Jira)
[ https://issues.apache.org/jira/browse/ARROW-6882?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16951362#comment-16951362 ] Joris Van den Bossche commented on ARROW-6882: -- Thanks for the report. Labeling as 0.15.1

[jira] [Updated] (ARROW-6882) cannot create a chunked_array from dictionary_encoding result

2019-10-14 Thread Artem KOZHEVNIKOV (Jira)
[ https://issues.apache.org/jira/browse/ARROW-6882?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Artem KOZHEVNIKOV updated ARROW-6882: - Description: I've experienced a strange error raise when trying to apply

[jira] [Updated] (ARROW-6882) cannot create a chunked_array from dictionary_encoding result

2019-10-14 Thread Joris Van den Bossche (Jira)
[ https://issues.apache.org/jira/browse/ARROW-6882?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joris Van den Bossche updated ARROW-6882: - Fix Version/s: 0.15.1 > cannot create a chunked_array from dictionary_encoding

[jira] [Updated] (ARROW-6877) [C++] Boost not found from the correct environment

2019-10-14 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/ARROW-6877?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated ARROW-6877: -- Labels: pull-request-available (was: ) > [C++] Boost not found from the correct environment >

[jira] [Assigned] (ARROW-6877) [C++] Boost not found from the correct environment

2019-10-14 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-6877?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney reassigned ARROW-6877: --- Assignee: Wes McKinney > [C++] Boost not found from the correct environment >

[jira] [Created] (ARROW-6883) [C++] Support sending delta DictionaryBatch or replacement DictionaryBatch in IPC stream writer class

2019-10-14 Thread Wes McKinney (Jira)
Wes McKinney created ARROW-6883: --- Summary: [C++] Support sending delta DictionaryBatch or replacement DictionaryBatch in IPC stream writer class Key: ARROW-6883 URL: https://issues.apache.org/jira/browse/ARROW-6883

[jira] [Commented] (ARROW-6837) [C++/Python] access File Footer custom_metadata

2019-10-14 Thread John Muehlhausen (Jira)
[ https://issues.apache.org/jira/browse/ARROW-6837?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16951353#comment-16951353 ] John Muehlhausen commented on ARROW-6837: - Initially proposed API: {noformat} static Status

[jira] [Closed] (ARROW-6417) [C++][Parquet] Non-dictionary BinaryArray reads from Parquet format have slowed down since 0.11.x

2019-10-14 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-6417?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney closed ARROW-6417. --- Fix Version/s: 0.15.0 Resolution: Fixed This was fixed in 0.15.0 by the jemalloc toolchain

[jira] [Commented] (ARROW-6666) [Rust] [DataFusion] Implement string literal expression

2019-10-14 Thread Kyle McCarthy (Jira)
[ https://issues.apache.org/jira/browse/ARROW-?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16951347#comment-16951347 ] Kyle McCarthy commented on ARROW-: -- Does this require for rust's arrow to implement a

[jira] [Created] (ARROW-6882) cannot create a chunked_array from dictionary_encoding result

2019-10-14 Thread Artem KOZHEVNIKOV (Jira)
Artem KOZHEVNIKOV created ARROW-6882: Summary: cannot create a chunked_array from dictionary_encoding result Key: ARROW-6882 URL: https://issues.apache.org/jira/browse/ARROW-6882 Project: Apache

[jira] [Commented] (ARROW-6876) [Python] Reading parquet file becomes really slow for 0.15.0

2019-10-14 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-6876?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16951339#comment-16951339 ] Wes McKinney commented on ARROW-6876: - Marked this for 0.15.1 > [Python] Reading parquet file

[jira] [Updated] (ARROW-6659) [Rust] [DataFusion] Refactor of HashAggregateExec to support custom merge

2019-10-14 Thread Kyle McCarthy (Jira)
[ https://issues.apache.org/jira/browse/ARROW-6659?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Kyle McCarthy updated ARROW-6659: - Labels: pull-request-available (was: ) > [Rust] [DataFusion] Refactor of HashAggregateExec to

[jira] [Updated] (ARROW-6876) [Python] Reading parquet file becomes really slow for 0.15.0

2019-10-14 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/ARROW-6876?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated ARROW-6876: -- Labels: pull-request-available (was: ) > [Python] Reading parquet file becomes really slow

[jira] [Updated] (ARROW-6876) [Python] Reading parquet file becomes really slow for 0.15.0

2019-10-14 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-6876?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney updated ARROW-6876: Fix Version/s: 0.15.1 1.0.0 > [Python] Reading parquet file becomes really slow

[jira] [Commented] (ARROW-6874) [Python] Memory leak in Table.to_pandas() when nested columns are present

2019-10-14 Thread Joris Van den Bossche (Jira)
[ https://issues.apache.org/jira/browse/ARROW-6874?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16951314#comment-16951314 ] Joris Van den Bossche commented on ARROW-6874: -- This seems to be caused by ARROW-6570

[jira] [Updated] (ARROW-6878) [Python] pa.array() does not handle list of dicts with bytes keys correctly under python3

2019-10-14 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/ARROW-6878?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated ARROW-6878: -- Labels: pull-request-available (was: ) > [Python] pa.array() does not handle list of dicts

[jira] [Updated] (ARROW-6878) [Python] pa.array() does not handle list of dicts with bytes keys correctly under python3

2019-10-14 Thread Antoine Pitrou (Jira)
[ https://issues.apache.org/jira/browse/ARROW-6878?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Antoine Pitrou updated ARROW-6878: -- Fix Version/s: 0.15.1 > [Python] pa.array() does not handle list of dicts with bytes keys

[jira] [Updated] (ARROW-6878) [Python] pa.array() does not handle list of dicts with bytes keys correctly under python3

2019-10-14 Thread Antoine Pitrou (Jira)
[ https://issues.apache.org/jira/browse/ARROW-6878?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Antoine Pitrou updated ARROW-6878: -- Fix Version/s: 1.0.0 > [Python] pa.array() does not handle list of dicts with bytes keys

[jira] [Created] (ARROW-6881) [Rust] Remove "array_ops" in favor of the "compute" sub-module

2019-10-14 Thread Paddy Horan (Jira)
Paddy Horan created ARROW-6881: -- Summary: [Rust] Remove "array_ops" in favor of the "compute" sub-module Key: ARROW-6881 URL: https://issues.apache.org/jira/browse/ARROW-6881 Project: Apache Arrow

[jira] [Created] (ARROW-6880) [Rust] Add explicit SIMD for min/max kernel

2019-10-14 Thread Paddy Horan (Jira)
Paddy Horan created ARROW-6880: -- Summary: [Rust] Add explicit SIMD for min/max kernel Key: ARROW-6880 URL: https://issues.apache.org/jira/browse/ARROW-6880 Project: Apache Arrow Issue Type:

[jira] [Updated] (ARROW-6879) [Rust] Add explicit SIMD for sum kernel

2019-10-14 Thread Paddy Horan (Jira)
[ https://issues.apache.org/jira/browse/ARROW-6879?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Paddy Horan updated ARROW-6879: --- Parent: ARROW-4591 Issue Type: Sub-task (was: Improvement) > [Rust] Add explicit SIMD for

[jira] [Created] (ARROW-6879) [Rust] Add explicit SIMD for sum kernel

2019-10-14 Thread Paddy Horan (Jira)
Paddy Horan created ARROW-6879: -- Summary: [Rust] Add explicit SIMD for sum kernel Key: ARROW-6879 URL: https://issues.apache.org/jira/browse/ARROW-6879 Project: Apache Arrow Issue Type:

[jira] [Assigned] (ARROW-6878) [Python] pa.array() does not handle list of dicts with bytes keys correctly under python3

2019-10-14 Thread Antoine Pitrou (Jira)
[ https://issues.apache.org/jira/browse/ARROW-6878?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Antoine Pitrou reassigned ARROW-6878: - Assignee: Antoine Pitrou > [Python] pa.array() does not handle list of dicts with bytes

[jira] [Updated] (ARROW-6789) [Python] Automatically box bytes/buffer-like values yielded from `FlightServerBase.do_action` in Result values

2019-10-14 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/ARROW-6789?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated ARROW-6789: -- Labels: pull-request-available (was: ) > [Python] Automatically box bytes/buffer-like values

[jira] [Updated] (ARROW-6876) [Python] Reading parquet file becomes really slow for 0.15.0

2019-10-14 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-6876?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney updated ARROW-6876: Summary: [Python] Reading parquet file becomes really slow for 0.15.0 (was: Reading parquet file

[jira] [Assigned] (ARROW-6789) [Python] Automatically box bytes/buffer-like values yielded from `FlightServerBase.do_action` in Result values

2019-10-14 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-6789?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney reassigned ARROW-6789: --- Assignee: Wes McKinney > [Python] Automatically box bytes/buffer-like values yielded from

[jira] [Updated] (ARROW-6874) Memory leak in Table.to_pandas() when nested columns are present

2019-10-14 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-6874?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney updated ARROW-6874: Fix Version/s: 1.0.0 > Memory leak in Table.to_pandas() when nested columns are present >

[jira] [Updated] (ARROW-6878) [Python] pa.array() does not handle list of dicts with bytes keys correctly under python3

2019-10-14 Thread Antoine Pitrou (Jira)
[ https://issues.apache.org/jira/browse/ARROW-6878?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Antoine Pitrou updated ARROW-6878: -- Component/s: Python > [Python] pa.array() does not handle list of dicts with bytes keys

[jira] [Updated] (ARROW-6874) [Python] Memory leak in Table.to_pandas() when nested columns are present

2019-10-14 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-6874?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney updated ARROW-6874: Summary: [Python] Memory leak in Table.to_pandas() when nested columns are present (was: Memory

[jira] [Created] (ARROW-6878) [Python] pa.array() does not handle list of dicts with bytes keys correctly under python3

2019-10-14 Thread Zhuo Peng (Jira)
Zhuo Peng created ARROW-6878: Summary: [Python] pa.array() does not handle list of dicts with bytes keys correctly under python3 Key: ARROW-6878 URL: https://issues.apache.org/jira/browse/ARROW-6878

[jira] [Commented] (ARROW-6876) Reading parquet file becomes really slow for 0.15.0

2019-10-14 Thread Joris Van den Bossche (Jira)
[ https://issues.apache.org/jira/browse/ARROW-6876?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16951214#comment-16951214 ] Joris Van den Bossche commented on ARROW-6876: -- Small reproducer: {code} import pyarrow as

[jira] [Updated] (ARROW-6877) [C++] Boost not found from the correct environment

2019-10-14 Thread Antoine Pitrou (Jira)
[ https://issues.apache.org/jira/browse/ARROW-6877?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Antoine Pitrou updated ARROW-6877: -- Component/s: C++ > [C++] Boost not found from the correct environment >

[jira] [Updated] (ARROW-6877) [C++] Boost not found from the correct environment

2019-10-14 Thread Antoine Pitrou (Jira)
[ https://issues.apache.org/jira/browse/ARROW-6877?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Antoine Pitrou updated ARROW-6877: -- Fix Version/s: 0.15.1 1.0.0 > [C++] Boost not found from the correct

[jira] [Commented] (ARROW-6877) [C++] Boost not found from the correct environment

2019-10-14 Thread Antoine Pitrou (Jira)
[ https://issues.apache.org/jira/browse/ARROW-6877?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16951207#comment-16951207 ] Antoine Pitrou commented on ARROW-6877: --- cc [~wesm] > [C++] Boost not found from the correct

[jira] [Created] (ARROW-6877) [C++] Boost not found from the correct environment

2019-10-14 Thread Joris Van den Bossche (Jira)
Joris Van den Bossche created ARROW-6877: Summary: [C++] Boost not found from the correct environment Key: ARROW-6877 URL: https://issues.apache.org/jira/browse/ARROW-6877 Project: Apache

[jira] [Updated] (ARROW-6857) Segfault for dictionary_encode on empty chunked_array (edge case)

2019-10-14 Thread Antoine Pitrou (Jira)
[ https://issues.apache.org/jira/browse/ARROW-6857?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Antoine Pitrou updated ARROW-6857: -- Fix Version/s: 1.0.0 > Segfault for dictionary_encode on empty chunked_array (edge case) >

[jira] [Commented] (ARROW-6876) Reading parquet file becomes really slow for 0.15.0

2019-10-14 Thread Bob (Jira)
[ https://issues.apache.org/jira/browse/ARROW-6876?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16951177#comment-16951177 ] Bob commented on ARROW-6876: I also tried fastparquet as an engine and it just thrown an error to me when

[jira] [Updated] (ARROW-6873) [Python] Stale CColumn reference break Cython cimport pyarrow

2019-10-14 Thread Uwe Korn (Jira)
[ https://issues.apache.org/jira/browse/ARROW-6873?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Uwe Korn updated ARROW-6873: Fix Version/s: 0.15.1 > [Python] Stale CColumn reference break Cython cimport pyarrow >

[jira] [Commented] (ARROW-6876) Reading parquet file becomes really slow for 0.15.0

2019-10-14 Thread Bob (Jira)
[ https://issues.apache.org/jira/browse/ARROW-6876?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16951176#comment-16951176 ] Bob commented on ARROW-6876: [~jorisvandenbossche] thanks. let me know if I can help. We are very special in

[jira] [Commented] (ARROW-6876) Reading parquet file becomes really slow for 0.15.0

2019-10-14 Thread Joris Van den Bossche (Jira)
[ https://issues.apache.org/jira/browse/ARROW-6876?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16951175#comment-16951175 ] Joris Van den Bossche commented on ARROW-6876: -- Thanks, if it is just floats, I'll try to

[jira] [Comment Edited] (ARROW-6876) Reading parquet file becomes really slow for 0.15.0

2019-10-14 Thread Bob (Jira)
[ https://issues.apache.org/jira/browse/ARROW-6876?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16951172#comment-16951172 ] Bob edited comment on ARROW-6876 at 10/14/19 5:18 PM: -- [~jorisvandenbossche] seems

[jira] [Commented] (ARROW-6876) Reading parquet file becomes really slow for 0.15.0

2019-10-14 Thread Bob (Jira)
[ https://issues.apache.org/jira/browse/ARROW-6876?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16951172#comment-16951172 ] Bob commented on ARROW-6876: [~jorisvandenbossche] seems you guys added this function which caused the issue:

[jira] [Commented] (ARROW-6876) Reading parquet file becomes really slow for 0.15.0

2019-10-14 Thread Bob (Jira)
[ https://issues.apache.org/jira/browse/ARROW-6876?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16951168#comment-16951168 ] Bob commented on ARROW-6876: [~jorisvandenbossche] sorry I cannot share the data with you because they

[jira] [Commented] (ARROW-6876) Reading parquet file becomes really slow for 0.15.0

2019-10-14 Thread Joris Van den Bossche (Jira)
[ https://issues.apache.org/jira/browse/ARROW-6876?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16951164#comment-16951164 ] Joris Van den Bossche commented on ARROW-6876: -- Thanks for the report. Would you be able to

[jira] [Updated] (ARROW-6876) Reading parquet file becomes really slow for 0.15.0

2019-10-14 Thread Bob (Jira)
[ https://issues.apache.org/jira/browse/ARROW-6876?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Bob updated ARROW-6876: --- Description: Hi,   I just noticed that reading a parquet file becomes really slow after I upgraded to 0.15.0 when

[jira] [Updated] (ARROW-6876) Reading parquet file becomes really slow for 0.15.0

2019-10-14 Thread Bob (Jira)
[ https://issues.apache.org/jira/browse/ARROW-6876?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Bob updated ARROW-6876: --- Attachment: image-2019-10-14-18-12-07-652.png > Reading parquet file becomes really slow for 0.15.0 >

[jira] [Updated] (ARROW-6876) Reading parquet file becomes really slow for 0.15.0

2019-10-14 Thread Bob (Jira)
[ https://issues.apache.org/jira/browse/ARROW-6876?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Bob updated ARROW-6876: --- Attachment: image-2019-10-14-18-10-42-850.png > Reading parquet file becomes really slow for 0.15.0 >

[jira] [Comment Edited] (ARROW-6874) Memory leak in Table.to_pandas() when nested columns are present

2019-10-14 Thread Joris Van den Bossche (Jira)
[ https://issues.apache.org/jira/browse/ARROW-6874?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16951148#comment-16951148 ] Joris Van den Bossche edited comment on ARROW-6874 at 10/14/19 5:09 PM:

[jira] [Created] (ARROW-6876) Reading parquet file becomes really slow for 0.15.0

2019-10-14 Thread Bob (Jira)
Bob created ARROW-6876: -- Summary: Reading parquet file becomes really slow for 0.15.0 Key: ARROW-6876 URL: https://issues.apache.org/jira/browse/ARROW-6876 Project: Apache Arrow Issue Type: Bug

[jira] [Assigned] (ARROW-6873) [Python] Stale CColumn reference break Cython cimport pyarrow

2019-10-14 Thread Antoine Pitrou (Jira)
[ https://issues.apache.org/jira/browse/ARROW-6873?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Antoine Pitrou reassigned ARROW-6873: - Assignee: Uwe Korn > [Python] Stale CColumn reference break Cython cimport pyarrow >

[jira] [Resolved] (ARROW-6873) [Python] Stale CColumn reference break Cython cimport pyarrow

2019-10-14 Thread Antoine Pitrou (Jira)
[ https://issues.apache.org/jira/browse/ARROW-6873?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Antoine Pitrou resolved ARROW-6873. --- Fix Version/s: (was: 0.15.1) 1.0.0 Resolution: Fixed Issue

[jira] [Commented] (ARROW-6857) Segfault for dictionary_encode on empty chunked_array (edge case)

2019-10-14 Thread Antoine Pitrou (Jira)
[ https://issues.apache.org/jira/browse/ARROW-6857?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16951152#comment-16951152 ] Antoine Pitrou commented on ARROW-6857: --- Thanks for the report. Indeed it seems like there's a

[jira] [Updated] (ARROW-6857) Segfault for dictionary_encode on empty chunked_array (edge case)

2019-10-14 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/ARROW-6857?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated ARROW-6857: -- Labels: pull-request-available (was: ) > Segfault for dictionary_encode on empty

[jira] [Commented] (ARROW-6874) Memory leak in Table.to_pandas() when nested columns are present

2019-10-14 Thread Joris Van den Bossche (Jira)
[ https://issues.apache.org/jira/browse/ARROW-6874?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16951148#comment-16951148 ] Joris Van den Bossche commented on ARROW-6874: -- Thanks for the report! I tried to

[jira] [Updated] (ARROW-6836) [Format] add a custom_metadata:[KeyValue] field to the Footer table in File.fbs

2019-10-14 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/ARROW-6836?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated ARROW-6836: -- Labels: pull-request-available (was: ) > [Format] add a custom_metadata:[KeyValue] field to

[jira] [Updated] (ARROW-6874) Memory leak in Table.to_pandas() when nested columns are present

2019-10-14 Thread Joris Van den Bossche (Jira)
[ https://issues.apache.org/jira/browse/ARROW-6874?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joris Van den Bossche updated ARROW-6874: - Fix Version/s: 0.15.1 > Memory leak in Table.to_pandas() when nested columns are

[jira] [Created] (ARROW-6875) [Python][Flight] Implement Criteria for ListFlights RPC / list_flights method

2019-10-14 Thread Wes McKinney (Jira)
Wes McKinney created ARROW-6875: --- Summary: [Python][Flight] Implement Criteria for ListFlights RPC / list_flights method Key: ARROW-6875 URL: https://issues.apache.org/jira/browse/ARROW-6875 Project:

[jira] [Assigned] (ARROW-6857) Segfault for dictionary_encode on empty chunked_array (edge case)

2019-10-14 Thread Antoine Pitrou (Jira)
[ https://issues.apache.org/jira/browse/ARROW-6857?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Antoine Pitrou reassigned ARROW-6857: - Assignee: Antoine Pitrou > Segfault for dictionary_encode on empty chunked_array (edge

[jira] [Created] (ARROW-6874) Memory leak in Table.to_pandas() when nested columns are present

2019-10-14 Thread Sergey Mozharov (Jira)
Sergey Mozharov created ARROW-6874: -- Summary: Memory leak in Table.to_pandas() when nested columns are present Key: ARROW-6874 URL: https://issues.apache.org/jira/browse/ARROW-6874 Project: Apache

[jira] [Resolved] (ARROW-6852) [C++] memory-benchmark build failed on Arm64

2019-10-14 Thread Antoine Pitrou (Jira)
[ https://issues.apache.org/jira/browse/ARROW-6852?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Antoine Pitrou resolved ARROW-6852. --- Resolution: Fixed Issue resolved by pull request 5624

[jira] [Updated] (ARROW-6870) [C#] Add Support for Dictionary Arrays and Dictionary Encoding

2019-10-14 Thread Joris Van den Bossche (Jira)
[ https://issues.apache.org/jira/browse/ARROW-6870?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joris Van den Bossche updated ARROW-6870: - Summary: [C#] Add Support for Dictionary Arrays and Dictionary Encoding (was:

[jira] [Updated] (ARROW-6857) Segfault for dictionary_encode on empty chunked_array (edge case)

2019-10-14 Thread Joris Van den Bossche (Jira)
[ https://issues.apache.org/jira/browse/ARROW-6857?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joris Van den Bossche updated ARROW-6857: - Fix Version/s: 0.15.1 > Segfault for dictionary_encode on empty chunked_array

[jira] [Updated] (ARROW-5971) [Website] Blog post introducing Arrow Flight

2019-10-14 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/ARROW-5971?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated ARROW-5971: -- Labels: pull-request-available (was: ) > [Website] Blog post introducing Arrow Flight >

[jira] [Updated] (ARROW-6873) [Python] Stale CColumn reference break Cython cimport pyarrow

2019-10-14 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/ARROW-6873?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated ARROW-6873: -- Labels: pull-request-available (was: ) > [Python] Stale CColumn reference break Cython

[jira] [Created] (ARROW-6873) [Python] Stale CColumn reference break Cython cimport pyarrow

2019-10-14 Thread Uwe Korn (Jira)
Uwe Korn created ARROW-6873: --- Summary: [Python] Stale CColumn reference break Cython cimport pyarrow Key: ARROW-6873 URL: https://issues.apache.org/jira/browse/ARROW-6873 Project: Apache Arrow

[jira] [Created] (ARROW-6872) [C++][Python] Empty table with dictionary-columns raises ArrowNotImplementedError

2019-10-14 Thread Marco Neumann (Jira)
Marco Neumann created ARROW-6872: Summary: [C++][Python] Empty table with dictionary-columns raises ArrowNotImplementedError Key: ARROW-6872 URL: https://issues.apache.org/jira/browse/ARROW-6872

[jira] [Updated] (ARROW-6871) [Java] Enhance TransferPair related parameters check and tests

2019-10-14 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/ARROW-6871?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated ARROW-6871: -- Labels: pull-request-available (was: ) > [Java] Enhance TransferPair related parameters check

[jira] [Commented] (ARROW-6871) [Java] Enhance TransferPair related parameters check and tests

2019-10-14 Thread Ji Liu (Jira)
[ https://issues.apache.org/jira/browse/ARROW-6871?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16950754#comment-16950754 ] Ji Liu commented on ARROW-6871: --- Thanks for your reminder, I will also add a benchmark, if there's no much