[jira] [Comment Edited] (ARROW-1644) [C++][Parquet] Read and write nested Parquet data with a mix of struct and list nesting levels

2019-08-19 Thread Micah Kornfield (Jira)
[ https://issues.apache.org/jira/browse/ARROW-1644?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16911011#comment-16911011 ] Micah Kornfield edited comment on ARROW-1644 at 8/20/19 5:31 AM: -

[jira] [Commented] (ARROW-1644) [C++][Parquet] Read and write nested Parquet data with a mix of struct and list nesting levels

2019-08-19 Thread Micah Kornfield (Jira)
[ https://issues.apache.org/jira/browse/ARROW-1644?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16911011#comment-16911011 ] Micah Kornfield commented on ARROW-1644: [~bhogan-mitre] there isn't much an update.  I put this

[jira] [Updated] (ARROW-6294) [C++] Use hyphen for plasma-store-server executable

2019-08-19 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/ARROW-6294?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated ARROW-6294: -- Labels: pull-request-available (was: ) > [C++] Use hyphen for plasma-store-server executable

[jira] [Created] (ARROW-6294) [C++] Use hyphen for plasma-store-server executable

2019-08-19 Thread Sutou Kouhei (Jira)
Sutou Kouhei created ARROW-6294: --- Summary: [C++] Use hyphen for plasma-store-server executable Key: ARROW-6294 URL: https://issues.apache.org/jira/browse/ARROW-6294 Project: Apache Arrow

[jira] [Created] (ARROW-6293) datafusion 0.15.0-SNAPSHOT error

2019-08-19 Thread xingzhicn (Jira)
xingzhicn created ARROW-6293: Summary: datafusion 0.15.0-SNAPSHOT error Key: ARROW-6293 URL: https://issues.apache.org/jira/browse/ARROW-6293 Project: Apache Arrow Issue Type: Bug

[jira] [Resolved] (ARROW-5978) [FlightRPC] [Java] Integration test client doesn't close buffers

2019-08-19 Thread Micah Kornfield (Jira)
[ https://issues.apache.org/jira/browse/ARROW-5978?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Micah Kornfield resolved ARROW-5978. Fix Version/s: (was: 1.0.0) 0.15.0 Resolution: Fixed Issue

[jira] [Resolved] (ARROW-3538) [Python] ability to override the automated assignment of uuid for filenames when writing datasets

2019-08-19 Thread Micah Kornfield (Jira)
[ https://issues.apache.org/jira/browse/ARROW-3538?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Micah Kornfield resolved ARROW-3538. Fix Version/s: (was: 1.0.0) 0.15.0 Resolution: Fixed Issue

[jira] [Updated] (ARROW-6125) [Python] Remove any APIs deprecated prior to 0.14.x

2019-08-19 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/ARROW-6125?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated ARROW-6125: -- Labels: pull-request-available (was: ) > [Python] Remove any APIs deprecated prior to 0.14.x

[jira] [Updated] (ARROW-6120) [C++][Gandiva] including some headers causes decimal_test to fail

2019-08-19 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-6120?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney updated ARROW-6120: Fix Version/s: 0.15.0 > [C++][Gandiva] including some headers causes decimal_test to fail >

[jira] [Commented] (ARROW-6120) [C++][Gandiva] including some headers causes decimal_test to fail

2019-08-19 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-6120?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16910876#comment-16910876 ] Wes McKinney commented on ARROW-6120: - We should probably lint for and forbid undesirable headers

[jira] [Updated] (ARROW-6095) [C++] Python subproject ignores ARROW_TEST_LINKAGE

2019-08-19 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-6095?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney updated ARROW-6095: Fix Version/s: 0.15.0 > [C++] Python subproject ignores ARROW_TEST_LINKAGE >

[jira] [Updated] (ARROW-6094) [Format][Flight] Add GetFlightSchema to Flight RPC

2019-08-19 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-6094?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney updated ARROW-6094: Summary: [Format][Flight] Add GetFlightSchema to Flight RPC (was: Add GetFlightSchema to Flight

[jira] [Updated] (ARROW-6049) [C++] Support using Array::View from compatible dictionary type to another

2019-08-19 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/ARROW-6049?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated ARROW-6049: -- Labels: pull-request-available (was: ) > [C++] Support using Array::View from compatible

[jira] [Updated] (ARROW-6067) [Python] Large memory test failures

2019-08-19 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/ARROW-6067?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated ARROW-6067: -- Labels: pull-request-available (was: ) > [Python] Large memory test failures >

[jira] [Assigned] (ARROW-6067) [Python] Large memory test failures

2019-08-19 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-6067?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney reassigned ARROW-6067: --- Assignee: Wes McKinney > [Python] Large memory test failures >

[jira] [Commented] (ARROW-6057) [Python] Parquet files v2.0 created by spark can't be read by pyarrow

2019-08-19 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-6057?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16910852#comment-16910852 ] Wes McKinney commented on ARROW-6057: - According to discussions on the Parquet mailing list, Spark

[jira] [Updated] (ARROW-6048) [C++] Add ChunkedArray::View which calls to Array::View

2019-08-19 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-6048?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney updated ARROW-6048: Fix Version/s: (was: 1.0.0) 0.15.0 > [C++] Add ChunkedArray::View which

[jira] [Updated] (ARROW-6048) [C++] Add ChunkedArray::View which calls to Array::View

2019-08-19 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/ARROW-6048?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated ARROW-6048: -- Labels: pull-request-available (was: ) > [C++] Add ChunkedArray::View which calls to

[jira] [Assigned] (ARROW-6048) [C++] Add ChunkedArray::View which calls to Array::View

2019-08-19 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-6048?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney reassigned ARROW-6048: --- Assignee: Wes McKinney > [C++] Add ChunkedArray::View which calls to Array::View >

[jira] [Updated] (ARROW-6046) [C++] Slice RecordBatch of String array with offset 0 returns whole batch

2019-08-19 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/ARROW-6046?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated ARROW-6046: -- Labels: pull-request-available (was: ) > [C++] Slice RecordBatch of String array with offset

[jira] [Updated] (ARROW-6046) [C++] Slice RecordBatch of String array with offset 0 returns whole batch

2019-08-19 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-6046?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney updated ARROW-6046: Fix Version/s: (was: 1.0.0) 0.15.0 > [C++] Slice RecordBatch of String

[jira] [Assigned] (ARROW-6046) [C++] Slice RecordBatch of String array with offset 0 returns whole batch

2019-08-19 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-6046?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney reassigned ARROW-6046: --- Assignee: Wes McKinney > [C++] Slice RecordBatch of String array with offset 0 returns

[jira] [Closed] (ARROW-6027) [C++] CMake Build w/boost_ep fails on Windows - "%1 is not a valid Win32 application"

2019-08-19 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-6027?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney closed ARROW-6027. --- Resolution: Duplicate If the contributor in ARROW-1324 does not respect feel free to pick up the

[jira] [Updated] (ARROW-6011) [Python] Data incomplete when using pyarrow in pyspark in python 3.x

2019-08-19 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-6011?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney updated ARROW-6011: Summary: [Python] Data incomplete when using pyarrow in pyspark in python 3.x (was: Data

[jira] [Commented] (ARROW-5995) [Python] pyarrow: hdfs: support file checksum

2019-08-19 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-5995?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16910802#comment-16910802 ] Wes McKinney commented on ARROW-5995: - My comment about libhdfs3 was an aside. The checksum feature

[jira] [Commented] (ARROW-5993) [Python] Reading a dictionary column from Parquet results in disproportionate memory usage

2019-08-19 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-5993?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16910801#comment-16910801 ] Wes McKinney commented on ARROW-5993: - [~danielil] can you make this file publicly accessible again?

[jira] [Updated] (ARROW-5993) [Python] Reading a dictionary column from Parquet results in disproportionate memory usage

2019-08-19 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-5993?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney updated ARROW-5993: Fix Version/s: (was: 1.0.0) 0.15.0 > [Python] Reading a dictionary column

[jira] [Updated] (ARROW-5992) [C++] Array::View fails for string/utf8 as binary

2019-08-19 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/ARROW-5992?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated ARROW-5992: -- Labels: pull-request-available (was: ) > [C++] Array::View fails for string/utf8 as binary >

[jira] [Updated] (ARROW-5992) [C++] Array::View fails for string/utf8 as binary

2019-08-19 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-5992?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney updated ARROW-5992: Fix Version/s: (was: 1.0.0) 0.15.0 > [C++] Array::View fails for

[jira] [Assigned] (ARROW-5992) [C++] Array::View fails for string/utf8 as binary

2019-08-19 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-5992?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney reassigned ARROW-5992: --- Assignee: Wes McKinney > [C++] Array::View fails for string/utf8 as binary >

[jira] [Closed] (ARROW-5991) [C++] Does DictionaryBuilder need to be a subclass of ArrayBuilder?

2019-08-19 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-5991?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney closed ARROW-5991. --- Resolution: Won't Fix > [C++] Does DictionaryBuilder need to be a subclass of ArrayBuilder? >

[jira] [Updated] (ARROW-5985) [Developer] Do not suggest setting Fix Version for point releases in dev/merge_arrow_pr.py

2019-08-19 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/ARROW-5985?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated ARROW-5985: -- Labels: pull-request-available (was: ) > [Developer] Do not suggest setting Fix Version for

[jira] [Updated] (ARROW-6232) [C++] Rename Argsort kernel to SortToIndices

2019-08-19 Thread Sutou Kouhei (Jira)
[ https://issues.apache.org/jira/browse/ARROW-6232?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sutou Kouhei updated ARROW-6232: Description: "Argsort" is NumPy specific name. Other languages/libraries use different name: *

[jira] [Updated] (ARROW-6232) [C++] Rename Argsort kernel to SortToIndices

2019-08-19 Thread Sutou Kouhei (Jira)
[ https://issues.apache.org/jira/browse/ARROW-6232?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sutou Kouhei updated ARROW-6232: Summary: [C++] Rename Argsort kernel to SortToIndices (was: [C++] Rename Argsort kernel to

[jira] [Assigned] (ARROW-5985) [Developer] Do not suggest setting Fix Version for point releases in dev/merge_arrow_pr.py

2019-08-19 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-5985?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney reassigned ARROW-5985: --- Assignee: Wes McKinney > [Developer] Do not suggest setting Fix Version for point releases

[jira] [Updated] (ARROW-5966) [Python] Capacity error when converting large UTF32 numpy array to arrow array

2019-08-19 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/ARROW-5966?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated ARROW-5966: -- Labels: pull-request-available (was: ) > [Python] Capacity error when converting large UTF32

[jira] [Commented] (ARROW-6292) [C++] Add an option to build with mimalloc

2019-08-19 Thread Antoine Pitrou (Jira)
[ https://issues.apache.org/jira/browse/ARROW-6292?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16910745#comment-16910745 ] Antoine Pitrou commented on ARROW-6292: --- Opened issue upstream at

[jira] [Created] (ARROW-6292) [C++] Add an option to build with mimalloc

2019-08-19 Thread Antoine Pitrou (Jira)
Antoine Pitrou created ARROW-6292: - Summary: [C++] Add an option to build with mimalloc Key: ARROW-6292 URL: https://issues.apache.org/jira/browse/ARROW-6292 Project: Apache Arrow Issue

[jira] [Comment Edited] (ARROW-5966) [Python] Capacity error when converting large UTF32 numpy array to arrow array

2019-08-19 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-5966?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16910700#comment-16910700 ] Wes McKinney edited comment on ARROW-5966 at 8/19/19 7:51 PM: -- This is

[jira] [Commented] (ARROW-5966) [Python] Capacity error when converting large UTF32 numpy array to arrow array

2019-08-19 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-5966?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16910700#comment-16910700 ] Wes McKinney commented on ARROW-5966: - This is buggy. It doesn't look like it's too horrible to fix.

[jira] [Assigned] (ARROW-5966) [Python] Capacity error when converting large UTF32 numpy array to arrow array

2019-08-19 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-5966?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney reassigned ARROW-5966: --- Assignee: Wes McKinney > [Python] Capacity error when converting large UTF32 numpy array to

[jira] [Updated] (ARROW-5966) [Python] Capacity error when converting large UTF32 numpy array to arrow array

2019-08-19 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-5966?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney updated ARROW-5966: Fix Version/s: 0.15.0 > [Python] Capacity error when converting large UTF32 numpy array to arrow

[jira] [Closed] (ARROW-5965) [Python] Regression: segfault when reading hive table with v0.14

2019-08-19 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-5965?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney closed ARROW-5965. --- Fix Version/s: 0.15.0 Resolution: Duplicate I'm guessing this is a dup of the memory issue

[jira] [Commented] (ARROW-5960) [C++] Boost dependencies are specified in wrong order

2019-08-19 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-5960?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16910693#comment-16910693 ] Wes McKinney commented on ARROW-5960: - Can you submit a pull request please? > [C++] Boost

[jira] [Updated] (ARROW-5960) [C++] Boost dependencies are specified in wrong order

2019-08-19 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-5960?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney updated ARROW-5960: Summary: [C++] Boost dependencies are specified in wrong order (was: Boost dependencies are

[jira] [Updated] (ARROW-5936) [C++] [FlightRPC] user_metadata is not present in fields read from flight

2019-08-19 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-5936?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney updated ARROW-5936: Fix Version/s: 1.0.0 > [C++] [FlightRPC] user_metadata is not present in fields read from flight >

[jira] [Updated] (ARROW-5954) [Developer][Documentation] Organize source and binary dependency licenses into directories

2019-08-19 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-5954?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney updated ARROW-5954: Summary: [Developer][Documentation] Organize source and binary dependency licenses into

[jira] [Updated] (ARROW-5954) [Developer][Documentation] Organize source and binary dependency licenses into directories

2019-08-19 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-5954?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney updated ARROW-5954: Component/s: Developer Tools > [Developer][Documentation] Organize source and binary dependency

[jira] [Updated] (ARROW-5932) [C++] undefined reference to `__cxa_init_primary_exception@CXXABI_1.3.11'

2019-08-19 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-5932?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney updated ARROW-5932: Summary: [C++] undefined reference to `__cxa_init_primary_exception@CXXABI_1.3.11' (was:

[jira] [Updated] (ARROW-5913) [C++][Parquet] Add support for Parquet's BYTE_STREAM_SPLIT encoding

2019-08-19 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-5913?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney updated ARROW-5913: Summary: [C++][Parquet] Add support for Parquet's BYTE_STREAM_SPLIT encoding (was: Add support

[jira] [Updated] (ARROW-5910) [Python] read_tensor() fails on non-seekable streams

2019-08-19 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-5910?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney updated ARROW-5910: Fix Version/s: 0.15.0 > [Python] read_tensor() fails on non-seekable streams >

[jira] [Closed] (ARROW-5907) [Python] base64 support of bytes-like

2019-08-19 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-5907?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney closed ARROW-5907. --- Resolution: Won't Fix I don't think it makes sense to implement the buffer protocol on Array classes

[jira] [Updated] (ARROW-5906) [CI] Set -DARROW_VERBOSE_THIRDPARTY_BUILD=OFF in builds running in Travis CI, maybe all docker-compose builds by default

2019-08-19 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-5906?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney updated ARROW-5906: Fix Version/s: (was: 1.0.0) 0.15.0 > [CI] Set

[jira] [Commented] (ARROW-5888) [Python][C++] Add metadata to store Arrow time zones in Parquet file metadata

2019-08-19 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-5888?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16910688#comment-16910688 ] Wes McKinney commented on ARROW-5888: - Yes it should be possible. > [Python][C++] Add metadata to

[jira] [Updated] (ARROW-5888) [Python][C++] Add metadata to store Arrow time zones in Parquet file metadata

2019-08-19 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-5888?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney updated ARROW-5888: Fix Version/s: (was: 1.0.0) 0.15.0 > [Python][C++] Add metadata to store

[jira] [Commented] (ARROW-5825) [Python] Exceptions swallowed in ParquetManifest._visit_directories

2019-08-19 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-5825?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16910684#comment-16910684 ] Wes McKinney commented on ARROW-5825: - PRs welcome -- note that the C++ Datasets project is getting

[jira] [Updated] (ARROW-5853) [Python] Expose boolean filter kernel on Array

2019-08-19 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-5853?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney updated ARROW-5853: Fix Version/s: 0.15.0 > [Python] Expose boolean filter kernel on Array >

[jira] [Updated] (ARROW-5825) [Python] Exceptions swallowed in ParquetManifest._visit_directories

2019-08-19 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-5825?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney updated ARROW-5825: Fix Version/s: (was: 1.0.0) > [Python] Exceptions swallowed in

[jira] [Updated] (ARROW-5743) [C++] Add CMake option to enable "large memory" unit tests

2019-08-19 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-5743?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney updated ARROW-5743: Fix Version/s: (was: 1.0.0) 0.15.0 > [C++] Add CMake option to enable

[jira] [Resolved] (ARROW-5734) [Python] Dispatch to Table.from_arrays from pyarrow.table factory function

2019-08-19 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-5734?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney resolved ARROW-5734. - Fix Version/s: (was: 1.0.0) 0.15.0 Assignee: Wes McKinney

[jira] [Updated] (ARROW-5717) [Python] Support dictionary unification when converting variable dictionaries to pandas

2019-08-19 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-5717?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney updated ARROW-5717: Fix Version/s: (was: 1.0.0) 0.15.0 > [Python] Support dictionary

[jira] [Updated] (ARROW-5580) Correct definitions of timestamp functions in Gandiva

2019-08-19 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-5580?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney updated ARROW-5580: Fix Version/s: (was: 0.14.0) 0.15.0 > Correct definitions of timestamp

[jira] [Updated] (ARROW-5134) [R][CI] Run nightly tests against multiple R versions

2019-08-19 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/ARROW-5134?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated ARROW-5134: -- Labels: pull-request-available (was: ) > [R][CI] Run nightly tests against multiple R

[jira] [Resolved] (ARROW-3652) [Python] CategoricalIndex is lost after reading back

2019-08-19 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-3652?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney resolved ARROW-3652. - Fix Version/s: (was: 1.0.0) 0.15.0 Resolution: Fixed Issue

[jira] [Commented] (ARROW-1644) [C++][Parquet] Read and write nested Parquet data with a mix of struct and list nesting levels

2019-08-19 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-1644?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16910669#comment-16910669 ] Wes McKinney commented on ARROW-1644: - I'm not aware of any updates; there are no patches available

[jira] [Resolved] (ARROW-5480) [Python] Pandas categorical type doesn't survive a round-trip through parquet

2019-08-19 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-5480?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney resolved ARROW-5480. - Fix Version/s: 0.15.0 Resolution: Fixed Resolved in

[jira] [Updated] (ARROW-4649) [C++/CI/R] Add (nightly) job that builds `brew install apache-arrow --HEAD`

2019-08-19 Thread Neal Richardson (Jira)
[ https://issues.apache.org/jira/browse/ARROW-4649?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Neal Richardson updated ARROW-4649: --- Fix Version/s: (was: 1.0.0) 0.15.0 > [C++/CI/R] Add (nightly) job

[jira] [Updated] (ARROW-4649) [C++/CI/R] Add (nightly) job that builds `brew install apache-arrow --HEAD`

2019-08-19 Thread Neal Richardson (Jira)
[ https://issues.apache.org/jira/browse/ARROW-4649?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Neal Richardson updated ARROW-4649: --- Priority: Major (was: Blocker) > [C++/CI/R] Add (nightly) job that builds `brew install

[jira] [Assigned] (ARROW-4390) [R] Serialize "labeled" metadata in Feather files, IPC messages

2019-08-19 Thread Neal Richardson (Jira)
[ https://issues.apache.org/jira/browse/ARROW-4390?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Neal Richardson reassigned ARROW-4390: -- Assignee: Neal Richardson > [R] Serialize "labeled" metadata in Feather files, IPC

[jira] [Assigned] (ARROW-6182) [R] Add note to README about r-arrow conda installation

2019-08-19 Thread Neal Richardson (Jira)
[ https://issues.apache.org/jira/browse/ARROW-6182?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Neal Richardson reassigned ARROW-6182: -- Assignee: Neal Richardson > [R] Add note to README about r-arrow conda installation

[jira] [Commented] (ARROW-6236) [R] Deduplicate strings using Arrow hash tables instead of passing all values through R's global hash table

2019-08-19 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-6236?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16910603#comment-16910603 ] Wes McKinney commented on ARROW-6236: - Some basic profiling I did suggested the performance gains may

[jira] [Resolved] (ARROW-6258) [R] Add macOS build scripts

2019-08-19 Thread Francois Saint-Jacques (Jira)
[ https://issues.apache.org/jira/browse/ARROW-6258?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Francois Saint-Jacques resolved ARROW-6258. --- Resolution: Fixed Issue resolved by pull request 5095

[jira] [Assigned] (ARROW-4648) [C++/Question] Naming/organizational inconsistencies in cpp codebase

2019-08-19 Thread Antoine Pitrou (Jira)
[ https://issues.apache.org/jira/browse/ARROW-4648?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Antoine Pitrou reassigned ARROW-4648: - Assignee: Antoine Pitrou > [C++/Question] Naming/organizational inconsistencies in cpp

[jira] [Resolved] (ARROW-4648) [C++/Question] Naming/organizational inconsistencies in cpp codebase

2019-08-19 Thread Antoine Pitrou (Jira)
[ https://issues.apache.org/jira/browse/ARROW-4648?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Antoine Pitrou resolved ARROW-4648. --- Fix Version/s: (was: 1.0.0) 0.15.0 Resolution: Fixed Issue

[jira] [Updated] (ARROW-6288) [Java] Implement TypeEqualsVisitor comparing vector type equals considering names and metadata

2019-08-19 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/ARROW-6288?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated ARROW-6288: -- Labels: pull-request-available (was: ) > [Java] Implement TypeEqualsVisitor comparing vector

[jira] [Updated] (ARROW-6250) [Java] Implement ApproxEqualsVisitor comparing approx for floating point

2019-08-19 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/ARROW-6250?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated ARROW-6250: -- Labels: pull-request-available (was: ) > [Java] Implement ApproxEqualsVisitor comparing

[jira] [Updated] (ARROW-6236) [R] Deduplicate strings using Arrow hash tables instead of passing all values through R's global hash table

2019-08-19 Thread Neal Richardson (Jira)
[ https://issues.apache.org/jira/browse/ARROW-6236?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Neal Richardson updated ARROW-6236: --- Fix Version/s: 0.15.0 > [R] Deduplicate strings using Arrow hash tables instead of passing

[jira] [Updated] (ARROW-3813) [R] lower level construction of Dictionary Arrays

2019-08-19 Thread Neal Richardson (Jira)
[ https://issues.apache.org/jira/browse/ARROW-3813?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Neal Richardson updated ARROW-3813: --- Fix Version/s: 0.15.0 > [R] lower level construction of Dictionary Arrays >

[jira] [Updated] (ARROW-3750) [R] Pass various wrapped Arrow objects created in Python into R with zero copy via reticulate

2019-08-19 Thread Neal Richardson (Jira)
[ https://issues.apache.org/jira/browse/ARROW-3750?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Neal Richardson updated ARROW-3750: --- Fix Version/s: 0.15.0 > [R] Pass various wrapped Arrow objects created in Python into R with

[jira] [Updated] (ARROW-4649) [C++/CI/R] Add (nightly) job that builds `brew install apache-arrow --HEAD`

2019-08-19 Thread Neal Richardson (Jira)
[ https://issues.apache.org/jira/browse/ARROW-4649?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Neal Richardson updated ARROW-4649: --- Component/s: (was: R) > [C++/CI/R] Add (nightly) job that builds `brew install

[jira] [Updated] (ARROW-3317) [R] Test/support conversions from data.frame with a single character column exceeding 2GB capacity of BinaryArray

2019-08-19 Thread Neal Richardson (Jira)
[ https://issues.apache.org/jira/browse/ARROW-3317?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Neal Richardson updated ARROW-3317: --- Fix Version/s: (was: 1.0.0) 0.15.0 > [R] Test/support conversions

[jira] [Updated] (ARROW-3308) [R] Convert R character vector with data exceeding 2GB to chunked array

2019-08-19 Thread Neal Richardson (Jira)
[ https://issues.apache.org/jira/browse/ARROW-3308?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Neal Richardson updated ARROW-3308: --- Fix Version/s: (was: 1.0.0) 0.15.0 > [R] Convert R character vector

[jira] [Updated] (ARROW-3316) [R] Multi-threaded conversion from R data.frame to Arrow table / record batch

2019-08-19 Thread Neal Richardson (Jira)
[ https://issues.apache.org/jira/browse/ARROW-3316?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Neal Richardson updated ARROW-3316: --- Fix Version/s: (was: 1.0.0) 0.15.0 > [R] Multi-threaded conversion

[jira] [Resolved] (ARROW-6211) [Java] Remove dependency on RangeEqualsVisitor from ValueVector interface

2019-08-19 Thread Pindikura Ravindra (Jira)
[ https://issues.apache.org/jira/browse/ARROW-6211?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Pindikura Ravindra resolved ARROW-6211. --- Fix Version/s: 0.15.0 Resolution: Fixed Issue resolved by pull request 5091

[jira] [Updated] (ARROW-4390) [R] Serialize "labeled" metadata in Feather files, IPC messages

2019-08-19 Thread Neal Richardson (Jira)
[ https://issues.apache.org/jira/browse/ARROW-4390?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Neal Richardson updated ARROW-4390: --- Fix Version/s: (was: 1.0.0) 0.15.0 > [R] Serialize "labeled" metadata

[jira] [Updated] (ARROW-5501) [R] read/write_feather/arrow?

2019-08-19 Thread Neal Richardson (Jira)
[ https://issues.apache.org/jira/browse/ARROW-5501?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Neal Richardson updated ARROW-5501: --- Fix Version/s: (was: 1.0.0) 0.15.0 > [R] read/write_feather/arrow? >

[jira] [Updated] (ARROW-6258) [R] Add macOS build scripts

2019-08-19 Thread Neal Richardson (Jira)
[ https://issues.apache.org/jira/browse/ARROW-6258?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Neal Richardson updated ARROW-6258: --- Fix Version/s: 0.15.0 > [R] Add macOS build scripts > --- > >

[jira] [Updated] (ARROW-6214) [R] Sanitizer errors triggered via R bindings

2019-08-19 Thread Neal Richardson (Jira)
[ https://issues.apache.org/jira/browse/ARROW-6214?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Neal Richardson updated ARROW-6214: --- Fix Version/s: 0.15.0 > [R] Sanitizer errors triggered via R bindings >

[jira] [Updated] (ARROW-6235) [R] Conversion from arrow::BinaryArray to R character vector not implemented

2019-08-19 Thread Neal Richardson (Jira)
[ https://issues.apache.org/jira/browse/ARROW-6235?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Neal Richardson updated ARROW-6235: --- Fix Version/s: 0.15.0 > [R] Conversion from arrow::BinaryArray to R character vector not

[jira] [Updated] (ARROW-5134) [R][CI] Run nightly tests against multiple R versions

2019-08-19 Thread Neal Richardson (Jira)
[ https://issues.apache.org/jira/browse/ARROW-5134?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Neal Richardson updated ARROW-5134: --- Fix Version/s: (was: 1.0.0) 0.15.0 > [R][CI] Run nightly tests

[jira] [Updated] (ARROW-6182) [R] Add note to README about r-arrow conda installation

2019-08-19 Thread Neal Richardson (Jira)
[ https://issues.apache.org/jira/browse/ARROW-6182?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Neal Richardson updated ARROW-6182: --- Fix Version/s: 0.15.0 > [R] Add note to README about r-arrow conda installation >

[jira] [Commented] (ARROW-6278) [R] Read files from HDFS

2019-08-19 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-6278?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16910484#comment-16910484 ] Wes McKinney commented on ARROW-6278: - Automatically recognizing raw binary in functions like

[jira] [Commented] (ARROW-5713) [Python] fancy indexing on pa.array

2019-08-19 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-5713?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16910482#comment-16910482 ] Wes McKinney commented on ARROW-5713: - It's a little bit easier said than done because Table can have

[jira] [Updated] (ARROW-4398) [Python] Add benchmarks for Arrow<>Parquet BYTE_ARRAY serialization (read and write)

2019-08-19 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-4398?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney updated ARROW-4398: Fix Version/s: (was: 1.0.0) 0.15.0 > [Python] Add benchmarks for

[jira] [Assigned] (ARROW-4398) [Python] Add benchmarks for Arrow<>Parquet BYTE_ARRAY serialization (read and write)

2019-08-19 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-4398?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney reassigned ARROW-4398: --- Assignee: Wes McKinney > [Python] Add benchmarks for Arrow<>Parquet BYTE_ARRAY

[GitHub] [arrow-testing] wesm commented on issue #8: add crash from ARROW-6269 and ARROW-6270

2019-08-19 Thread GitBox
wesm commented on issue #8: add crash from ARROW-6269 and ARROW-6270 URL: https://github.com/apache/arrow-testing/pull/8#issuecomment-522591728 thanks! This is an automated message from the Apache Git Service. To respond to

[GitHub] [arrow-testing] wesm merged pull request #8: add crash from ARROW-6269 and ARROW-6270

2019-08-19 Thread GitBox
wesm merged pull request #8: add crash from ARROW-6269 and ARROW-6270 URL: https://github.com/apache/arrow-testing/pull/8 This is an automated message from the Apache Git Service. To respond to the message, please log on to