[jira] [Updated] (ARROW-3896) [MATLAB] Decouple MATLAB-Arrow conversion logic from Feather file specific logic

2019-05-30 Thread Wes McKinney (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-3896?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney updated ARROW-3896: Fix Version/s: (was: 0.14.0) > [MATLAB] Decouple MATLAB-Arrow conversion logic from Feather

[jira] [Updated] (ARROW-3919) [Python] Support 64 bit indices for pyarrow.serialize and pyarrow.deserialize

2019-05-30 Thread Wes McKinney (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-3919?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney updated ARROW-3919: Fix Version/s: (was: 0.14.0) > [Python] Support 64 bit indices for pyarrow.serialize and

[jira] [Updated] (ARROW-3873) [C++] Build shared libraries consistently with -fvisibility=hidden

2019-05-30 Thread Wes McKinney (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-3873?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney updated ARROW-3873: Fix Version/s: (was: 0.14.0) 0.15.0 > [C++] Build shared libraries

[jira] [Updated] (ARROW-3901) [Python] Make Schema hashable

2019-05-30 Thread Wes McKinney (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-3901?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney updated ARROW-3901: Fix Version/s: (was: 0.14.0) > [Python] Make Schema hashable > - >

[jira] [Updated] (ARROW-4022) [C++] RFC: promote Datum variant out of compute namespace

2019-05-30 Thread Wes McKinney (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-4022?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney updated ARROW-4022: Fix Version/s: (was: 0.14.0) > [C++] RFC: promote Datum variant out of compute namespace >

[jira] [Updated] (ARROW-4001) [Python] Create Parquet Schema in python

2019-05-30 Thread Wes McKinney (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-4001?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney updated ARROW-4001: Fix Version/s: (was: 0.14.0) > [Python] Create Parquet Schema in python >

[jira] [Updated] (ARROW-4046) [Python/CI] Run nightly large memory tests

2019-05-30 Thread Wes McKinney (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-4046?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney updated ARROW-4046: Fix Version/s: (was: 0.14.0) > [Python/CI] Run nightly large memory tests >

[jira] [Updated] (ARROW-4046) [Python/CI] Run nightly large memory tests

2019-05-30 Thread Wes McKinney (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-4046?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney updated ARROW-4046: Labels: nightly (was: ) > [Python/CI] Run nightly large memory tests >

[jira] [Created] (ARROW-5455) [Rust] Build broken by 2019-05-30 Rust nightly

2019-05-30 Thread Wes McKinney (JIRA)
Wes McKinney created ARROW-5455: --- Summary: [Rust] Build broken by 2019-05-30 Rust nightly Key: ARROW-5455 URL: https://issues.apache.org/jira/browse/ARROW-5455 Project: Apache Arrow Issue

[jira] [Resolved] (ARROW-5453) [C++] Just-released cmake-format 0.5.2 breaks the build

2019-05-30 Thread Wes McKinney (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-5453?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney resolved ARROW-5453. - Resolution: Fixed Issue resolved by pull request 4423

[jira] [Updated] (ARROW-4631) [C++] Implement serial version of sort computational kernel

2019-05-30 Thread Wes McKinney (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-4631?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney updated ARROW-4631: Fix Version/s: (was: 0.14.0) 0.15.0 > [C++] Implement serial version of

[jira] [Updated] (ARROW-4591) [Rust] Add explicit SIMD vectorization for aggregation ops in "array_ops"

2019-05-30 Thread Wes McKinney (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-4591?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney updated ARROW-4591: Fix Version/s: (was: 0.14.0) > [Rust] Add explicit SIMD vectorization for aggregation ops in

[jira] [Updated] (ARROW-4575) [Python] Add Python Flight implementation to integration testing

2019-05-30 Thread Wes McKinney (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-4575?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney updated ARROW-4575: Fix Version/s: (was: 0.14.0) > [Python] Add Python Flight implementation to integration

[jira] [Commented] (ARROW-4567) [C++] Convert Scalar values to Array values with length 1

2019-05-30 Thread Wes McKinney (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-4567?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16852594#comment-16852594 ] Wes McKinney commented on ARROW-4567: - cc [~fsaintjacques] > [C++] Convert Scalar values to Array

[jira] [Created] (ARROW-5457) [GLib][Plasma] Environment variable name for test is wrong

2019-05-30 Thread Kouhei Sutou (JIRA)
Kouhei Sutou created ARROW-5457: --- Summary: [GLib][Plasma] Environment variable name for test is wrong Key: ARROW-5457 URL: https://issues.apache.org/jira/browse/ARROW-5457 Project: Apache Arrow

[jira] [Updated] (ARROW-5457) [GLib][Plasma] Environment variable name for test is wrong

2019-05-30 Thread ASF GitHub Bot (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-5457?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated ARROW-5457: -- Labels: pull-request-available (was: ) > [GLib][Plasma] Environment variable name for test is

[jira] [Updated] (ARROW-750) [Format] Add LargeBinary and LargeString types

2019-05-30 Thread Wes McKinney (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-750?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney updated ARROW-750: --- Fix Version/s: (was: 0.14.0) 0.15.0 > [Format] Add LargeBinary and LargeString

[jira] [Updated] (ARROW-3840) [C++] Run fuzzer tests with docker-compose

2019-05-30 Thread Wes McKinney (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-3840?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney updated ARROW-3840: Fix Version/s: (was: 0.14.0) > [C++] Run fuzzer tests with docker-compose >

[jira] [Updated] (ARROW-3419) [C++] Run include-what-you-use checks as nightly build

2019-05-30 Thread Wes McKinney (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-3419?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney updated ARROW-3419: Fix Version/s: (was: 0.14.0) > [C++] Run include-what-you-use checks as nightly build >

[jira] [Updated] (ARROW-3410) [C++] Streaming CSV reader interface for memory-constrainted environments

2019-05-30 Thread Wes McKinney (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-3410?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney updated ARROW-3410: Fix Version/s: (was: 0.14.0) > [C++] Streaming CSV reader interface for memory-constrainted

[jira] [Updated] (ARROW-3408) [C++] Add option to CSV reader to dictionary encode individual columns or all string / binary columns

2019-05-30 Thread Wes McKinney (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-3408?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney updated ARROW-3408: Labels: datasets (was: ) > [C++] Add option to CSV reader to dictionary encode individual columns

[jira] [Updated] (ARROW-3379) [C++] Implement regex/multichar delimiter tokenizer

2019-05-30 Thread Wes McKinney (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-3379?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney updated ARROW-3379: Fix Version/s: (was: 0.14.0) 0.15.0 > [C++] Implement regex/multichar

[jira] [Updated] (ARROW-3424) [Python] Improved workflow for loading an arbitrary collection of Parquet files

2019-05-30 Thread Wes McKinney (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-3424?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney updated ARROW-3424: Labels: datasets parquet (was: parquet) > [Python] Improved workflow for loading an arbitrary

[jira] [Updated] (ARROW-3408) [C++] Add option to CSV reader to dictionary encode individual columns or all string / binary columns

2019-05-30 Thread Wes McKinney (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-3408?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney updated ARROW-3408: Fix Version/s: (was: 0.14.0) 0.15.0 > [C++] Add option to CSV reader to

[jira] [Updated] (ARROW-3401) [C++] Pluggable statistics collector API for unconvertible CSV values

2019-05-30 Thread Wes McKinney (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-3401?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney updated ARROW-3401: Fix Version/s: (was: 0.14.0) 0.15.0 > [C++] Pluggable statistics collector

[jira] [Updated] (ARROW-3406) [C++] Create a caching memory pool implementation

2019-05-30 Thread Wes McKinney (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-3406?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney updated ARROW-3406: Fix Version/s: (was: 0.14.0) 0.15.0 > [C++] Create a caching memory pool

[jira] [Updated] (ARROW-4259) [Plasma] CI failure in test_plasma_tf_op

2019-05-30 Thread Wes McKinney (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-4259?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney updated ARROW-4259: Fix Version/s: (was: 0.14.0) > [Plasma] CI failure in test_plasma_tf_op >

[jira] [Updated] (ARROW-4286) [C++/R] Namespace vendored Boost

2019-05-30 Thread Wes McKinney (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-4286?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney updated ARROW-4286: Fix Version/s: (was: 0.14.0) > [C++/R] Namespace vendored Boost >

[jira] [Updated] (ARROW-4217) [Plasma] Remove custom object metadata

2019-05-30 Thread Wes McKinney (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-4217?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney updated ARROW-4217: Fix Version/s: (was: 0.14.0) > [Plasma] Remove custom object metadata >

[jira] [Commented] (ARROW-4220) [Python] Add buffered input and output stream ASV benchmarks with simulated high latency IO

2019-05-30 Thread Wes McKinney (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-4220?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16852570#comment-16852570 ] Wes McKinney commented on ARROW-4220: - cc [~jorisvandenbossche] > [Python] Add buffered input and

[jira] [Updated] (ARROW-4283) [Python] Should RecordBatchStreamReader/Writer be AsyncIterable?

2019-05-30 Thread Wes McKinney (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-4283?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney updated ARROW-4283: Fix Version/s: (was: 0.14.0) > [Python] Should RecordBatchStreamReader/Writer be

[jira] [Updated] (ARROW-4309) [Release] gen_apidocs docker-compose task is out of date

2019-05-30 Thread Wes McKinney (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-4309?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney updated ARROW-4309: Fix Version/s: (was: 0.14.0) > [Release] gen_apidocs docker-compose task is out of date >

[jira] [Resolved] (ARROW-4302) [C++] Add OpenSSL to C++ build toolchain

2019-05-30 Thread Wes McKinney (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-4302?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney resolved ARROW-4302. - Resolution: Fixed > [C++] Add OpenSSL to C++ build toolchain >

[jira] [Commented] (ARROW-4301) [Java][Gandiva] Maven snapshot version update does not seem to update Gandiva submodule

2019-05-30 Thread Wes McKinney (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-4301?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16852571#comment-16852571 ] Wes McKinney commented on ARROW-4301: - [~pravindra] any ideas about this? This will get us again in

[jira] [Updated] (ARROW-4465) [Rust] [DataFusion] Add support for ORDER BY

2019-05-30 Thread Wes McKinney (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-4465?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney updated ARROW-4465: Fix Version/s: (was: 0.14.0) > [Rust] [DataFusion] Add support for ORDER BY >

[jira] [Commented] (ARROW-4439) [C++] Improve FindBrotli.cmake

2019-05-30 Thread Wes McKinney (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-4439?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16852586#comment-16852586 ] Wes McKinney commented on ARROW-4439: - [~rip@gmail.com] is this OK in master now? > [C++]

[jira] [Updated] (ARROW-4453) [Python] Create Cython wrappers for SparseTensor

2019-05-30 Thread Wes McKinney (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-4453?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney updated ARROW-4453: Fix Version/s: (was: 0.14.0) > [Python] Create Cython wrappers for SparseTensor >

[jira] [Resolved] (ARROW-4447) [C++] Investigate dynamic linking for libthift

2019-05-30 Thread Wes McKinney (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-4447?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney resolved ARROW-4447. - Resolution: Fixed Assignee: Uwe L. Korn Thrift is now dynamically linked > [C++]

[jira] [Updated] (ARROW-4470) [Python] Pyarrow using considerable more memory when reading partitioned Parquet file

2019-05-30 Thread Wes McKinney (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-4470?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney updated ARROW-4470: Fix Version/s: (was: 0.14.0) 0.15.0 > [Python] Pyarrow using considerable

[jira] [Commented] (ARROW-4479) [Plasma] Add S3 as external store for Plasma

2019-05-30 Thread Wes McKinney (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-4479?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16852589#comment-16852589 ] Wes McKinney commented on ARROW-4479: - What is the status of this project? > [Plasma] Add S3 as

[jira] [Updated] (ARROW-4482) [Website] Add blog archive page

2019-05-30 Thread Wes McKinney (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-4482?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney updated ARROW-4482: Fix Version/s: (was: 0.14.0) 0.15.0 > [Website] Add blog archive page >

[jira] [Updated] (ARROW-4473) [Website] Add instructions to do a test-deploy of Arrow website and fix bugs

2019-05-30 Thread Wes McKinney (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-4473?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney updated ARROW-4473: Fix Version/s: (was: 0.14.0) 0.15.0 > [Website] Add instructions to do a

[jira] [Updated] (ARROW-4470) [Python] Pyarrow using considerable more memory when reading partitioned Parquet file

2019-05-30 Thread Wes McKinney (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-4470?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney updated ARROW-4470: Labels: datasets parquet (was: parquet) > [Python] Pyarrow using considerable more memory when

[jira] [Commented] (ARROW-5452) [R] Add documentation website (pkgdown)

2019-05-30 Thread Wes McKinney (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-5452?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16852326#comment-16852326 ] Wes McKinney commented on ARROW-5452: - Yeah, for generated API docs that is fine, if we start writing

[jira] [Updated] (ARROW-3054) [Packaging] Tooling to enable nightly conda packages to be updated to some anaconda.org channel

2019-05-30 Thread Wes McKinney (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-3054?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney updated ARROW-3054: Fix Version/s: (was: 0.14.0) > [Packaging] Tooling to enable nightly conda packages to be

[jira] [Updated] (ARROW-3082) [C++] Add SSL support for hiveserver2

2019-05-30 Thread Wes McKinney (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-3082?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney updated ARROW-3082: Fix Version/s: (was: 0.14.0) > [C++] Add SSL support for hiveserver2 >

[jira] [Updated] (ARROW-3806) [Python] When converting nested types to pandas, use tuples

2019-05-30 Thread Wes McKinney (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-3806?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney updated ARROW-3806: Fix Version/s: (was: 0.14.0) > [Python] When converting nested types to pandas, use tuples >

[jira] [Updated] (ARROW-3789) [Python] Enable calling object in Table.to_pandas to "self-destruct" for improved memory use

2019-05-30 Thread Wes McKinney (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-3789?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney updated ARROW-3789: Fix Version/s: (was: 0.14.0) > [Python] Enable calling object in Table.to_pandas to

[jira] [Updated] (ARROW-3764) [C++] Port Python "ParquetDataset" business logic to C++

2019-05-30 Thread Wes McKinney (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-3764?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney updated ARROW-3764: Fix Version/s: (was: 0.14.0) 0.15.0 > [C++] Port Python "ParquetDataset"

[jira] [Commented] (ARROW-3759) [R][CI] Build and test on Windows in Appveyor

2019-05-30 Thread Wes McKinney (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-3759?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16852548#comment-16852548 ] Wes McKinney commented on ARROW-3759: - cc [~npr] > [R][CI] Build and test on Windows in Appveyor >

[jira] [Commented] (ARROW-3873) [C++] Build shared libraries consistently with -fvisibility=hidden

2019-05-30 Thread Wes McKinney (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-3873?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16852552#comment-16852552 ] Wes McKinney commented on ARROW-3873: - I might take another crack at this to see if it is doable, but

[jira] [Commented] (ARROW-3801) [Python] Pandas-Arrow roundtrip makes pd categorical index not writeable

2019-05-30 Thread Wes McKinney (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-3801?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16852549#comment-16852549 ] Wes McKinney commented on ARROW-3801: - cc [~jorisvandenbossche] > [Python] Pandas-Arrow roundtrip

[jira] [Updated] (ARROW-3827) [Rust] Implement UnionArray

2019-05-30 Thread Wes McKinney (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-3827?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney updated ARROW-3827: Fix Version/s: (was: 0.14.0) > [Rust] Implement UnionArray > --- > >

[jira] [Updated] (ARROW-4208) [CI/Python] Have automatized tests for S3

2019-05-30 Thread Wes McKinney (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-4208?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney updated ARROW-4208: Labels: filesystem s3 (was: s3) > [CI/Python] Have automatized tests for S3 >

[jira] [Updated] (ARROW-4095) [C++] Implement optimizations for dictionary unification where dictionaries are prefixes of the unified dictionary

2019-05-30 Thread Wes McKinney (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-4095?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney updated ARROW-4095: Fix Version/s: (was: 0.14.0) 0.15.0 > [C++] Implement optimizations for

[jira] [Updated] (ARROW-4133) [C++/Python] ORC adapter should fail gracefully if /etc/timezone is missing instead of aborting

2019-05-30 Thread Wes McKinney (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-4133?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney updated ARROW-4133: Fix Version/s: (was: 0.14.0) > [C++/Python] ORC adapter should fail gracefully if

[jira] [Updated] (ARROW-4090) [Python] Table.flatten() doesn't work recursively

2019-05-30 Thread Wes McKinney (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-4090?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney updated ARROW-4090: Fix Version/s: (was: 0.14.0) > [Python] Table.flatten() doesn't work recursively >

[jira] [Updated] (ARROW-4202) [Gandiva] use ArrayFromJson in tests

2019-05-30 Thread Wes McKinney (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-4202?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney updated ARROW-4202: Fix Version/s: (was: 0.14.0) > [Gandiva] use ArrayFromJson in tests >

[jira] [Updated] (ARROW-4146) [C++] Extend visitor functions to include ArrayBuilder and allow callable visitors

2019-05-30 Thread Wes McKinney (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-4146?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney updated ARROW-4146: Fix Version/s: (was: 0.14.0) > [C++] Extend visitor functions to include ArrayBuilder and

[jira] [Updated] (ARROW-4201) [C++][Gandiva] integrate test utils with arrow

2019-05-30 Thread Wes McKinney (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-4201?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney updated ARROW-4201: Fix Version/s: (was: 0.14.0) > [C++][Gandiva] integrate test utils with arrow >

[jira] [Updated] (ARROW-4208) [CI/Python] Have automatized tests for S3

2019-05-30 Thread Wes McKinney (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-4208?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney updated ARROW-4208: Fix Version/s: (was: 0.14.0) 0.15.0 > [CI/Python] Have automatized tests

[jira] [Created] (ARROW-5456) [GLib][Plasma] Installed plasma-glib may be used on building document

2019-05-30 Thread Kouhei Sutou (JIRA)
Kouhei Sutou created ARROW-5456: --- Summary: [GLib][Plasma] Installed plasma-glib may be used on building document Key: ARROW-5456 URL: https://issues.apache.org/jira/browse/ARROW-5456 Project: Apache

[jira] [Updated] (ARROW-5456) [GLib][Plasma] Installed plasma-glib may be used on building document

2019-05-30 Thread ASF GitHub Bot (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-5456?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated ARROW-5456: -- Labels: pull-request-available (was: ) > [GLib][Plasma] Installed plasma-glib may be used on

[jira] [Commented] (ARROW-5458) Apache Arrow parallel CRC32c computation optimization

2019-05-30 Thread Yuqi Gu (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-5458?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16852610#comment-16852610 ] Yuqi Gu commented on ARROW-5458: PR: https://github.com/apache/arrow/pull/4427 > Apache Arrow parallel

[jira] [Updated] (ARROW-5452) [R] Add documentation website (pkgdown)

2019-05-30 Thread ASF GitHub Bot (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-5452?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated ARROW-5452: -- Labels: pull-request-available (was: ) > [R] Add documentation website (pkgdown) >

[jira] [Updated] (ARROW-1988) [Python] Extend flavor=spark in Parquet writing to handle INT types

2019-05-30 Thread Wes McKinney (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-1988?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney updated ARROW-1988: Fix Version/s: (was: 0.14.0) 0.15.0 > [Python] Extend flavor=spark in

[jira] [Updated] (ARROW-1987) [Website] Enable Docker-based documentation generator to build at a specific Arrow commit

2019-05-30 Thread Wes McKinney (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-1987?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney updated ARROW-1987: Fix Version/s: 0.15.0 > [Website] Enable Docker-based documentation generator to build at a

[jira] [Commented] (ARROW-1989) [Python] Better UX on timestamp conversion to Pandas

2019-05-30 Thread Wes McKinney (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-1989?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16852495#comment-16852495 ] Wes McKinney commented on ARROW-1989: - [~jorisvandenbossche] potentially of interest? > [Python]

[jira] [Commented] (ARROW-2006) [C++] Add option to trim excess padding when writing IPC messages

2019-05-30 Thread Wes McKinney (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-2006?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16852496#comment-16852496 ] Wes McKinney commented on ARROW-2006: - Our IPC methods lack configurability in general. We may want

[jira] [Updated] (ARROW-1987) [Website] Enable Docker-based documentation generator to build at a specific Arrow commit

2019-05-30 Thread Wes McKinney (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-1987?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney updated ARROW-1987: Fix Version/s: (was: 0.14.0) > [Website] Enable Docker-based documentation generator to build

[jira] [Assigned] (ARROW-1957) [Python] Write nanosecond timestamps using new NANO LogicalType Parquet unit

2019-05-30 Thread Wes McKinney (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-1957?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney reassigned ARROW-1957: --- Assignee: TP Boudreau > [Python] Write nanosecond timestamps using new NANO LogicalType

[jira] [Updated] (ARROW-1959) [Python] Add option for "lossy" conversions (overflow -> null) from timestamps to datetime.datetime / pandas.Timestamp

2019-05-30 Thread Wes McKinney (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-1959?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney updated ARROW-1959: Fix Version/s: (was: 0.14.0) 0.15.0 > [Python] Add option for "lossy"

[jira] [Updated] (ARROW-1846) [C++] Implement "any" reduction kernel for boolean data

2019-05-30 Thread Wes McKinney (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-1846?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney updated ARROW-1846: Fix Version/s: (was: 0.14.0) 0.15.0 > [C++] Implement "any" reduction

[jira] [Commented] (ARROW-1957) [Python] Write nanosecond timestamps using new NANO LogicalType Parquet unit

2019-05-30 Thread Wes McKinney (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-1957?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16852494#comment-16852494 ] Wes McKinney commented on ARROW-1957: - [~tpboudreau] I assume this is on your critical path >

[jira] [Commented] (ARROW-1837) [Java] Unable to read unsigned integers outside signed range for bit width in integration tests

2019-05-30 Thread Wes McKinney (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-1837?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16852493#comment-16852493 ] Wes McKinney commented on ARROW-1837: - [~emkornfi...@gmail.com] if you are interested in unsigned

[jira] [Updated] (ARROW-2077) [Python] Document on how to use Storefact & Arrow to read Parquet from S3/Azure/...

2019-05-30 Thread Wes McKinney (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-2077?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney updated ARROW-2077: Fix Version/s: (was: 0.14.0) > [Python] Document on how to use Storefact & Arrow to read

[jira] [Assigned] (ARROW-2057) [Python] Configure size of data pages in pyarrow.parquet.write_table

2019-05-30 Thread Wes McKinney (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-2057?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney reassigned ARROW-2057: --- Assignee: (was: Uwe L. Korn) > [Python] Configure size of data pages in

[jira] [Updated] (ARROW-2098) [Python] Implement "errors as null" option when coercing Python object arrays to Arrow format

2019-05-30 Thread Wes McKinney (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-2098?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney updated ARROW-2098: Fix Version/s: (was: 0.14.0) > [Python] Implement "errors as null" option when coercing Python

[jira] [Commented] (ARROW-2037) [Python]: Add tests for ARROW-1941 cases where pandas inferred type is 'empty'

2019-05-30 Thread Wes McKinney (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-2037?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16852497#comment-16852497 ] Wes McKinney commented on ARROW-2037: - cc [~jorisvandenbossche] > [Python]: Add tests for ARROW-1941

[jira] [Closed] (ARROW-2186) [C++] Clean up architecture specific compiler flags

2019-05-30 Thread Wes McKinney (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-2186?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney closed ARROW-2186. --- Resolution: Not A Problem > [C++] Clean up architecture specific compiler flags >

[jira] [Updated] (ARROW-2130) [Python] Support converting pandas.Timestamp in pyarrow.array

2019-05-30 Thread Wes McKinney (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-2130?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney updated ARROW-2130: Fix Version/s: (was: 0.14.0) > [Python] Support converting pandas.Timestamp in pyarrow.array >

[jira] [Updated] (ARROW-2127) [Plasma] Transfer of objects between CPUs and GPUs

2019-05-30 Thread Wes McKinney (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-2127?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney updated ARROW-2127: Fix Version/s: (was: 0.14.0) > [Plasma] Transfer of objects between CPUs and GPUs >

[jira] [Updated] (ARROW-2041) [Python] pyarrow.serialize has high overhead for list of NumPy arrays

2019-05-30 Thread Wes McKinney (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-2041?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney updated ARROW-2041: Fix Version/s: (was: 0.14.0) 0.15.0 > [Python] pyarrow.serialize has high

[jira] [Updated] (ARROW-1848) [Python] Add documentation examples for reading single Parquet files and datasets from HDFS

2019-05-30 Thread Wes McKinney (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-1848?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney updated ARROW-1848: Fix Version/s: (was: 0.14.0) 0.15.0 > [Python] Add documentation examples

[jira] [Updated] (ARROW-2939) [Python] Provide links to documentation pages for old versions

2019-05-30 Thread Wes McKinney (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-2939?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney updated ARROW-2939: Summary: [Python] Provide links to documentation pages for old versions (was: [Python] API

[jira] [Commented] (ARROW-2984) [JS] Refactor release verification script to share code with main source release verification script

2019-05-30 Thread Wes McKinney (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-2984?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16852519#comment-16852519 ] Wes McKinney commented on ARROW-2984: - To close this, let us remove the old JavaScript release

[jira] [Commented] (ARROW-3052) [C++] Detect ORC system packages

2019-05-30 Thread Wes McKinney (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-3052?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16852517#comment-16852517 ] Wes McKinney commented on ARROW-3052: - ORC is now in conda-forge

[jira] [Updated] (ARROW-3016) [C++] Add ability to enable call stack logging for each memory allocation

2019-05-30 Thread Wes McKinney (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-3016?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney updated ARROW-3016: Fix Version/s: (was: 0.14.0) > [C++] Add ability to enable call stack logging for each memory

[jira] [Commented] (ARROW-3702) [R] POSIXct mapped to DateType not TimestampType?

2019-05-30 Thread Wes McKinney (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-3702?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16852546#comment-16852546 ] Wes McKinney commented on ARROW-3702: - cc [~npr] > [R] POSIXct mapped to DateType not TimestampType?

[jira] [Updated] (ARROW-3706) [Rust] Add record batch reader trait.

2019-05-30 Thread Wes McKinney (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-3706?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney updated ARROW-3706: Fix Version/s: (was: 0.14.0) > [Rust] Add record batch reader trait. >

[jira] [Commented] (ARROW-3686) [Python] Support for masked arrays in to/from numpy

2019-05-30 Thread Wes McKinney (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-3686?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16852545#comment-16852545 ] Wes McKinney commented on ARROW-3686: - cc [~jorisvandenbossche] > [Python] Support for masked arrays

[jira] [Updated] (ARROW-3705) [Python] Add "nrows" argument to parquet.read_table read indicated number of rows from file instead of whole file

2019-05-30 Thread Wes McKinney (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-3705?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney updated ARROW-3705: Labels: datasets parquet (was: parquet) > [Python] Add "nrows" argument to parquet.read_table

[jira] [Updated] (ARROW-3655) [Gandiva] switch away from default_memory_pool

2019-05-30 Thread Wes McKinney (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-3655?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney updated ARROW-3655: Fix Version/s: (was: 0.14.0) > [Gandiva] switch away from default_memory_pool >

[jira] [Updated] (ARROW-3709) [CI/Docker/Python] Plasma tests are failing in the docker-compose setup

2019-05-30 Thread Wes McKinney (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-3709?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney updated ARROW-3709: Fix Version/s: (was: 0.14.0) > [CI/Docker/Python] Plasma tests are failing in the

[jira] [Commented] (ARROW-3730) [Python] Output a representation of pyarrow.Schema that can be used to reconstruct a schema in a script

2019-05-30 Thread Wes McKinney (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-3730?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16852547#comment-16852547 ] Wes McKinney commented on ARROW-3730: - cc [~jorisvandenbossche] > [Python] Output a representation

[jira] [Updated] (ARROW-3758) [R] Build R library on Windows, document build instructions for Windows developers

2019-05-30 Thread Wes McKinney (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-3758?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney updated ARROW-3758: Fix Version/s: (was: 0.14.0) 0.15.0 > [R] Build R library on Windows,

[jira] [Updated] (ARROW-3710) [CI/Python] Run nightly tests against pandas master

2019-05-30 Thread Wes McKinney (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-3710?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney updated ARROW-3710: Fix Version/s: (was: 0.14.0) > [CI/Python] Run nightly tests against pandas master >

[jira] [Updated] (ARROW-4759) [Rust] [DataFusion] It should be possible to share an execution context between threads

2019-05-30 Thread Wes McKinney (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-4759?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney updated ARROW-4759: Fix Version/s: (was: 0.14.0) > [Rust] [DataFusion] It should be possible to share an execution

[jira] [Updated] (ARROW-4429) Add git rebase tips to the 'Contributing' page in the developer docs

2019-05-30 Thread Wes McKinney (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-4429?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney updated ARROW-4429: Fix Version/s: (was: 0.14.0) > Add git rebase tips to the 'Contributing' page in the developer

[jira] [Updated] (ARROW-4752) [Rust] Add explicit SIMD vectorization for the divide kernel

2019-05-30 Thread Wes McKinney (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-4752?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney updated ARROW-4752: Fix Version/s: (was: 0.14.0) > [Rust] Add explicit SIMD vectorization for the divide kernel >

<    1   2   3   4   >