[jira] [Commented] (ARROW-8677) [Python][Parquet] Parquet write_batch and read from Python failes with batch size 10000 or 1 but okay with 1000

2020-05-04 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-8677?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17099165#comment-17099165 ] Wes McKinney commented on ARROW-8677: - [~fsaintjacques] since the file was written by Rust

[jira] [Updated] (ARROW-8677) [Python][Parquet] Parquet write_batch and read from Python failes with batch size 10000 or 1 but okay with 1000

2020-05-04 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-8677?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney updated ARROW-8677: Component/s: Rust > [Python][Parquet] Parquet write_batch and read from Python failes with batch

[jira] [Commented] (ARROW-8684) [Python] "SystemError: Bad call flags in _PyMethodDef_RawFastCallDict" in Python 3.7.7 on macOS when using pyarrow wheel

2020-05-04 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-8684?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17099164#comment-17099164 ] Wes McKinney commented on ARROW-8684: - I just opened https://github.com/cython/cython/issues/3572 >

[jira] [Updated] (ARROW-8684) [Python] "SystemError: Bad call flags in _PyMethodDef_RawFastCallDict" in Python 3.7.7 on macOS when using pyarrow wheel

2020-05-03 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-8684?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney updated ARROW-8684: Summary: [Python] "SystemError: Bad call flags in _PyMethodDef_RawFastCallDict" in Python 3.7.7 on

[jira] [Commented] (ARROW-8684) [Packaging][Python] "SystemError: Bad call flags in _PyMethodDef_RawFastCallDict" in Python 3.7.7 on macOS when using pyarrow wheel

2020-05-03 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-8684?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17098623#comment-17098623 ] Wes McKinney commented on ARROW-8684: - I briefly tried creating a debug build on macOS but couldn't

[jira] [Updated] (ARROW-8684) [Packaging][Python] "SystemError: Bad call flags in _PyMethodDef_RawFastCallDict" in Python 3.7.7 on macOS when using pyarrow wheel

2020-05-03 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-8684?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney updated ARROW-8684: Fix Version/s: 0.17.1 > [Packaging][Python] "SystemError: Bad call flags in >

[jira] [Created] (ARROW-8684) [Packaging][Python] "SystemError: Bad call flags in _PyMethodDef_RawFastCallDict" in Python 3.7.7 on macOS when using pyarrow wheel

2020-05-03 Thread Wes McKinney (Jira)
Wes McKinney created ARROW-8684: --- Summary: [Packaging][Python] "SystemError: Bad call flags in _PyMethodDef_RawFastCallDict" in Python 3.7.7 on macOS when using pyarrow wheel Key: ARROW-8684 URL:

[jira] [Created] (ARROW-8683) [C++] Add option for user-defined version identifier for Arrow libraries

2020-05-03 Thread Wes McKinney (Jira)
Wes McKinney created ARROW-8683: --- Summary: [C++] Add option for user-defined version identifier for Arrow libraries Key: ARROW-8683 URL: https://issues.apache.org/jira/browse/ARROW-8683 Project: Apache

[jira] [Updated] (ARROW-8681) [Rust][DataFusion] Improve like/nlike performance

2020-05-03 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-8681?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney updated ARROW-8681: Summary: [Rust][DataFusion] Improve like/nlike performance (was: Improve like/nlike performance)

[jira] [Commented] (ARROW-7852) [Python] 0.16.0 wheels not compatible with older numpy

2020-05-03 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-7852?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17098577#comment-17098577 ] Wes McKinney commented on ARROW-7852: - Can you open a new JIRA issue? > [Python] 0.16.0 wheels not

[jira] [Updated] (ARROW-8680) [Rust] ComplexObjectArrayReader incorrect null value shuffling

2020-05-03 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-8680?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney updated ARROW-8680: Component/s: Rust > [Rust] ComplexObjectArrayReader incorrect null value shuffling >

[jira] [Updated] (ARROW-8680) [Java] ComplexObjectArrayReader incorrect null value shuffling

2020-05-03 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-8680?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney updated ARROW-8680: Summary: [Java] ComplexObjectArrayReader incorrect null value shuffling (was:

[jira] [Updated] (ARROW-8680) [Rust] ComplexObjectArrayReader incorrect null value shuffling

2020-05-03 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-8680?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney updated ARROW-8680: Summary: [Rust] ComplexObjectArrayReader incorrect null value shuffling (was: [Java]

[jira] [Updated] (ARROW-8657) [Python][C++][Parquet] Forward compatibility issue from 0.16 to 0.17 when using version='2.0'

2020-05-03 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-8657?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney updated ARROW-8657: Fix Version/s: 1.0.0 > [Python][C++][Parquet] Forward compatibility issue from 0.16 to 0.17 when

[jira] [Commented] (ARROW-8679) [Python] supporting pandas sparse series in pyarrow

2020-05-03 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-8679?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17098576#comment-17098576 ] Wes McKinney commented on ARROW-8679: - You're welcome to submit a PR > [Python] supporting pandas

[jira] [Commented] (ARROW-8672) [Java] Implement RecordBatch IPC buffer compression from ARROW-300

2020-05-03 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-8672?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17098575#comment-17098575 ] Wes McKinney commented on ARROW-8672: - Sure please feel free > [Java] Implement RecordBatch IPC

[jira] [Updated] (ARROW-8679) [Python] supporting pandas sparse series in pyarrow

2020-05-03 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-8679?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney updated ARROW-8679: Summary: [Python] supporting pandas sparse series in pyarrow (was: supporting pandas sparse

[jira] [Updated] (ARROW-8677) [Rust][Parquet] Parquet write_batch and read from Python failes with batch size 10000 or 1 but okay with 1000

2020-05-02 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-8677?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney updated ARROW-8677: Summary: [Rust][Parquet] Parquet write_batch and read from Python failes with batch size 1 or

[jira] [Assigned] (ARROW-8513) [Python] Expose Take with Table input in Python

2020-05-02 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-8513?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney reassigned ARROW-8513: --- Assignee: German I. Ramirez-Espinoza > [Python] Expose Take with Table input in Python >

[jira] [Created] (ARROW-8676) [Rust] Create implementation of IPC RecordBatch body buffer compression from ARROW-300

2020-05-02 Thread Wes McKinney (Jira)
Wes McKinney created ARROW-8676: --- Summary: [Rust] Create implementation of IPC RecordBatch body buffer compression from ARROW-300 Key: ARROW-8676 URL: https://issues.apache.org/jira/browse/ARROW-8676

[jira] [Created] (ARROW-8674) [JS] Implement IPC RecordBatch body buffer compression from ARROW-300

2020-05-02 Thread Wes McKinney (Jira)
Wes McKinney created ARROW-8674: --- Summary: [JS] Implement IPC RecordBatch body buffer compression from ARROW-300 Key: ARROW-8674 URL: https://issues.apache.org/jira/browse/ARROW-8674 Project: Apache

[jira] [Created] (ARROW-8675) [C#] Create implementation of ARROW-300 / IPC record batch body buffer compression

2020-05-02 Thread Wes McKinney (Jira)
Wes McKinney created ARROW-8675: --- Summary: [C#] Create implementation of ARROW-300 / IPC record batch body buffer compression Key: ARROW-8675 URL: https://issues.apache.org/jira/browse/ARROW-8675

[jira] [Created] (ARROW-8673) [Go] Implement IPC RecordBatch body compression from ARROW-300

2020-05-02 Thread Wes McKinney (Jira)
Wes McKinney created ARROW-8673: --- Summary: [Go] Implement IPC RecordBatch body compression from ARROW-300 Key: ARROW-8673 URL: https://issues.apache.org/jira/browse/ARROW-8673 Project: Apache Arrow

[jira] [Created] (ARROW-8672) [Java] Implement RecordBatch IPC buffer compression from ARROW-300

2020-05-02 Thread Wes McKinney (Jira)
Wes McKinney created ARROW-8672: --- Summary: [Java] Implement RecordBatch IPC buffer compression from ARROW-300 Key: ARROW-8672 URL: https://issues.apache.org/jira/browse/ARROW-8672 Project: Apache Arrow

[jira] [Created] (ARROW-8671) [C++] Use IPC body compression metadata approved in ARROW-300

2020-05-02 Thread Wes McKinney (Jira)
Wes McKinney created ARROW-8671: --- Summary: [C++] Use IPC body compression metadata approved in ARROW-300 Key: ARROW-8671 URL: https://issues.apache.org/jira/browse/ARROW-8671 Project: Apache Arrow

[jira] [Created] (ARROW-8670) [Format] Create reference implementations of IPC RecordBatch body compression from ARROW-300

2020-05-02 Thread Wes McKinney (Jira)
Wes McKinney created ARROW-8670: --- Summary: [Format] Create reference implementations of IPC RecordBatch body compression from ARROW-300 Key: ARROW-8670 URL: https://issues.apache.org/jira/browse/ARROW-8670

[jira] [Resolved] (ARROW-8562) [C++] IO: Parameterize I/O coalescing using S3 storage metrics

2020-05-01 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-8562?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney resolved ARROW-8562. - Fix Version/s: 1.0.0 Resolution: Fixed Issue resolved by pull request 7022

[jira] [Assigned] (ARROW-8562) [C++] IO: Parameterize I/O coalescing using S3 storage metrics

2020-05-01 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-8562?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney reassigned ARROW-8562: --- Assignee: Mayur Srivastava > [C++] IO: Parameterize I/O coalescing using S3 storage metrics

[jira] [Resolved] (ARROW-8593) [C++] Parquet file_serialize_test.cc fails to build with musl libc

2020-05-01 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-8593?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney resolved ARROW-8593. - Fix Version/s: 1.0.0 Resolution: Fixed Issue resolved by pull request 7038

[jira] [Resolved] (ARROW-8660) [C++][Gandiva] Reduce dependence on Boost

2020-05-01 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-8660?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney resolved ARROW-8660. - Resolution: Fixed Issue resolved by pull request 7077

[jira] [Created] (ARROW-8667) [C++] Add multi-consumer Scheduler API to sit one layer above ThreadPool

2020-05-01 Thread Wes McKinney (Jira)
Wes McKinney created ARROW-8667: --- Summary: [C++] Add multi-consumer Scheduler API to sit one layer above ThreadPool Key: ARROW-8667 URL: https://issues.apache.org/jira/browse/ARROW-8667 Project: Apache

[jira] [Resolved] (ARROW-8619) [C++] Use distinct Type::type values for interval types

2020-05-01 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-8619?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney resolved ARROW-8619. - Resolution: Fixed Issue resolved by pull request 7060

[jira] [Updated] (ARROW-8608) [C++] Update vendored mpark/variant.h to latest to fix NVCC compilation issues

2020-05-01 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-8608?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney updated ARROW-8608: Summary: [C++] Update vendored mpark/variant.h to latest to fix NVCC compilation issues (was:

[jira] [Commented] (ARROW-8664) [Java] Add skip null check to all Vector types

2020-05-01 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-8664?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17097556#comment-17097556 ] Wes McKinney commented on ARROW-8664: - Please be sure to set the Component field and add

[jira] [Updated] (ARROW-8664) [Java] Add skip null check to all Vector types

2020-05-01 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-8664?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney updated ARROW-8664: Summary: [Java] Add skip null check to all Vector types (was: Add skip null check to all Vector

[jira] [Updated] (ARROW-8663) [Documentation] Small correction to building.rst

2020-05-01 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-8663?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney updated ARROW-8663: Summary: [Documentation] Small correction to building.rst (was: Small correction to building.rst)

[jira] [Updated] (ARROW-8664) [Java] Add skip null check to all Vector types

2020-05-01 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-8664?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney updated ARROW-8664: Component/s: Java > [Java] Add skip null check to all Vector types >

[jira] [Updated] (ARROW-8646) [Java] Allow UnionListWriter to write null values

2020-05-01 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-8646?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney updated ARROW-8646: Summary: [Java] Allow UnionListWriter to write null values (was: Allow UnionListWriter to write

[jira] [Updated] (ARROW-8661) [C++][Gandiva] Reduce number of files and headers

2020-04-30 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-8661?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney updated ARROW-8661: Description: I feel that the Gandiva subpackage is more Java-like in its code organization than

[jira] [Created] (ARROW-8661) [C++][Gandiva] Reduce number of files and headers

2020-04-30 Thread Wes McKinney (Jira)
Wes McKinney created ARROW-8661: --- Summary: [C++][Gandiva] Reduce number of files and headers Key: ARROW-8661 URL: https://issues.apache.org/jira/browse/ARROW-8661 Project: Apache Arrow Issue

[jira] [Created] (ARROW-8660) [C++][Gandiva] Reduce dependence on Boost

2020-04-30 Thread Wes McKinney (Jira)
Wes McKinney created ARROW-8660: --- Summary: [C++][Gandiva] Reduce dependence on Boost Key: ARROW-8660 URL: https://issues.apache.org/jira/browse/ARROW-8660 Project: Apache Arrow Issue Type:

[jira] [Resolved] (ARROW-300) [Format] Add body buffer compression option to IPC message protocol using LZ4 or ZSTD

2020-04-30 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-300?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney resolved ARROW-300. Fix Version/s: 1.0.0 Resolution: Fixed Issue resolved by pull request 6707

[jira] [Updated] (ARROW-8657) [Python][C++][Parquet] Forward compatibility issue from 0.16 to 0.17 when using version='2.0'

2020-04-30 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-8657?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney updated ARROW-8657: Description: With the recent release of 0.17, the ParquetVersion is used to define the logical

[jira] [Commented] (ARROW-8657) [Python][C++][Parquet] Forward compatibility issue from 0.16 to 0.17 when using version='2.0'

2020-04-30 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-8657?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17096866#comment-17096866 ] Wes McKinney commented on ARROW-8657: - For the record, I think we need to introduce a new flag to

[jira] [Comment Edited] (ARROW-8657) [Python][C++][Parquet] Forward compatibility issue from 0.16 to 0.17 when using version='2.0'

2020-04-30 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-8657?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17096866#comment-17096866 ] Wes McKinney edited comment on ARROW-8657 at 4/30/20, 6:37 PM: --- For the

[jira] [Updated] (ARROW-8657) [Python][C++][Parquet] Forward compatibility issue from 0.16 to 0.17 when using version='2.0'

2020-04-30 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-8657?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney updated ARROW-8657: Fix Version/s: 0.17.1 > [Python][C++][Parquet] Forward compatibility issue from 0.16 to 0.17 when

[jira] [Updated] (ARROW-8657) [Python][C++][Parquet] Forward compatibility issue from 0.16 to 0.17 when using version='2.0'

2020-04-30 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-8657?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney updated ARROW-8657: Summary: [Python][C++][Parquet] Forward compatibility issue from 0.16 to 0.17 when using

[jira] [Commented] (ARROW-8657) Distinguish parquet version 2 logical type vs DataPageV2

2020-04-30 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-8657?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17096862#comment-17096862 ] Wes McKinney commented on ARROW-8657: - > As a result all parquet files that were created with

[jira] [Commented] (ARROW-8654) [Python] pyarrow 0.17.0 fails reading "wide" parquet files

2020-04-30 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-8654?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17096859#comment-17096859 ] Wes McKinney commented on ARROW-8654: - Also, the perf of reading very wide Parquet files won't be

[jira] [Commented] (ARROW-8654) [Python] pyarrow 0.17.0 fails reading "wide" parquet files

2020-04-30 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-8654?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17096858#comment-17096858 ] Wes McKinney commented on ARROW-8654: - FWIW, "large" metadata from very wide tables is a problematic

[jira] [Closed] (ARROW-8638) Arrow Cython API Usage Gives an error when calling CTable API Endpoints

2020-04-30 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-8638?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney closed ARROW-8638. --- Resolution: Information Provided Closing since there isn't a bug to fix, further discussion can take

[jira] [Commented] (ARROW-8642) Is there a good way to convert data types from numpy types to pyarrow DataType?

2020-04-30 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-8642?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17096567#comment-17096567 ] Wes McKinney commented on ARROW-8642: - [~trickarcher] if you have questions it's better to use the

[jira] [Commented] (ARROW-8641) [Python] Regression in feather: no longer supports permutation in column selection

2020-04-30 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-8641?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17096566#comment-17096566 ] Wes McKinney commented on ARROW-8641: - Too bad this was not tested > [Python] Regression in feather:

[jira] [Updated] (ARROW-8641) [Python] Regression in feather: no longer supports permutation in column selection

2020-04-30 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-8641?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney updated ARROW-8641: Fix Version/s: 1.0.0 > [Python] Regression in feather: no longer supports permutation in column >

[jira] [Closed] (ARROW-8635) [R] test-filesystem.R takes ~40 seconds to run?

2020-04-29 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-8635?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney closed ARROW-8635. --- Fix Version/s: (was: 1.0.0) Resolution: Workaround Cool thanks. I'm setting {code}

[jira] [Created] (ARROW-8635) [R] test-filesystem.R takes ~40 seconds to run?

2020-04-29 Thread Wes McKinney (Jira)
Wes McKinney created ARROW-8635: --- Summary: [R] test-filesystem.R takes ~40 seconds to run? Key: ARROW-8635 URL: https://issues.apache.org/jira/browse/ARROW-8635 Project: Apache Arrow Issue

[jira] [Commented] (ARROW-7893) [Developer][GLib] Document GLib development workflow when using conda environment on GTK-based Linux systems

2020-04-29 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-7893?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17095966#comment-17095966 ] Wes McKinney commented on ARROW-7893: - I found out the solution to this. The problem occurs when

[jira] [Commented] (ARROW-8633) [C++] Add ValidateAscii function

2020-04-29 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-8633?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17095920#comment-17095920 ] Wes McKinney commented on ARROW-8633: - Testing (ignore) > [C++] Add ValidateAscii function >

[jira] [Issue Comment Deleted] (ARROW-8633) [C++] Add ValidateAscii function

2020-04-29 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-8633?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney updated ARROW-8633: Comment: was deleted (was: Testing (ignore)) > [C++] Add ValidateAscii function >

[jira] [Created] (ARROW-8633) [C++] Add ValidateAscii function

2020-04-29 Thread Wes McKinney (Jira)
Wes McKinney created ARROW-8633: --- Summary: [C++] Add ValidateAscii function Key: ARROW-8633 URL: https://issues.apache.org/jira/browse/ARROW-8633 Project: Apache Arrow Issue Type: New Feature

[jira] [Comment Edited] (ARROW-8626) [C++] Implement "round robin" scheduler interface to fixed-size ThreadPool

2020-04-29 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-8626?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17095585#comment-17095585 ] Wes McKinney edited comment on ARROW-8626 at 4/29/20, 4:08 PM: --- This work

[jira] [Commented] (ARROW-8626) [C++] Implement "round robin" scheduler interface to fixed-size ThreadPool

2020-04-29 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-8626?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17095585#comment-17095585 ] Wes McKinney commented on ARROW-8626: - This work should also be able to give way to other kinds of

[jira] [Created] (ARROW-8626) [C++] Implement "round robin" scheduler interface to fixed-size ThreadPool

2020-04-29 Thread Wes McKinney (Jira)
Wes McKinney created ARROW-8626: --- Summary: [C++] Implement "round robin" scheduler interface to fixed-size ThreadPool Key: ARROW-8626 URL: https://issues.apache.org/jira/browse/ARROW-8626 Project:

[jira] [Commented] (ARROW-8625) Minimum working example of `UnionArray.from_buffers()` method

2020-04-29 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-8625?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=1709#comment-1709 ] Wes McKinney commented on ARROW-8625: - Could you clarify what you're looking for? This is documented

[jira] [Closed] (ARROW-8624) [Packaging] Linux system packages aren't building with ARROW_DATASET=ON

2020-04-29 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-8624?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney closed ARROW-8624. --- Resolution: Not A Problem ARROW_DATASET is enabled by ARROW_PYTHON=ON. The dataset libraries are

[jira] [Created] (ARROW-8623) [C++][Gandiva] Reduce use of Boost, remove Boost headers from header files

2020-04-29 Thread Wes McKinney (Jira)
Wes McKinney created ARROW-8623: --- Summary: [C++][Gandiva] Reduce use of Boost, remove Boost headers from header files Key: ARROW-8623 URL: https://issues.apache.org/jira/browse/ARROW-8623 Project:

[jira] [Closed] (ARROW-8620) arrow header compiler error using nvcc

2020-04-29 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-8620?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney closed ARROW-8620. --- Resolution: Duplicate Dup of ARROW-8608 > arrow header compiler error using nvcc >

[jira] [Updated] (ARROW-8619) [C++] Use distinct Type::type values for interval types

2020-04-28 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-8619?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney updated ARROW-8619: Description: This is a breaking API change, but {{MonthIntervalType}} and {{DayTimeIntervalType}}

[jira] [Assigned] (ARROW-8619) [C++] Use distinct Type::type values for interval types

2020-04-28 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-8619?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney reassigned ARROW-8619: --- Assignee: Wes McKinney > [C++] Use distinct Type::type values for interval types >

[jira] [Created] (ARROW-8619) [C++] Use distinct Type::type values for interval types

2020-04-28 Thread Wes McKinney (Jira)
Wes McKinney created ARROW-8619: --- Summary: [C++] Use distinct Type::type values for interval types Key: ARROW-8619 URL: https://issues.apache.org/jira/browse/ARROW-8619 Project: Apache Arrow

[jira] [Updated] (ARROW-8617) [Rust] simd_load_set_invalid does not exist on aarch64

2020-04-28 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-8617?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney updated ARROW-8617: Summary: [Rust] simd_load_set_invalid does not exist on aarch64 (was: simd_load_set_invalid does

[jira] [Updated] (ARROW-8475) [CI][Crossbow] Rehabilitate (or delete) hiveserver2 nightly job

2020-04-28 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-8475?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney updated ARROW-8475: Component/s: C++ > [CI][Crossbow] Rehabilitate (or delete) hiveserver2 nightly job >

[jira] [Updated] (ARROW-8592) [C++] Docs still list LLVM 7 as compiler used

2020-04-28 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-8592?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney updated ARROW-8592: Fix Version/s: 1.0.0 > [C++] Docs still list LLVM 7 as compiler used >

[jira] [Commented] (ARROW-8614) [Website] Create Rust-specific 0.17.0 blog post

2020-04-28 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-8614?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17094621#comment-17094621 ] Wes McKinney commented on ARROW-8614: - I agree that having Rust blog posts on arrow.apache.org would

[jira] [Updated] (ARROW-8615) [R] Incorrect error message when passing file without random access to read_feather

2020-04-28 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-8615?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney updated ARROW-8615: Summary: [R] Incorrect error message when passing file without random access to read_feather

[jira] [Commented] (ARROW-8615) [R] read_feather with CompressedInputStream fail

2020-04-28 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-8615?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17094618#comment-17094618 ] Wes McKinney commented on ARROW-8615: - This isn't supported because the Feather format requires

[jira] [Commented] (ARROW-7605) [C++] Merge jemalloc and other BUNDLED dependencies into libarrow.a

2020-04-28 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-7605?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17094537#comment-17094537 ] Wes McKinney commented on ARROW-7605: - I need someone to pick up the PR and help, I have too little

[jira] [Commented] (ARROW-8611) [R] Can't install arrow 0.17 on Ubuntu 18.04 R 3.6.3

2020-04-28 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-8611?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17094528#comment-17094528 ] Wes McKinney commented on ARROW-8611: - Can you describe the steps you took to arrive at the above

[jira] [Updated] (ARROW-8611) [R] Can't install arrow 0.17 on Ubuntu 18.04 R 3.6.3

2020-04-28 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-8611?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney updated ARROW-8611: Summary: [R] Can't install arrow 0.17 on Ubuntu 18.04 R 3.6.3 (was: Can't install arrow 0.17 on

[jira] [Resolved] (ARROW-8606) [CI] Don't trigger all builds on a change to any file in ci/

2020-04-27 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-8606?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney resolved ARROW-8606. - Fix Version/s: 1.0.0 Resolution: Fixed Issue resolved by pull request 7046

[jira] [Updated] (ARROW-8157) [C++] Upgrade to LLVM 9

2020-04-26 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-8157?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney updated ARROW-8157: Fix Version/s: 1.0.0 > [C++] Upgrade to LLVM 9 > --- > > Key:

[jira] [Updated] (ARROW-8116) [C++] Create CMake utility to streamline creating ADD_$COMPONENT_TEST helper functions

2020-04-26 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-8116?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney updated ARROW-8116: Fix Version/s: 1.0.0 > [C++] Create CMake utility to streamline creating ADD_$COMPONENT_TEST

[jira] [Updated] (ARROW-8152) [C++] IO: split large coalesced reads into smaller ones

2020-04-26 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-8152?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney updated ARROW-8152: Fix Version/s: 1.0.0 > [C++] IO: split large coalesced reads into smaller ones >

[jira] [Updated] (ARROW-8078) [Python] Missing links in the docs regarding field and schema DataTypes

2020-04-26 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-8078?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney updated ARROW-8078: Fix Version/s: 1.0.0 > [Python] Missing links in the docs regarding field and schema DataTypes >

[jira] [Resolved] (ARROW-8069) [C++] Should the default value of "check_metadata" arguments of Equals methods be "true"?

2020-04-26 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-8069?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney resolved ARROW-8069. - Fix Version/s: 0.17.0 Assignee: Wes McKinney Resolution: Fixed The default was

[jira] [Updated] (ARROW-8062) [C++][Dataset] Parquet Dataset factory from a _metadata/_common_metadata file

2020-04-26 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-8062?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney updated ARROW-8062: Fix Version/s: 1.0.0 > [C++][Dataset] Parquet Dataset factory from a _metadata/_common_metadata

[jira] [Updated] (ARROW-8050) [Python][Packaging] Do not include generated Cython source files in wheel packages

2020-04-26 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-8050?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney updated ARROW-8050: Fix Version/s: 1.0.0 > [Python][Packaging] Do not include generated Cython source files in wheel

[jira] [Closed] (ARROW-8041) [C++] protobuf_ep fails to build on Raspbian due to linking issues relating to atomics

2020-04-26 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-8041?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney closed ARROW-8041. --- Resolution: Cannot Reproduce I'm unable to reproduce after resolving ARROW-7968 > [C++] protobuf_ep

[jira] [Updated] (ARROW-8025) [C++] Implement cast to Binary and FixedSizeBinary

2020-04-26 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-8025?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney updated ARROW-8025: Fix Version/s: 1.0.0 > [C++] Implement cast to Binary and FixedSizeBinary >

[jira] [Updated] (ARROW-8023) [Website] Write a blog post about the C data interface

2020-04-26 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-8023?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney updated ARROW-8023: Fix Version/s: 1.0.0 > [Website] Write a blog post about the C data interface >

[jira] [Updated] (ARROW-7976) [C++] Add field to IpcReadOptions to include padding in Buffer metadata accounting

2020-04-26 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-7976?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney updated ARROW-7976: Summary: [C++] Add field to IpcReadOptions to include padding in Buffer metadata accounting (was:

[jira] [Updated] (ARROW-7964) [C++] Add short representation string to common classes

2020-04-26 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-7964?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney updated ARROW-7964: Fix Version/s: 1.0.0 > [C++] Add short representation string to common classes >

[jira] [Updated] (ARROW-7957) [Python] ParquetDataset cannot take HadoopFileSystem as filesystem

2020-04-26 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-7957?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney updated ARROW-7957: Fix Version/s: 1.0.0 > [Python] ParquetDataset cannot take HadoopFileSystem as filesystem >

[jira] [Commented] (ARROW-7871) [Python] Expose more compute kernels

2020-04-26 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-7871?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17092958#comment-17092958 ] Wes McKinney commented on ARROW-7871: - I'm working on a plan (as discussed on the mailing list) that

[jira] [Updated] (ARROW-7878) [C++] Implement LogicalPlan and LogicalPlanBuilder

2020-04-26 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-7878?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney updated ARROW-7878: Fix Version/s: 1.0.0 > [C++] Implement LogicalPlan and LogicalPlanBuilder >

[jira] [Commented] (ARROW-7873) [Python] Segfault in pandas version 1.0.1, read_parquet after creating a clickhouse odbc connection

2020-04-26 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-7873?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17092959#comment-17092959 ] Wes McKinney commented on ARROW-7873: - [~mvcalder] were you able to resolve this? > [Python]

[jira] [Updated] (ARROW-7867) [Python] ArrowIOError: Invalid Parquet file size is 0 bytes on reading from S3

2020-04-26 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-7867?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney updated ARROW-7867: Labels: dataset-parquet-read (was: ) > [Python] ArrowIOError: Invalid Parquet file size is 0

[jira] [Updated] (ARROW-7867) [Python] ArrowIOError: Invalid Parquet file size is 0 bytes on reading from S3

2020-04-26 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-7867?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney updated ARROW-7867: Fix Version/s: 1.0.0 > [Python] ArrowIOError: Invalid Parquet file size is 0 bytes on reading from

[jira] [Commented] (ARROW-7867) [Python] ArrowIOError: Invalid Parquet file size is 0 bytes on reading from S3

2020-04-26 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-7867?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17092957#comment-17092957 ] Wes McKinney commented on ARROW-7867: - cc [~jorisvandenbossche] > [Python] ArrowIOError: Invalid

[jira] [Assigned] (ARROW-7871) [Python] Expose more compute kernels

2020-04-26 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-7871?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney reassigned ARROW-7871: --- Assignee: Wes McKinney > [Python] Expose more compute kernels >

<    4   5   6   7   8   9   10   11   12   13   >