[jira] [Commented] (ARROW-8772) [C++] Expand SumKernel benchmark to more types

2020-05-12 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-8772?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17105432#comment-17105432 ] Wes McKinney commented on ARROW-8772: - Please be cautious about patches to src/arrow/compute if you

[jira] [Created] (ARROW-8769) [C++] Add convenience methods to access fields by name in StructScalar

2020-05-11 Thread Wes McKinney (Jira)
Wes McKinney created ARROW-8769: --- Summary: [C++] Add convenience methods to access fields by name in StructScalar Key: ARROW-8769 URL: https://issues.apache.org/jira/browse/ARROW-8769 Project: Apache

[jira] [Assigned] (ARROW-8750) [Python] pyarrow.feather.write_feather does not default to lz4 compression if it's available

2020-05-11 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-8750?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney reassigned ARROW-8750: --- Assignee: Wes McKinney > [Python] pyarrow.feather.write_feather does not default to lz4

[jira] [Commented] (ARROW-555) [C++] String algorithm library for StringArray/BinaryArray

2020-05-11 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-555?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17104685#comment-17104685 ] Wes McKinney commented on ARROW-555: Having ASCII versions of functions sounds fine to me. There is a

[jira] [Commented] (ARROW-8765) [C++] Design Scheduler API

2020-05-11 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-8765?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17104625#comment-17104625 ] Wes McKinney commented on ARROW-8765: - Possibly a dup of ARROW-8667 > [C++] Design Scheduler API >

[jira] [Commented] (ARROW-8767) [C++] Make ThreadPool task ordering configurable

2020-05-11 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-8767?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17104623#comment-17104623 ] Wes McKinney commented on ARROW-8767: - I linked a couple of related issues. Whatever solution is

[jira] [Commented] (ARROW-555) [C++] String algorithm library for StringArray/BinaryArray

2020-05-11 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-555?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17104530#comment-17104530 ] Wes McKinney commented on ARROW-555: Cool. I will circle back here once I have a PR up for the work I

[jira] [Created] (ARROW-8762) [C++][Gandiva] Replace Gandiva's BitmapAnd with common implementation

2020-05-11 Thread Wes McKinney (Jira)
Wes McKinney created ARROW-8762: --- Summary: [C++][Gandiva] Replace Gandiva's BitmapAnd with common implementation Key: ARROW-8762 URL: https://issues.apache.org/jira/browse/ARROW-8762 Project: Apache

[jira] [Commented] (ARROW-555) [C++] String algorithm library for StringArray/BinaryArray

2020-05-11 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-555?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17104443#comment-17104443 ] Wes McKinney commented on ARROW-555: Update: I'm in the middle of an overhaul of the API for

[jira] [Commented] (ARROW-7905) [Go][Parquet] Port the C++ Parquet implementation to Go

2020-05-11 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-7905?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17104426#comment-17104426 ] Wes McKinney commented on ARROW-7905: - We can discuss more on the mailing list, but keep in mind that

[jira] [Commented] (ARROW-7905) [Go][Parquet] Port the C++ Parquet implementation to Go

2020-05-10 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-7905?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17103976#comment-17103976 ] Wes McKinney commented on ARROW-7905: - [~emkornfi...@gmail.com] I'm curious what you mean about #1.

[jira] [Updated] (ARROW-8752) [Rust] Remove unused hashmap

2020-05-09 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-8752?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney updated ARROW-8752: Summary: [Rust] Remove unused hashmap (was: Remove unused hashmap ) > [Rust] Remove unused

[jira] [Updated] (ARROW-8752) [Rust] Remove unused hashmap

2020-05-09 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-8752?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney updated ARROW-8752: Component/s: Rust > [Rust] Remove unused hashmap > - > >

[jira] [Created] (ARROW-8750) [Python] pyarrow.feather.write_feather does not default to lz4 compression if it's available

2020-05-09 Thread Wes McKinney (Jira)
Wes McKinney created ARROW-8750: --- Summary: [Python] pyarrow.feather.write_feather does not default to lz4 compression if it's available Key: ARROW-8750 URL: https://issues.apache.org/jira/browse/ARROW-8750

[jira] [Updated] (ARROW-8749) [C++] IpcFormatWriter writes dictionary batches with wrong ID

2020-05-09 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-8749?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney updated ARROW-8749: Fix Version/s: (was: 0.17.1) > [C++] IpcFormatWriter writes dictionary batches with wrong ID >

[jira] [Commented] (ARROW-8749) [C++] IpcFormatWriter writes dictionary batches with wrong ID

2020-05-09 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-8749?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17103484#comment-17103484 ] Wes McKinney commented on ARROW-8749: - I'd guess the bug has been present for a while, then > [C++]

[jira] [Resolved] (ARROW-8747) [C++] Feather tests with compression cause failure on big-endian platforms

2020-05-09 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-8747?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney resolved ARROW-8747. - Fix Version/s: 1.0.0 Resolution: Fixed Issue resolved by pull request 7137

[jira] [Commented] (ARROW-8749) [C++] IpcFormatWriter writes dictionary batches with wrong ID

2020-05-09 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-8749?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17103475#comment-17103475 ] Wes McKinney commented on ARROW-8749: - Definitely not good. This may be an artifact of the fact that

[jira] [Updated] (ARROW-8749) [C++] IpcFormatWriter writes dictionary batches with wrong ID

2020-05-09 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-8749?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney updated ARROW-8749: Fix Version/s: 0.17.1 1.0.0 > [C++] IpcFormatWriter writes dictionary batches

[jira] [Commented] (ARROW-8746) [Python][Documentation] Add column limit recommendations Parquet page

2020-05-09 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-8746?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17103470#comment-17103470 ] Wes McKinney commented on ARROW-8746: - I appreciate the interest but perhaps let's discuss on the

[jira] [Created] (ARROW-8746) [Python][Documentation] Add column limit recommendations Parquet page

2020-05-09 Thread Wes McKinney (Jira)
Wes McKinney created ARROW-8746: --- Summary: [Python][Documentation] Add column limit recommendations Parquet page Key: ARROW-8746 URL: https://issues.apache.org/jira/browse/ARROW-8746 Project: Apache

[jira] [Commented] (ARROW-8654) [Python] pyarrow 0.17.0 fails reading "wide" parquet files

2020-05-09 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-8654?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17103376#comment-17103376 ] Wes McKinney commented on ARROW-8654: - I agree that adding documentation recommending against very

[jira] [Closed] (ARROW-8654) [Python] pyarrow 0.17.0 fails reading "wide" parquet files

2020-05-09 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-8654?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney closed ARROW-8654. --- Resolution: Duplicate This is a dup of ARROW-8694, which has been fixed. There is discussion on the

[jira] [Resolved] (ARROW-5875) [FlightRPC] Test RPC features in integration tests

2020-05-08 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-5875?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney resolved ARROW-5875. - Resolution: Fixed Issue resolved by pull request 6617

[jira] [Assigned] (ARROW-5875) [FlightRPC] Test RPC features in integration tests

2020-05-08 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-5875?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney reassigned ARROW-5875: --- Assignee: David Li > [FlightRPC] Test RPC features in integration tests >

[jira] [Updated] (ARROW-5875) [FlightRPC] Test RPC features in integration tests

2020-05-08 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-5875?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney updated ARROW-5875: Fix Version/s: (was: 2.0.0) 1.0.0 > [FlightRPC] Test RPC features in

[jira] [Commented] (ARROW-8739) [Java] Standardise Logger naming

2020-05-08 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-8739?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17102663#comment-17102663 ] Wes McKinney commented on ARROW-8739: - If you would please add "[$COMPONENT]" to the issue title >

[jira] [Updated] (ARROW-8738) [Java] Investigate adding a getUnsafe method to vectors

2020-05-08 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-8738?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney updated ARROW-8738: Summary: [Java] Investigate adding a getUnsafe method to vectors (was: Investigate adding a

[jira] [Updated] (ARROW-8739) [Java] Standardise Logger naming

2020-05-08 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-8739?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney updated ARROW-8739: Summary: [Java] Standardise Logger naming (was: Standardise Logger naming) > [Java] Standardise

[jira] [Updated] (ARROW-8636) [C++][Plasma] plasma client delete (of objectid) causes an exception and abort

2020-05-08 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-8636?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney updated ARROW-8636: Summary: [C++][Plasma] plasma client delete (of objectid) causes an exception and abort (was:

[jira] [Issue Comment Deleted] (ARROW-8734) [R] Compilation error on macOS

2020-05-07 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-8734?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney updated ARROW-8734: Comment: was deleted (was: The development version of the R package relies on the development

[jira] [Commented] (ARROW-8734) [R] Compilation error on macOS

2020-05-07 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-8734?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17101954#comment-17101954 ] Wes McKinney commented on ARROW-8734: - The development version of the R package relies on the

[jira] [Updated] (ARROW-8706) [C++][Parquet] Tracking JIRA for PARQUET-1857 (unencrypted INT16_MAX Parquet row group limit)

2020-05-07 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-8706?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney updated ARROW-8706: Description: JIRA tracking PARQUET-1857 to make sure this patch gets included in a patch release

[jira] [Resolved] (ARROW-8706) [C++][Parquet] Tracking JIRA for PARQUET-1857 (unencrypted INT16_MAX Parquet row group limit)

2020-05-07 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-8706?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney resolved ARROW-8706. - Assignee: Wes McKinney Resolution: Fixed This was merged > [C++][Parquet] Tracking JIRA

[jira] [Updated] (ARROW-8726) [R][Dataset] segfault with a mis-specified partition

2020-05-07 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-8726?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney updated ARROW-8726: Fix Version/s: 1.0.0 > [R][Dataset] segfault with a mis-specified partition >

[jira] [Updated] (ARROW-8704) [C++] Fix Parquet crash on invalid input (OSS-Fuzz)

2020-05-07 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-8704?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney updated ARROW-8704: Fix Version/s: 0.17.1 > [C++] Fix Parquet crash on invalid input (OSS-Fuzz) >

[jira] [Updated] (ARROW-8728) [C++] Bitmap operation may cause buffer overflow

2020-05-07 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-8728?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney updated ARROW-8728: Fix Version/s: 0.17.1 > [C++] Bitmap operation may cause buffer overflow >

[jira] [Resolved] (ARROW-8641) [Python] Regression in feather: no longer supports permutation in column selection

2020-05-07 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-8641?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney resolved ARROW-8641. - Resolution: Fixed Issue resolved by pull request 7122

[jira] [Commented] (ARROW-8731) Error when using toPandas with PyArrow

2020-05-07 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-8731?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17101823#comment-17101823 ] Wes McKinney commented on ARROW-8731: - cc [~bryanc] -- I think you need to set an environment

[jira] [Updated] (ARROW-8725) [Rust] redundant directory walk in rust parquet datasource code

2020-05-07 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-8725?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney updated ARROW-8725: Fix Version/s: (was: 0.18.0) 1.0.0 > [Rust] redundant directory walk in

[jira] [Updated] (ARROW-8710) [Rust] Continuation marker not written correctly in IPC writer, and stream not flushed

2020-05-07 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-8710?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney updated ARROW-8710: Fix Version/s: (was: 0.18.0) 1.0.0 > [Rust] Continuation marker not written

[jira] [Updated] (ARROW-8730) [Rust] Use slice instead of for function arguments

2020-05-07 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-8730?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney updated ARROW-8730: Fix Version/s: (was: 0.18.0) 1.0.0 > [Rust] Use slice instead of for

[jira] [Updated] (ARROW-8728) [C++] Bitmap operation may cause buffer overflow

2020-05-07 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-8728?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney updated ARROW-8728: Fix Version/s: (was: 0.18.0) 1.0.0 > [C++] Bitmap operation may cause

[jira] [Created] (ARROW-8727) [C++] Do not require struct-initialization of StringConverter to parse strings to other types

2020-05-06 Thread Wes McKinney (Jira)
Wes McKinney created ARROW-8727: --- Summary: [C++] Do not require struct-initialization of StringConverter to parse strings to other types Key: ARROW-8727 URL: https://issues.apache.org/jira/browse/ARROW-8727

[jira] [Updated] (ARROW-8726) [R][Dataset] segfault with a mis-specified partition

2020-05-06 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-8726?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney updated ARROW-8726: Summary: [R][Dataset] segfault with a mis-specified partition (was: segfault with a mis-specified

[jira] [Updated] (ARROW-8725) redundant directory walk in rust parquet datasource code

2020-05-06 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-8725?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney updated ARROW-8725: Component/s: Rust - DataFusion Rust > redundant directory walk in rust parquet

[jira] [Updated] (ARROW-8725) [Rust] redundant directory walk in rust parquet datasource code

2020-05-06 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-8725?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney updated ARROW-8725: Summary: [Rust] redundant directory walk in rust parquet datasource code (was: redundant

[jira] [Closed] (ARROW-8719) Stream data easily- function to convert record batch to streamable format

2020-05-06 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-8719?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney closed ARROW-8719. --- Resolution: Information Provided > Stream data easily- function to convert record batch to

[jira] [Commented] (ARROW-8719) Stream data easily- function to convert record batch to streamable format

2020-05-06 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-8719?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17101184#comment-17101184 ] Wes McKinney commented on ARROW-8719: - I'm inclined to close this issue because we don't do user

[jira] [Commented] (ARROW-8719) Stream data easily- function to convert record batch to streamable format

2020-05-06 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-8719?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17101183#comment-17101183 ] Wes McKinney commented on ARROW-8719: - Just looking at the client code {code} int sock;

[jira] [Commented] (ARROW-7076) `pip install pyarrow` with python 3.8 fail with message : Could not build wheels for pyarrow which use PEP 517 and cannot be installed directly

2020-05-06 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-7076?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17100863#comment-17100863 ] Wes McKinney commented on ARROW-7076: - [~ManthanAdmane] the issue you're describing is beyond the

[jira] [Updated] (ARROW-8709) [C++] Implement Array to JSON function

2020-05-06 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-8709?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney updated ARROW-8709: Summary: [C++] Implement Array to JSON function (was: [C++] ArrayToJSON) > [C++] Implement Array

[jira] [Comment Edited] (ARROW-8709) [C++] ArrayToJSON

2020-05-06 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-8709?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17100851#comment-17100851 ] Wes McKinney edited comment on ARROW-8709 at 5/6/20, 2:36 PM: -- [~abemammen]

[jira] [Commented] (ARROW-8709) [C++] ArrayToJSON

2020-05-06 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-8709?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17100851#comment-17100851 ] Wes McKinney commented on ARROW-8709: - [~abemammen] generally speaking, in the C++ library we build

[jira] [Commented] (ARROW-8641) [Python] Regression in feather: no longer supports permutation in column selection

2020-05-06 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-8641?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17100848#comment-17100848 ] Wes McKinney commented on ARROW-8641: - Yes, indeed sounds like we need to sort the "included_fields"

[jira] [Commented] (ARROW-8641) [Python] Regression in feather: no longer supports permutation in column selection

2020-05-06 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-8641?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17100760#comment-17100760 ] Wes McKinney commented on ARROW-8641: - Probably easiest to handle this on the Python side. Can we add

[jira] [Created] (ARROW-8711) [Python] Expose strptime timestamp parsing in read_csv conversion options

2020-05-05 Thread Wes McKinney (Jira)
Wes McKinney created ARROW-8711: --- Summary: [Python] Expose strptime timestamp parsing in read_csv conversion options Key: ARROW-8711 URL: https://issues.apache.org/jira/browse/ARROW-8711 Project:

[jira] [Created] (ARROW-8712) [R] Expose strptime timestamp parsing in read_csv conversion options

2020-05-05 Thread Wes McKinney (Jira)
Wes McKinney created ARROW-8712: --- Summary: [R] Expose strptime timestamp parsing in read_csv conversion options Key: ARROW-8712 URL: https://issues.apache.org/jira/browse/ARROW-8712 Project: Apache

[jira] [Commented] (ARROW-2034) [C++] Filesystem implementation for Azure Blob Store

2020-05-05 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-2034?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17100361#comment-17100361 ] Wes McKinney commented on ARROW-2034: - I see that TileDB (MIT license) has built a C++ wrapper for

[jira] [Commented] (ARROW-1231) [C++] Add filesystem / IO implementation for Google Cloud Storage

2020-05-05 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-1231?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17100358#comment-17100358 ] Wes McKinney commented on ARROW-1231: - I see that TileDB (MIT license) has built a wrapper for GCS

[jira] [Resolved] (ARROW-8590) [Rust] Use Arrow pretty print utility in DataFusion

2020-05-05 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-8590?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney resolved ARROW-8590. - Fix Version/s: 1.0.0 Resolution: Fixed Issue resolved by pull request 7035

[jira] [Assigned] (ARROW-8590) [Rust] Use Arrow pretty print utility in DataFusion

2020-05-05 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-8590?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney reassigned ARROW-8590: --- Assignee: Mark Hildreth > [Rust] Use Arrow pretty print utility in DataFusion >

[jira] [Assigned] (ARROW-8684) [Python] "SystemError: Bad call flags in _PyMethodDef_RawFastCallDict" in Python 3.7.7 on macOS when using pyarrow wheel

2020-05-05 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-8684?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney reassigned ARROW-8684: --- Assignee: Wes McKinney > [Python] "SystemError: Bad call flags in

[jira] [Commented] (ARROW-1614) [C++] Add a Tensor logical value type implemented using ExtensionType

2020-05-05 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-1614?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17100306#comment-17100306 ] Wes McKinney commented on ARROW-1614: - Sounds like two different extension types. Do you want to

[jira] [Updated] (ARROW-8709) [C++] ArrayToJSON

2020-05-05 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-8709?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney updated ARROW-8709: Summary: [C++] ArrayToJSON (was: ArrayToJSON) > [C++] ArrayToJSON > - > >

[jira] [Resolved] (ARROW-8657) [Python][C++][Parquet] Forward compatibility issue from 0.16 to 0.17 when using version='2.0'

2020-05-05 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-8657?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney resolved ARROW-8657. - Resolution: Fixed Issue resolved by pull request 7089

[jira] [Updated] (ARROW-8608) [C++] Update vendored mpark/variant.h to latest to fix NVCC compilation issues

2020-05-05 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-8608?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney updated ARROW-8608: Fix Version/s: 0.17.1 > [C++] Update vendored mpark/variant.h to latest to fix NVCC compilation

[jira] [Resolved] (ARROW-7391) [Python] Remove unnecessary classes from the binding layer

2020-05-05 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-7391?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney resolved ARROW-7391. - Resolution: Fixed Issue resolved by pull request 7026

[jira] [Comment Edited] (ARROW-8694) [Python][Parquet] parquet.read_schema() fails when loading wide table created from Pandas DataFrame

2020-05-05 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-8694?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17100282#comment-17100282 ] Wes McKinney edited comment on ARROW-8694 at 5/5/20, 10:01 PM: ---

[jira] [Commented] (ARROW-8694) [Python][Parquet] parquet.read_schema() fails when loading wide table created from Pandas DataFrame

2020-05-05 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-8694?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17100282#comment-17100282 ] Wes McKinney commented on ARROW-8694: - [~ekisslinger] I don't mean to be argumentative, but where are

[jira] [Resolved] (ARROW-8644) [Python] Dask integration tests failing due to change in not including partition columns

2020-05-05 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-8644?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney resolved ARROW-8644. - Resolution: Fixed Issue resolved by pull request 7096

[jira] [Resolved] (ARROW-8694) [Python][Parquet] parquet.read_schema() fails when loading wide table created from Pandas DataFrame

2020-05-05 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-8694?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney resolved ARROW-8694. - Resolution: Fixed Issue resolved by pull request 7103

[jira] [Created] (ARROW-8706) [C++][Parquet] Tracking JIRA for PARQUET-1857 (unencrypted INT16_MAX Parquet row group limit)

2020-05-05 Thread Wes McKinney (Jira)
Wes McKinney created ARROW-8706: --- Summary: [C++][Parquet] Tracking JIRA for PARQUET-1857 (unencrypted INT16_MAX Parquet row group limit) Key: ARROW-8706 URL: https://issues.apache.org/jira/browse/ARROW-8706

[jira] [Commented] (ARROW-8677) [C++][Parquet] ParquetFileReader unable to read files with more than 32768 row groups

2020-05-05 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-8677?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17100029#comment-17100029 ] Wes McKinney commented on ARROW-8677: - I found the problem -- your file contains 40001 row groups but

[jira] [Updated] (ARROW-8677) [C++][Parquet] ParquetFileReader unable to read files with more than 32767 row groups

2020-05-05 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-8677?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney updated ARROW-8677: Summary: [C++][Parquet] ParquetFileReader unable to read files with more than 32767 row groups

[jira] [Updated] (ARROW-8677) [C++][Parquet] ParquetFileReader unable to read files with more than 32768 row groups

2020-05-05 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-8677?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney updated ARROW-8677: Summary: [C++][Parquet] ParquetFileReader unable to read files with more than 32768 row groups

[jira] [Comment Edited] (ARROW-8677) [Rust][Python][Parquet] Parquet write_batch and read from Python failes with batch size 10000 or 1 but okay with 1000

2020-05-05 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-8677?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17100012#comment-17100012 ] Wes McKinney edited comment on ARROW-8677 at 5/5/20, 3:39 PM: -- I'm looking

[jira] [Commented] (ARROW-8677) [Rust][Python][Parquet] Parquet write_batch and read from Python failes with batch size 10000 or 1 but okay with 1000

2020-05-05 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-8677?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17100012#comment-17100012 ] Wes McKinney commented on ARROW-8677: - I'm looking at the example file > [Rust][Python][Parquet]

[jira] [Assigned] (ARROW-8677) [Rust][Python][Parquet] Parquet write_batch and read from Python failes with batch size 10000 or 1 but okay with 1000

2020-05-05 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-8677?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney reassigned ARROW-8677: --- Assignee: Wes McKinney > [Rust][Python][Parquet] Parquet write_batch and read from Python

[jira] [Closed] (ARROW-8700) [C++] static libgflags.a fails to link properly in gcc 4.x

2020-05-04 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-8700?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney closed ARROW-8700. --- Resolution: Cannot Reproduce This turned out to be a problem with my development environment >

[jira] [Commented] (ARROW-8700) [C++] static libgflags.a fails to link properly in gcc 4.x

2020-05-04 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-8700?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17099435#comment-17099435 ] Wes McKinney commented on ARROW-8700: - Seems possibly related to

[jira] [Created] (ARROW-8700) [C++] static libgflags.a fails to link properly in gcc 4.x

2020-05-04 Thread Wes McKinney (Jira)
Wes McKinney created ARROW-8700: --- Summary: [C++] static libgflags.a fails to link properly in gcc 4.x Key: ARROW-8700 URL: https://issues.apache.org/jira/browse/ARROW-8700 Project: Apache Arrow

[jira] [Resolved] (ARROW-5666) [Python] Underscores in partition (string) values are dropped when reading dataset

2020-05-04 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-5666?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney resolved ARROW-5666. - Resolution: Fixed Test added in

[jira] [Assigned] (ARROW-5666) [Python] Underscores in partition (string) values are dropped when reading dataset

2020-05-04 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-5666?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney reassigned ARROW-5666: --- Assignee: Joris Van den Bossche > [Python] Underscores in partition (string) values are

[jira] [Updated] (ARROW-5666) [Python] Underscores in partition (string) values are dropped when reading dataset

2020-05-04 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-5666?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney updated ARROW-5666: Fix Version/s: 1.0.0 > [Python] Underscores in partition (string) values are dropped when reading

[jira] [Resolved] (ARROW-5310) [Python] better error message on creating ParquetDataset from empty directory

2020-05-04 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-5310?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney resolved ARROW-5310. - Resolution: Fixed Resolved in

[jira] [Assigned] (ARROW-5310) [Python] better error message on creating ParquetDataset from empty directory

2020-05-04 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-5310?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney reassigned ARROW-5310: --- Assignee: Joris Van den Bossche > [Python] better error message on creating ParquetDataset

[jira] [Updated] (ARROW-5310) [Python] better error message on creating ParquetDataset from empty directory

2020-05-04 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-5310?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney updated ARROW-5310: Fix Version/s: 1.0.0 > [Python] better error message on creating ParquetDataset from empty

[jira] [Assigned] (ARROW-5572) [Python] raise error message when passing invalid filter in parquet reading

2020-05-04 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-5572?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney reassigned ARROW-5572: --- Assignee: Joris Van den Bossche > [Python] raise error message when passing invalid filter

[jira] [Resolved] (ARROW-5572) [Python] raise error message when passing invalid filter in parquet reading

2020-05-04 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-5572?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney resolved ARROW-5572. - Fix Version/s: 1.0.0 Resolution: Fixed Issue resolved by pull request 7052

[jira] [Comment Edited] (ARROW-8694) [Python][Parquet] parquet.read_schema() fails when loading wide table created from Pandas DataFrame

2020-05-04 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-8694?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17099408#comment-17099408 ] Wes McKinney edited comment on ARROW-8694 at 5/4/20, 11:02 PM: --- This was

[jira] [Updated] (ARROW-8694) [Python][Parquet] parquet.read_schema() fails when loading wide table created from Pandas DataFrame

2020-05-04 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-8694?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney updated ARROW-8694: Fix Version/s: 0.17.1 > [Python][Parquet] parquet.read_schema() fails when loading wide table

[jira] [Commented] (ARROW-8694) [Python][Parquet] parquet.read_schema() fails when loading wide table created from Pandas DataFrame

2020-05-04 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-8694?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17099408#comment-17099408 ] Wes McKinney commented on ARROW-8694: - This was introduced by

[jira] [Assigned] (ARROW-8694) [Python][Parquet] parquet.read_schema() fails when loading wide table created from Pandas DataFrame

2020-05-04 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-8694?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney reassigned ARROW-8694: --- Assignee: Wes McKinney > [Python][Parquet] parquet.read_schema() fails when loading wide

[jira] [Updated] (ARROW-8694) [Python][Parquet] parquet.read_schema() fails when loading wide table created from Pandas DataFrame

2020-05-04 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-8694?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney updated ARROW-8694: Fix Version/s: 1.0.0 > [Python][Parquet] parquet.read_schema() fails when loading wide table

[jira] [Updated] (ARROW-8694) [Python][Parquet] parquet.read_schema() fails when loading wide table created from Pandas DataFrame

2020-05-04 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-8694?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney updated ARROW-8694: Fix Version/s: (was: 0.17.1) > [Python][Parquet] parquet.read_schema() fails when loading wide

[jira] [Updated] (ARROW-8694) [Python][Parquet] parquet.read_schema() fails when loading wide table created from Pandas DataFrame

2020-05-04 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-8694?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney updated ARROW-8694: Summary: [Python][Parquet] parquet.read_schema() fails when loading wide table created from Pandas

[jira] [Closed] (ARROW-8685) [Python] ImportError with NumPy<1.16.

2020-05-04 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-8685?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney closed ARROW-8685. --- > [Python] ImportError with NumPy<1.16. > - > > Key:

[jira] [Commented] (ARROW-8684) [Python] "SystemError: Bad call flags in _PyMethodDef_RawFastCallDict" in Python 3.7.7 on macOS when using pyarrow wheel

2020-05-04 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-8684?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17099224#comment-17099224 ] Wes McKinney commented on ARROW-8684: - [~apitrou] [~kszucs] another problem that this exposes is that

[jira] [Updated] (ARROW-8677) [Rust][Python][Parquet] Parquet write_batch and read from Python failes with batch size 10000 or 1 but okay with 1000

2020-05-04 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-8677?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney updated ARROW-8677: Summary: [Rust][Python][Parquet] Parquet write_batch and read from Python failes with batch size

<    3   4   5   6   7   8   9   10   11   12   >