[jira] [Created] (ARROW-7059) Reading parquet file with many columns is still slow for 0.15.1

2019-11-04 Thread Eric Kisslinger (Jira)
Eric Kisslinger created ARROW-7059: -- Summary: Reading parquet file with many columns is still slow for 0.15.1 Key: ARROW-7059 URL: https://issues.apache.org/jira/browse/ARROW-7059 Project: Apache

[jira] [Commented] (ARROW-7059) [Python] Reading parquet file with many columns is much slower in 0.15.x versus 0.14.x

2019-11-05 Thread Eric Kisslinger (Jira)
[ https://issues.apache.org/jira/browse/ARROW-7059?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16967688#comment-16967688 ] Eric Kisslinger commented on ARROW-7059: I found this workaround which might help in

[jira] [Commented] (ARROW-7059) [Python] Reading parquet file with many columns is much slower in 0.15.x versus 0.14.x

2019-11-06 Thread Eric Kisslinger (Jira)
[ https://issues.apache.org/jira/browse/ARROW-7059?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16968718#comment-16968718 ] Eric Kisslinger commented on ARROW-7059: np. For completeness I also noticed what may be a

[jira] [Updated] (ARROW-7059) [Python] Reading parquet file with many columns is much slower in 0.15.x versus 0.14.x

2019-11-06 Thread Eric Kisslinger (Jira)
[ https://issues.apache.org/jira/browse/ARROW-7059?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Eric Kisslinger updated ARROW-7059: --- Attachment: image-2019-11-06-13-16-05-102.png > [Python] Reading parquet file with many

[jira] [Commented] (ARROW-7059) [Python] Reading parquet file with many columns is much slower in 0.15.x versus 0.14.x

2019-11-06 Thread Eric Kisslinger (Jira)
[ https://issues.apache.org/jira/browse/ARROW-7059?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16968488#comment-16968488 ] Eric Kisslinger commented on ARROW-7059: Thanks for the suggestion. I was unfamiliar with perf.

[jira] [Updated] (ARROW-7059) [Python] Reading parquet file with many columns is much slower in 0.15.x versus 0.14.x

2019-11-06 Thread Eric Kisslinger (Jira)
[ https://issues.apache.org/jira/browse/ARROW-7059?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Eric Kisslinger updated ARROW-7059: --- Attachment: image-2019-11-06-08-18-42-783.png > [Python] Reading parquet file with many

[jira] [Updated] (ARROW-7059) [Python] Reading parquet file with many columns is much slower in 0.15.x versus 0.14.x

2019-11-06 Thread Eric Kisslinger (Jira)
[ https://issues.apache.org/jira/browse/ARROW-7059?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Eric Kisslinger updated ARROW-7059: --- Attachment: image-2019-11-06-08-23-18-897.png > [Python] Reading parquet file with many

[jira] [Updated] (ARROW-7059) [Python] Reading parquet file with many columns is much slower in 0.15.x versus 0.14.x

2019-11-06 Thread Eric Kisslinger (Jira)
[ https://issues.apache.org/jira/browse/ARROW-7059?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Eric Kisslinger updated ARROW-7059: --- Attachment: image-2019-11-06-08-25-05-885.png > [Python] Reading parquet file with many

[jira] [Updated] (ARROW-7059) [Python] Reading parquet file with many columns is much slower in 0.15.x versus 0.14.x

2019-11-06 Thread Eric Kisslinger (Jira)
[ https://issues.apache.org/jira/browse/ARROW-7059?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Eric Kisslinger updated ARROW-7059: --- Attachment: image-2019-11-06-08-19-11-662.png > [Python] Reading parquet file with many

[jira] [Commented] (ARROW-7059) [Python] Reading parquet file with many columns is much slower in 0.15.x versus 0.14.x

2019-11-06 Thread Eric Kisslinger (Jira)
[ https://issues.apache.org/jira/browse/ARROW-7059?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16968557#comment-16968557 ] Eric Kisslinger commented on ARROW-7059: In

[jira] [Updated] (ARROW-7059) [Python] Reading parquet file with many columns is much slower in 0.15.x versus 0.14.x

2019-11-06 Thread Eric Kisslinger (Jira)
[ https://issues.apache.org/jira/browse/ARROW-7059?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Eric Kisslinger updated ARROW-7059: --- Attachment: image-2019-11-06-09-23-54-372.png > [Python] Reading parquet file with many

[jira] [Created] (ARROW-8694) parquet.read_schema() fails when loading wide table created from Pandas DataFrame

2020-05-04 Thread Eric Kisslinger (Jira)
Eric Kisslinger created ARROW-8694: -- Summary: parquet.read_schema() fails when loading wide table created from Pandas DataFrame Key: ARROW-8694 URL: https://issues.apache.org/jira/browse/ARROW-8694

[jira] [Commented] (ARROW-8694) [Python][Parquet] parquet.read_schema() fails when loading wide table created from Pandas DataFrame

2020-05-05 Thread Eric Kisslinger (Jira)
[ https://issues.apache.org/jira/browse/ARROW-8694?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17099866#comment-17099866 ] Eric Kisslinger commented on ARROW-8694: I can't really disagree with the founders of this very

[jira] [Commented] (ARROW-8694) [Python][Parquet] parquet.read_schema() fails when loading wide table created from Pandas DataFrame

2020-05-06 Thread Eric Kisslinger (Jira)
[ https://issues.apache.org/jira/browse/ARROW-8694?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17100797#comment-17100797 ] Eric Kisslinger commented on ARROW-8694: Thanks for the clarification on what qualifies as