[jira] [Created] (ARROW-1883) [Python] BUG: Table.to_pandas metadata checking fails if columns are not present

2017-12-04 Thread Joris Van den Bossche (JIRA)
Joris Van den Bossche created ARROW-1883: Summary: [Python] BUG: Table.to_pandas metadata checking fails if columns are not present Key: ARROW-1883 URL: https://issues.apache.org/jira/browse/ARROW-1883

[jira] [Updated] (ARROW-1883) [Python] BUG: Table.to_pandas metadata checking fails if columns are not present

2017-12-04 Thread Joris Van den Bossche (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-1883?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joris Van den Bossche updated ARROW-1883: - Description: Found this bug in the example in the pandas documentation (), which

[jira] [Updated] (ARROW-1883) [Python] BUG: Table.to_pandas metadata checking fails if columns are not present

2017-12-04 Thread Joris Van den Bossche (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-1883?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joris Van den Bossche updated ARROW-1883: - Description: Found this bug in the example in the pandas documentation (), which

[jira] [Updated] (ARROW-1883) [Python] BUG: Table.to_pandas metadata checking fails if columns are not present

2017-12-04 Thread Joris Van den Bossche (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-1883?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joris Van den Bossche updated ARROW-1883: - Description: Found this bug in the example in the pandas documentation

[jira] [Updated] (ARROW-1883) [Python] BUG: Table.to_pandas metadata checking fails if columns are not present

2017-12-04 Thread Joris Van den Bossche (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-1883?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joris Van den Bossche updated ARROW-1883: - Description: Found this bug in the example in the pandas documentation (), which

[jira] [Commented] (ARROW-1908) [Python] Construction of arrow table from pandas DataFrame with duplicate column names crashes

2017-12-10 Thread Joris Van den Bossche (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-1908?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16285288#comment-16285288 ] Joris Van den Bossche commented on ARROW-1908: -- There is still the general question that

[jira] [Commented] (ARROW-1994) [Python] Test against Pandas master

2018-02-26 Thread Joris Van den Bossche (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-1994?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16376567#comment-16376567 ] Joris Van den Bossche commented on ARROW-1994: -- We have a daily conda build test here:

[jira] [Created] (ARROW-3953) Pandas MultiIndex renamed labels to codes (pd 0.24)

2018-12-07 Thread Joris Van den Bossche (JIRA)
Joris Van den Bossche created ARROW-3953: Summary: Pandas MultiIndex renamed labels to codes (pd 0.24) Key: ARROW-3953 URL: https://issues.apache.org/jira/browse/ARROW-3953 Project: Apache

[jira] [Commented] (ARROW-1983) [Python] Add ability to write parquet `_metadata` file

2019-05-16 Thread Joris Van den Bossche (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-1983?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16841668#comment-16841668 ] Joris Van den Bossche commented on ARROW-1983: -- > We also need to make sure that the file

[jira] [Updated] (ARROW-5359) [Python] timestamp_as_object support for pa.Table.to_pandas in pyarrow

2019-05-17 Thread Joris Van den Bossche (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-5359?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joris Van den Bossche updated ARROW-5359: - Summary: [Python] timestamp_as_object support for pa.Table.to_pandas in pyarrow

[jira] [Commented] (ARROW-3531) [Python] Deprecate Schema.field_by_name in favor of __getitem__

2019-05-27 Thread Joris Van den Bossche (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-3531?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16848892#comment-16848892 ] Joris Van den Bossche commented on ARROW-3531: -- We may also want to have a {{field()}}

[jira] [Commented] (ARROW-1983) [Python] Add ability to write parquet `_metadata` file

2019-05-27 Thread Joris Van den Bossche (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-1983?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16849187#comment-16849187 ] Joris Van den Bossche commented on ARROW-1983: -- I think so yes (at least when reading, it

[jira] [Comment Edited] (ARROW-1983) [Python] Add ability to write parquet `_metadata` file

2019-05-27 Thread Joris Van den Bossche (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-1983?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16849187#comment-16849187 ] Joris Van den Bossche edited comment on ARROW-1983 at 5/27/19 8:29 PM:

[jira] [Created] (ARROW-5427) [Python] RangeIndex serialization change implications

2019-05-27 Thread Joris Van den Bossche (JIRA)
Joris Van den Bossche created ARROW-5427: Summary: [Python] RangeIndex serialization change implications Key: ARROW-5427 URL: https://issues.apache.org/jira/browse/ARROW-5427 Project: Apache

[jira] [Comment Edited] (ARROW-1983) [Python] Add ability to write parquet `_metadata` file

2019-05-27 Thread Joris Van den Bossche (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-1983?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16849187#comment-16849187 ] Joris Van den Bossche edited comment on ARROW-1983 at 5/27/19 8:33 PM:

[jira] [Commented] (ARROW-1983) [Python] Add ability to write parquet `_metadata` file

2019-05-27 Thread Joris Van den Bossche (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-1983?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16849118#comment-16849118 ] Joris Van den Bossche commented on ARROW-1983: -- {quote}Correspondingly, please also write a

[jira] [Assigned] (ARROW-5169) [Python] non-nullable fields are converted to nullable in {{Table.from_pandas}}

2019-05-27 Thread Joris Van den Bossche (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-5169?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joris Van den Bossche reassigned ARROW-5169: Assignee: Joris Van den Bossche > [Python] non-nullable fields are

[jira] [Commented] (ARROW-5430) [Python] Can read but not write parquet partitioned on large ints

2019-05-28 Thread Joris Van den Bossche (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-5430?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16849485#comment-16849485 ] Joris Van den Bossche commented on ARROW-5430: -- Actually, I see we had ARROW-2972 about

[jira] [Commented] (ARROW-5430) [Python] Can read but not write parquet partitioned on large ints

2019-05-28 Thread Joris Van den Bossche (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-5430?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16849499#comment-16849499 ] Joris Van den Bossche commented on ARROW-5430: -- Not fully, see my first comment with a

[jira] [Commented] (ARROW-5430) [Python] Can read but not write parquet partitioned on large ints

2019-05-28 Thread Joris Van den Bossche (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-5430?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16849483#comment-16849483 ] Joris Van den Bossche commented on ARROW-5430: -- Thanks for the report! The error is actually

[jira] [Commented] (ARROW-5430) [Python] Can read but not write parquet partitioned on large ints

2019-05-28 Thread Joris Van den Bossche (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-5430?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16849491#comment-16849491 ] Joris Van den Bossche commented on ARROW-5430: -- I agree that ideally we should either fix it

[jira] [Commented] (ARROW-5430) [Python] Can read but not write parquet partitioned on large ints

2019-05-28 Thread Joris Van den Bossche (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-5430?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16849539#comment-16849539 ] Joris Van den Bossche commented on ARROW-5430: -- The keys come from the directory names, so

[jira] [Updated] (ARROW-5430) [Python] Can read but not write parquet partitioned on large ints

2019-05-28 Thread Joris Van den Bossche (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-5430?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joris Van den Bossche updated ARROW-5430: - Labels: parquet (was: ) > [Python] Can read but not write parquet partitioned

[jira] [Created] (ARROW-5514) [C++] Printer for uint64 shows wrong values

2019-06-05 Thread Joris Van den Bossche (JIRA)
Joris Van den Bossche created ARROW-5514: Summary: [C++] Printer for uint64 shows wrong values Key: ARROW-5514 URL: https://issues.apache.org/jira/browse/ARROW-5514 Project: Apache Arrow

[jira] [Comment Edited] (ARROW-2667) [C++/Python] Add pandas-like take method to Array

2019-06-05 Thread Joris Van den Bossche (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-2667?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16856516#comment-16856516 ] Joris Van den Bossche edited comment on ARROW-2667 at 6/5/19 8:55 AM:

[jira] [Commented] (ARROW-5450) [Python] TimestampArray.to_pylist() fails with OverflowError: Python int too large to convert to C long

2019-06-05 Thread Joris Van den Bossche (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-5450?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16856527#comment-16856527 ] Joris Van den Bossche commented on ARROW-5450: -- Thanks for the report! The problem here is

[jira] [Updated] (ARROW-5104) [Python/C++] Schema for empty tables include index column as integer

2019-06-05 Thread Joris Van den Bossche (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-5104?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joris Van den Bossche updated ARROW-5104: - Fix Version/s: 0.14.0 > [Python/C++] Schema for empty tables include index

[jira] [Commented] (ARROW-5138) [Python/C++] Row group retrieval doesn't restore index properly

2019-06-05 Thread Joris Van den Bossche (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-5138?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16856508#comment-16856508 ] Joris Van den Bossche commented on ARROW-5138: -- [~wesmckinn] I don't think that will solve

[jira] [Commented] (ARROW-2667) [C++/Python] Add pandas-like take method to Array

2019-06-05 Thread Joris Van den Bossche (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-2667?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16856516#comment-16856516 ] Joris Van den Bossche commented on ARROW-2667: -- [~wesmckinn] you renamed this issue to only

[jira] [Commented] (ARROW-5430) [Python] Can read but not write parquet partitioned on large ints

2019-05-28 Thread Joris Van den Bossche (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-5430?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16850158#comment-16850158 ] Joris Van den Bossche commented on ARROW-5430: -- Robin: yes, a fix for the error type +

[jira] [Commented] (ARROW-5450) [Python] TimestampArray.to_pylist() fails with OverflowError: Python int too large to convert to C long

2019-06-07 Thread Joris Van den Bossche (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-5450?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16858365#comment-16858365 ] Joris Van den Bossche commented on ARROW-5450: -- Yes, certainly given the time range

[jira] [Resolved] (ARROW-3801) [Python] Pandas-Arrow roundtrip makes pd categorical index not writeable

2019-06-07 Thread Joris Van den Bossche (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-3801?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joris Van den Bossche resolved ARROW-3801. -- Resolution: Works for Me I am going to close this issue, as I think it is

[jira] [Commented] (ARROW-3801) [Python] Pandas-Arrow roundtrip makes pd categorical index not writeable

2019-06-07 Thread Joris Van den Bossche (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-3801?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16858460#comment-16858460 ] Joris Van den Bossche commented on ARROW-3801: -- [~buhrmann] do you know which version of

[jira] [Commented] (ARROW-2298) [Python] Add option to not consider NaN to be null when converting to an integer Arrow type

2019-06-07 Thread Joris Van den Bossche (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-2298?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16858547#comment-16858547 ] Joris Van den Bossche commented on ARROW-2298: -- [~farnoy] For me, the example you show above

[jira] [Updated] (ARROW-4350) [Python] nested numpy arrays cannot be converted to ListArray

2019-06-07 Thread Joris Van den Bossche (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-4350?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joris Van den Bossche updated ARROW-4350: - Description: Nested numpy arrays (as the scalar value) cannot be converted to a

[jira] [Assigned] (ARROW-2818) [Python] Better error message when passing SparseDataFrame into Table.from_pandas

2019-06-07 Thread Joris Van den Bossche (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-2818?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joris Van den Bossche reassigned ARROW-2818: Assignee: Joris Van den Bossche > [Python] Better error message when

[jira] [Commented] (ARROW-4350) [Python] nested numpy arrays cannot be converted to a list-of-list ListArray

2019-06-07 Thread Joris Van den Bossche (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-4350?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16858438#comment-16858438 ] Joris Van den Bossche commented on ARROW-4350: -- Updated the title and top post with

[jira] [Commented] (ARROW-3801) [Python] Pandas-Arrow roundtrip makes pd categorical index not writeable

2019-06-07 Thread Joris Van den Bossche (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-3801?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16858552#comment-16858552 ] Joris Van den Bossche commented on ARROW-3801: -- I am not yet too familiar with the logics

[jira] [Updated] (ARROW-4350) [Python] nested numpy arrays cannot be converted to ListArray

2019-06-07 Thread Joris Van den Bossche (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-4350?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joris Van den Bossche updated ARROW-4350: - Summary: [Python] nested numpy arrays cannot be converted to ListArray (was:

[jira] [Updated] (ARROW-4350) [Python] nested numpy arrays cannot be converted to a list-of-list ListArray

2019-06-07 Thread Joris Van den Bossche (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-4350?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joris Van den Bossche updated ARROW-4350: - Summary: [Python] nested numpy arrays cannot be converted to a list-of-list

[jira] [Updated] (ARROW-4350) [Python] nested numpy arrays

2019-06-07 Thread Joris Van den Bossche (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-4350?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joris Van den Bossche updated ARROW-4350: - Description: Nested numpy arrays cannot be converted to a list-of-list type

[jira] [Commented] (ARROW-5480) [Python] Pandas categorical type doesn't survive a round-trip through parquet

2019-06-05 Thread Joris Van den Bossche (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-5480?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16856967#comment-16856967 ] Joris Van den Bossche commented on ARROW-5480: -- [~wesmckinn] I think this can be closed as

[jira] [Comment Edited] (ARROW-1989) [Python] Better UX on timestamp conversion to Pandas

2019-06-07 Thread Joris Van den Bossche (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-1989?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16858724#comment-16858724 ] Joris Van den Bossche edited comment on ARROW-1989 at 6/7/19 3:00 PM:

[jira] [Commented] (ARROW-1989) [Python] Better UX on timestamp conversion to Pandas

2019-06-07 Thread Joris Van den Bossche (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-1989?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16858736#comment-16858736 ] Joris Van den Bossche commented on ARROW-1989: -- The mention of

[jira] [Commented] (ARROW-2037) [Python]: Add tests for ARROW-1941 cases where pandas inferred type is 'empty'

2019-06-07 Thread Joris Van den Bossche (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-2037?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16858692#comment-16858692 ] Joris Van den Bossche commented on ARROW-2037: -- You get an "empty" inferred type if you have

[jira] [Comment Edited] (ARROW-2037) [Python]: Add tests for ARROW-1941 cases where pandas inferred type is 'empty'

2019-06-07 Thread Joris Van den Bossche (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-2037?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16858692#comment-16858692 ] Joris Van den Bossche edited comment on ARROW-2037 at 6/7/19 2:19 PM:

[jira] [Commented] (ARROW-2037) [Python]: Add tests for ARROW-1941 cases where pandas inferred type is 'empty'

2019-06-07 Thread Joris Van den Bossche (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-2037?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16858694#comment-16858694 ] Joris Van den Bossche commented on ARROW-2037: -- And that case is already tested here:

[jira] [Closed] (ARROW-2037) [Python]: Add tests for ARROW-1941 cases where pandas inferred type is 'empty'

2019-06-07 Thread Joris Van den Bossche (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-2037?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joris Van den Bossche closed ARROW-2037. Resolution: Invalid > [Python]: Add tests for ARROW-1941 cases where pandas

[jira] [Commented] (ARROW-2037) [Python]: Add tests for ARROW-1941 cases where pandas inferred type is 'empty'

2019-06-07 Thread Joris Van den Bossche (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-2037?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16858688#comment-16858688 ] Joris Van den Bossche commented on ARROW-2037: -- Not fully sure what is left to do here. The

[jira] [Commented] (ARROW-3801) [Python] Pandas-Arrow roundtrip makes pd categorical index not writeable

2019-06-07 Thread Joris Van den Bossche (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-3801?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16858756#comment-16858756 ] Joris Van den Bossche commented on ARROW-3801: -- In general, or only for this specific case?

[jira] [Updated] (ARROW-2037) [Python]: Add tests for ARROW-1941 cases where pandas inferred type is 'empty'

2019-06-07 Thread Joris Van den Bossche (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-2037?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joris Van den Bossche updated ARROW-2037: - Fix Version/s: (was: 0.14.0) > [Python]: Add tests for ARROW-1941 cases

[jira] [Commented] (ARROW-1989) [Python] Better UX on timestamp conversion to Pandas

2019-06-07 Thread Joris Van den Bossche (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-1989?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16858724#comment-16858724 ] Joris Van den Bossche commented on ARROW-1989: -- Looking into this. But, I can't find a

[jira] [Created] (ARROW-5436) [Python] expose filters argument in parquet.read_table

2019-05-29 Thread Joris Van den Bossche (JIRA)
Joris Van den Bossche created ARROW-5436: Summary: [Python] expose filters argument in parquet.read_table Key: ARROW-5436 URL: https://issues.apache.org/jira/browse/ARROW-5436 Project: Apache

[jira] [Updated] (ARROW-5419) [C++] CSV strings_can_be_null option doesn't respect all null_values

2019-05-27 Thread Joris Van den Bossche (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-5419?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joris Van den Bossche updated ARROW-5419: - Description: Relates to ARROW-5195 and 

[jira] [Updated] (ARROW-5419) [C++] CSV strings_can_be_null option doesn't respect all null_values

2019-05-27 Thread Joris Van den Bossche (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-5419?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joris Van den Bossche updated ARROW-5419: - Labels: csv (was: ) > [C++] CSV strings_can_be_null option doesn't respect all

[jira] [Updated] (ARROW-5420) [Java] Implement or remove getCurrentSizeInBytes in VariableWidthVector

2019-05-27 Thread Joris Van den Bossche (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-5420?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joris Van den Bossche updated ARROW-5420: - Summary: [Java] Implement or remove getCurrentSizeInBytes in VariableWidthVector

[jira] [Updated] (ARROW-5420) [Java] Implement or remove getCurrentSizeInBytes in VariableWidthVector

2019-05-27 Thread Joris Van den Bossche (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-5420?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joris Van den Bossche updated ARROW-5420: - Component/s: Java > [Java] Implement or remove getCurrentSizeInBytes in

[jira] [Updated] (ARROW-5419) [C++] CSV strings_can_be_null option doesn't respect all null_values

2019-05-27 Thread Joris Van den Bossche (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-5419?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joris Van den Bossche updated ARROW-5419: - Fix Version/s: 0.14.0 > [C++] CSV strings_can_be_null option doesn't respect all

[jira] [Commented] (ARROW-5419) [C++] CSV strings_can_be_null option doesn't respect all null_values

2019-05-27 Thread Joris Van den Bossche (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-5419?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16848741#comment-16848741 ] Joris Van den Bossche commented on ARROW-5419: -- As a sidenote: an "empty" field is the

[jira] [Updated] (ARROW-5349) [Python/C++] Provide a way to specify the file path in parquet ColumnChunkMetaData

2019-05-27 Thread Joris Van den Bossche (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-5349?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joris Van den Bossche updated ARROW-5349: - Attachment: test_pyspark_dataset.zip > [Python/C++] Provide a way to specify the

[jira] [Commented] (ARROW-5349) [Python/C++] Provide a way to specify the file path in parquet ColumnChunkMetaData

2019-05-27 Thread Joris Van den Bossche (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-5349?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16848753#comment-16848753 ] Joris Van den Bossche commented on ARROW-5349: -- Summary of the resolution:

[jira] [Closed] (ARROW-5424) [Doc] [Python] Add docs for JSON reader

2019-06-12 Thread Joris Van den Bossche (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-5424?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joris Van den Bossche closed ARROW-5424. Resolution: Duplicate > [Doc] [Python] Add docs for JSON reader >

[jira] [Updated] (ARROW-5562) pyarrow parquet writer does not handle negative zero correctly

2019-06-12 Thread Joris Van den Bossche (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-5562?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joris Van den Bossche updated ARROW-5562: - Labels: parquet (was: ) > pyarrow parquet writer does not handle negative zero

[jira] [Updated] (ARROW-5562) pyarrow parquet writer does not handle negative zero correctly

2019-06-12 Thread Joris Van den Bossche (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-5562?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joris Van den Bossche updated ARROW-5562: - Component/s: C++ > pyarrow parquet writer does not handle negative zero

[jira] [Commented] (ARROW-5568) [Python] Allow parsing more general JSON formats

2019-06-12 Thread Joris Van den Bossche (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-5568?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16861790#comment-16861790 ] Joris Van den Bossche commented on ARROW-5568: -- {quote}I have JSON data where the columnar

[jira] [Commented] (ARROW-5540) [Python] pa.lib.tzinfo_to_string(tz) throws ValueError: Unable to convert timezone `tzoffset(None, -14400)` to string

2019-06-12 Thread Joris Van den Bossche (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-5540?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16861965#comment-16861965 ] Joris Van den Bossche commented on ARROW-5540: -- [~Koojav] Thanks for the report. Going from

[jira] [Commented] (ARROW-5248) [Python] support dateutil timezones

2019-06-12 Thread Joris Van den Bossche (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-5248?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16861969#comment-16861969 ] Joris Van den Bossche commented on ARROW-5248: -- Another example of dateutil timezone was

[jira] [Updated] (ARROW-5562) [C++] parquet writer does not handle negative zero correctly

2019-06-12 Thread Joris Van den Bossche (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-5562?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joris Van den Bossche updated ARROW-5562: - Summary: [C++] parquet writer does not handle negative zero correctly (was:

[jira] [Created] (ARROW-5572) [Python] raise error message when passing invalid filter in parquet reading

2019-06-12 Thread Joris Van den Bossche (JIRA)
Joris Van den Bossche created ARROW-5572: Summary: [Python] raise error message when passing invalid filter in parquet reading Key: ARROW-5572 URL: https://issues.apache.org/jira/browse/ARROW-5572

[jira] [Updated] (ARROW-5618) [C++] [Parquet] Using deprecated Int96 storage for timestamps triggers integer overflow in some cases

2019-06-17 Thread Joris Van den Bossche (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-5618?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joris Van den Bossche updated ARROW-5618: - Labels: parquet (was: ) > [C++] [Parquet] Using deprecated Int96 storage for

[jira] [Commented] (ARROW-5208) [Python] Inconsistent resulting type during casting in pa.array() when mask is present

2019-06-17 Thread Joris Van den Bossche (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-5208?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16865558#comment-16865558 ] Joris Van den Bossche commented on ARROW-5208: -- [~ArtemK] still interested to take a look at

[jira] [Commented] (ARROW-5220) [Python] index / unknown columns in specified schema in Table.from_pandas

2019-06-17 Thread Joris Van den Bossche (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-5220?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16865966#comment-16865966 ] Joris Van den Bossche commented on ARROW-5220: -- [~wesmckinn] what do you think of the idea

[jira] [Assigned] (ARROW-4847) [Python] Add pyarrow.table factory function that dispatches to various ctors based on type of input

2019-06-18 Thread Joris Van den Bossche (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-4847?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joris Van den Bossche reassigned ARROW-4847: Assignee: Joris Van den Bossche (was: Wes McKinney) > [Python] Add

[jira] [Assigned] (ARROW-5309) [Python] Add clarifications to Python "append" methods that return new objects

2019-06-18 Thread Joris Van den Bossche (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-5309?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joris Van den Bossche reassigned ARROW-5309: Assignee: Joris Van den Bossche > [Python] Add clarifications to Python

[jira] [Assigned] (ARROW-4076) [Python] schema validation and filters

2019-06-18 Thread Joris Van den Bossche (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-4076?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joris Van den Bossche reassigned ARROW-4076: Assignee: Joris Van den Bossche > [Python] schema validation and filters

[jira] [Assigned] (ARROW-3686) [Python] Support for masked arrays in to/from numpy

2019-06-12 Thread Joris Van den Bossche (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-3686?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joris Van den Bossche reassigned ARROW-3686: Assignee: Joris Van den Bossche > [Python] Support for masked arrays in

[jira] [Closed] (ARROW-5540) [Python] pa.lib.tzinfo_to_string(tz) throws ValueError: Unable to convert timezone `tzoffset(None, -14400)` to string

2019-06-12 Thread Joris Van den Bossche (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-5540?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joris Van den Bossche closed ARROW-5540. Resolution: Duplicate > [Python] pa.lib.tzinfo_to_string(tz) throws ValueError:

[jira] [Commented] (ARROW-2298) [Python] Add option to not consider NaN to be null when converting to an integer Arrow type

2019-06-12 Thread Joris Van den Bossche (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-2298?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16862275#comment-16862275 ] Joris Van den Bossche commented on ARROW-2298: -- I am not sure I fully understand your

[jira] [Updated] (ARROW-5532) [JS] Field Metadata Not Read

2019-06-12 Thread Joris Van den Bossche (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-5532?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joris Van den Bossche updated ARROW-5532: - Component/s: JavaScript > [JS] Field Metadata Not Read >

[jira] [Updated] (ARROW-5532) [JS] Field Metadata Not Read

2019-06-12 Thread Joris Van den Bossche (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-5532?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joris Van den Bossche updated ARROW-5532: - Labels: Javas (was: ) > [JS] Field Metadata Not Read >

[jira] [Commented] (ARROW-5540) [Python] pa.lib.tzinfo_to_string(tz) throws ValueError: Unable to convert timezone `tzoffset(None, -14400)` to string

2019-06-12 Thread Joris Van den Bossche (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-5540?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16862079#comment-16862079 ] Joris Van den Bossche commented on ARROW-5540: -- Thanks for the follow-up. OK, since it is

[jira] [Commented] (ARROW-5220) [Python] index / unknown columns in specified schema in Table.from_pandas

2019-06-13 Thread Joris Van den Bossche (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-5220?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16863527#comment-16863527 ] Joris Van den Bossche commented on ARROW-5220: -- I can look into taking the index columns

[jira] [Created] (ARROW-5606) [Python] pandas.RangeIndex._start/_stop/_step are deprecated

2019-06-14 Thread Joris Van den Bossche (JIRA)
Joris Van den Bossche created ARROW-5606: Summary: [Python] pandas.RangeIndex._start/_stop/_step are deprecated Key: ARROW-5606 URL: https://issues.apache.org/jira/browse/ARROW-5606 Project:

[jira] [Created] (ARROW-5655) [Python] Table.from_pydict/from_arrays not using types in specified schema correctly

2019-06-19 Thread Joris Van den Bossche (JIRA)
Joris Van den Bossche created ARROW-5655: Summary: [Python] Table.from_pydict/from_arrays not using types in specified schema correctly Key: ARROW-5655 URL:

[jira] [Created] (ARROW-5654) [C++] ChunkedArray should validate the types of the arrays

2019-06-19 Thread Joris Van den Bossche (JIRA)
Joris Van den Bossche created ARROW-5654: Summary: [C++] ChunkedArray should validate the types of the arrays Key: ARROW-5654 URL: https://issues.apache.org/jira/browse/ARROW-5654 Project:

[jira] [Commented] (ARROW-5630) [Python] Table of nested arrays doesn't round trip

2019-06-19 Thread Joris Van den Bossche (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-5630?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16867997#comment-16867997 ] Joris Van den Bossche commented on ARROW-5630: -- Sure, I didn't yet look into it (and will

[jira] [Comment Edited] (ARROW-5630) [Python] Table of nested arrays doesn't round trip

2019-06-19 Thread Joris Van den Bossche (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-5630?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16867997#comment-16867997 ] Joris Van den Bossche edited comment on ARROW-5630 at 6/19/19 8:48 PM:

[jira] [Commented] (ARROW-5630) [Python] Table of nested arrays doesn't round trip

2019-06-19 Thread Joris Van den Bossche (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-5630?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16867988#comment-16867988 ] Joris Van den Bossche commented on ARROW-5630: -- Yes, with the default of nullable=True, I

[jira] [Commented] (ARROW-5630) [Python] Table of nested arrays doesn't round trip

2019-06-19 Thread Joris Van den Bossche (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-5630?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16867981#comment-16867981 ] Joris Van den Bossche commented on ARROW-5630: -- It is somehow related to the length of the

[jira] [Commented] (ARROW-2136) [Python] Non-nullable schema fields not checked in conversions from pandas

2019-06-21 Thread Joris Van den Bossche (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-2136?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16869274#comment-16869274 ] Joris Van den Bossche commented on ARROW-2136: -- You can also run into this when using

[jira] [Assigned] (ARROW-5668) [Python] Display "not null" in Schema.__repr__ for non-nullable fields

2019-06-21 Thread Joris Van den Bossche (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-5668?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joris Van den Bossche reassigned ARROW-5668: Assignee: Joris Van den Bossche > [Python] Display "not null" in

[jira] [Commented] (ARROW-2136) [Python] Non-nullable schema fields not checked in conversions from pandas

2019-06-21 Thread Joris Van den Bossche (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-2136?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16869281#comment-16869281 ] Joris Van den Bossche commented on ARROW-2136: -- For {{Table.from_pandas}}, in the end, the

[jira] [Updated] (ARROW-5666) [Python] Underscores in partition (string) values are dropped when reading dataset

2019-06-20 Thread Joris Van den Bossche (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-5666?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joris Van den Bossche updated ARROW-5666: - Labels: parquet (was: ) > [Python] Underscores in partition (string) values are

[jira] [Commented] (ARROW-5666) [Python] Underscores in partition (string) values are dropped when reading dataset

2019-06-20 Thread Joris Van den Bossche (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-5666?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16868583#comment-16868583 ] Joris Van den Bossche commented on ARROW-5666: -- Thanks for the report! The problem is that

[jira] [Commented] (ARROW-5665) ArrowInvalid on converting Pandas Series with dtype float64

2019-06-20 Thread Joris Van den Bossche (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-5665?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16868572#comment-16868572 ] Joris Van den Bossche commented on ARROW-5665: -- [~tnesztler] Can you try to provide a

[jira] [Updated] (ARROW-5665) [Python] ArrowInvalid on converting Pandas Series with dtype float64

2019-06-20 Thread Joris Van den Bossche (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-5665?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joris Van den Bossche updated ARROW-5665: - Summary: [Python] ArrowInvalid on converting Pandas Series with dtype float64

[jira] [Commented] (ARROW-3176) [Python] Overflow in Date32 column conversion to pandas

2019-06-24 Thread Joris Van den Bossche (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-3176?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16870886#comment-16870886 ] Joris Van den Bossche commented on ARROW-3176: -- I fixed the issue on the pandas side,

[jira] [Commented] (ARROW-5514) [C++] Printer for uint64 shows wrong values

2019-06-11 Thread Joris Van den Bossche (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-5514?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16861504#comment-16861504 ] Joris Van den Bossche commented on ARROW-5514: -- Sorry for the slow reply (and thanks for the

[jira] [Commented] (ARROW-2136) [Python] Non-nullable schema fields not checked in conversions from pandas

2019-06-11 Thread Joris Van den Bossche (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-2136?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16861474#comment-16861474 ] Joris Van den Bossche commented on ARROW-2136: -- I have a PR for ARROW-5169

[jira] [Commented] (ARROW-2572) [Python] Add factory function to create a Table from Columns and Schema.

2019-06-18 Thread Joris Van den Bossche (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-2572?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16866393#comment-16866393 ] Joris Van den Bossche commented on ARROW-2572: -- The {{Table.from_arrays}} nowadays clearly

  1   2   3   4   5   6   7   8   9   10   >