[jira] [Created] (ARROW-10146) [Python] Parquet metadata to_dict raises attribute error

2020-09-30 Thread Florian Jetter (Jira)
Florian Jetter created ARROW-10146: -- Summary: [Python] Parquet metadata to_dict raises attribute error Key: ARROW-10146 URL: https://issues.apache.org/jira/browse/ARROW-10146 Project: Apache Arrow

[jira] [Updated] (ARROW-8142) [Python/C++] Casting empty table from after parquet roundtrip causes critical failure

2020-03-18 Thread Florian Jetter (Jira)
[ https://issues.apache.org/jira/browse/ARROW-8142?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Florian Jetter updated ARROW-8142: -- Description: When casting a schema of an empty table from dict encoded to non-dict encoded

[jira] [Created] (ARROW-8142) [Python/C++] Casting empty table from after parquet roundtrip causes critical failure

2020-03-18 Thread Florian Jetter (Jira)
Florian Jetter created ARROW-8142: - Summary: [Python/C++] Casting empty table from after parquet roundtrip causes critical failure Key: ARROW-8142 URL: https://issues.apache.org/jira/browse/ARROW-8142

[jira] [Commented] (ARROW-4176) [C++/Python] Human readable arrow schema comparison

2020-03-13 Thread Florian Jetter (Jira)
[ https://issues.apache.org/jira/browse/ARROW-4176?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17058527#comment-17058527 ] Florian Jetter commented on ARROW-4176: --- This came up in a PR regarding the str/repr of the schemas

[jira] [Commented] (ARROW-8057) Schema equality not roundtrip safe

2020-03-10 Thread Florian Jetter (Jira)
[ https://issues.apache.org/jira/browse/ARROW-8057?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17055804#comment-17055804 ] Florian Jetter commented on ARROW-8057: --- Investigating the fields explicitly shows that the

[jira] [Created] (ARROW-8057) Schema equality not roundtrip safe

2020-03-10 Thread Florian Jetter (Jira)
Florian Jetter created ARROW-8057: - Summary: Schema equality not roundtrip safe Key: ARROW-8057 URL: https://issues.apache.org/jira/browse/ARROW-8057 Project: Apache Arrow Issue Type: Bug

[jira] [Created] (ARROW-7732) [Python][C++] Parquet statistics wrong for pandas Categorical

2020-01-31 Thread Florian Jetter (Jira)
Florian Jetter created ARROW-7732: - Summary: [Python][C++] Parquet statistics wrong for pandas Categorical Key: ARROW-7732 URL: https://issues.apache.org/jira/browse/ARROW-7732 Project: Apache Arrow

[jira] [Commented] (ARROW-6339) [Python][C++] Rowgroup statistics for pd.NaT array ill defined

2019-08-24 Thread Florian Jetter (Jira)
[ https://issues.apache.org/jira/browse/ARROW-6339?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16914949#comment-16914949 ] Florian Jetter commented on ARROW-6339: --- The same is true for other null values, e.g. {code:python}

[jira] [Assigned] (ARROW-6339) [Python][C++] Rowgroup statistics for pd.NaT array ill defined

2019-08-24 Thread Florian Jetter (Jira)
[ https://issues.apache.org/jira/browse/ARROW-6339?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Florian Jetter reassigned ARROW-6339: - Assignee: (was: Florian Jetter) > [Python][C++] Rowgroup statistics for pd.NaT

[jira] [Assigned] (ARROW-6339) [Python][C++] Rowgroup statistics for pd.NaT array ill defined

2019-08-23 Thread Florian Jetter (Jira)
[ https://issues.apache.org/jira/browse/ARROW-6339?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Florian Jetter reassigned ARROW-6339: - Assignee: Florian Jetter > [Python][C++] Rowgroup statistics for pd.NaT array ill

[jira] [Created] (ARROW-6339) [Python][C++] Rowgroup statistics for pd.NaT array ill defined

2019-08-23 Thread Florian Jetter (Jira)
Florian Jetter created ARROW-6339: - Summary: [Python][C++] Rowgroup statistics for pd.NaT array ill defined Key: ARROW-6339 URL: https://issues.apache.org/jira/browse/ARROW-6339 Project: Apache Arrow

[jira] [Created] (ARROW-5889) [Python][C++] Parquet backwards compat for timestamps without timezone broken

2019-07-09 Thread Florian Jetter (JIRA)
Florian Jetter created ARROW-5889: - Summary: [Python][C++] Parquet backwards compat for timestamps without timezone broken Key: ARROW-5889 URL: https://issues.apache.org/jira/browse/ARROW-5889

[jira] [Created] (ARROW-5888) [Python][C++] Parquet write metadata not roundtrip safe for timezone timestamps

2019-07-09 Thread Florian Jetter (JIRA)
Florian Jetter created ARROW-5888: - Summary: [Python][C++] Parquet write metadata not roundtrip safe for timezone timestamps Key: ARROW-5888 URL: https://issues.apache.org/jira/browse/ARROW-5888

[jira] [Commented] (ARROW-5873) [Python][C++] Segmentation fault when comparing schema with None

2019-07-09 Thread Florian Jetter (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-5873?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16881274#comment-16881274 ] Florian Jetter commented on ARROW-5873: --- Turns out we've seen this prior to 0.14.0 as well but not

[jira] [Created] (ARROW-5878) [Python][C++] Parquet reader not forward compatible for timestamps without timezone

2019-07-08 Thread Florian Jetter (JIRA)
Florian Jetter created ARROW-5878: - Summary: [Python][C++] Parquet reader not forward compatible for timestamps without timezone Key: ARROW-5878 URL: https://issues.apache.org/jira/browse/ARROW-5878

[jira] [Updated] (ARROW-5873) [Python/C++] Segmentation fault when comparing schema with None

2019-07-08 Thread Florian Jetter (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-5873?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Florian Jetter updated ARROW-5873: -- Description: When comparing a schema with a Python {{None}} I get a segmentation fault. This

[jira] [Created] (ARROW-5873) [Python/C++] Segmentation fault when comparing schema with None

2019-07-08 Thread Florian Jetter (JIRA)
Florian Jetter created ARROW-5873: - Summary: [Python/C++] Segmentation fault when comparing schema with None Key: ARROW-5873 URL: https://issues.apache.org/jira/browse/ARROW-5873 Project: Apache

[jira] [Created] (ARROW-5138) [Python/C++] Row group retrieval doesn't restore index properly

2019-04-08 Thread Florian Jetter (JIRA)
Florian Jetter created ARROW-5138: - Summary: [Python/C++] Row group retrieval doesn't restore index properly Key: ARROW-5138 URL: https://issues.apache.org/jira/browse/ARROW-5138 Project: Apache

[jira] [Created] (ARROW-5104) [Python/C++] Schema for empty tables include index column as integer

2019-04-03 Thread Florian Jetter (JIRA)
Florian Jetter created ARROW-5104: - Summary: [Python/C++] Schema for empty tables include index column as integer Key: ARROW-5104 URL: https://issues.apache.org/jira/browse/ARROW-5104 Project: Apache

[jira] [Created] (ARROW-5089) [C++/Python] Writing dictionary encoded columns to parquet is extremely slow when using chunk size

2019-04-02 Thread Florian Jetter (JIRA)
Florian Jetter created ARROW-5089: - Summary: [C++/Python] Writing dictionary encoded columns to parquet is extremely slow when using chunk size Key: ARROW-5089 URL:

[jira] [Created] (ARROW-5085) [Python/C++] Conversion of dict encoded null column fails in parquet writing when using RowGroups

2019-04-01 Thread Florian Jetter (JIRA)
Florian Jetter created ARROW-5085: - Summary: [Python/C++] Conversion of dict encoded null column fails in parquet writing when using RowGroups Key: ARROW-5085 URL: https://issues.apache.org/jira/browse/ARROW-5085

[jira] [Created] (ARROW-4629) [Python] Pandas to arrow conversion slowed down by local imports

2019-02-19 Thread Florian Jetter (JIRA)
Florian Jetter created ARROW-4629: - Summary: [Python] Pandas to arrow conversion slowed down by local imports Key: ARROW-4629 URL: https://issues.apache.org/jira/browse/ARROW-4629 Project: Apache

[jira] [Created] (ARROW-4267) [Python/C++] Segfault when reading rowgroups with duplicated columns

2019-01-15 Thread Florian Jetter (JIRA)
Florian Jetter created ARROW-4267: - Summary: [Python/C++] Segfault when reading rowgroups with duplicated columns Key: ARROW-4267 URL: https://issues.apache.org/jira/browse/ARROW-4267 Project: Apache

[jira] [Created] (ARROW-4176) [C++/Python] Human readable arrow schema comparison

2019-01-07 Thread Florian Jetter (JIRA)
Florian Jetter created ARROW-4176: - Summary: [C++/Python] Human readable arrow schema comparison Key: ARROW-4176 URL: https://issues.apache.org/jira/browse/ARROW-4176 Project: Apache Arrow

[jira] [Created] (ARROW-3176) [Python] Overflow in Date32 column conversion to pandas

2018-09-05 Thread Florian Jetter (JIRA)
Florian Jetter created ARROW-3176: - Summary: [Python] Overflow in Date32 column conversion to pandas Key: ARROW-3176 URL: https://issues.apache.org/jira/browse/ARROW-3176 Project: Apache Arrow

[jira] [Created] (ARROW-2856) [Python/C++] Array constructor should not truncate floats when casting to int

2018-07-16 Thread Florian Jetter (JIRA)
Florian Jetter created ARROW-2856: - Summary: [Python/C++] Array constructor should not truncate floats when casting to int Key: ARROW-2856 URL: https://issues.apache.org/jira/browse/ARROW-2856

[jira] [Created] (ARROW-2719) [Python/C++] ArrowSchema not hashable

2018-06-18 Thread Florian Jetter (JIRA)
Florian Jetter created ARROW-2719: - Summary: [Python/C++] ArrowSchema not hashable Key: ARROW-2719 URL: https://issues.apache.org/jira/browse/ARROW-2719 Project: Apache Arrow Issue Type: Bug

[jira] [Created] (ARROW-2714) [C++/Python] Variable step size slicing for arrays

2018-06-15 Thread Florian Jetter (JIRA)
Florian Jetter created ARROW-2714: - Summary: [C++/Python] Variable step size slicing for arrays Key: ARROW-2714 URL: https://issues.apache.org/jira/browse/ARROW-2714 Project: Apache Arrow

[jira] [Created] (ARROW-2694) [Python] ArrayValue string conversion returns the representation instead of the converted python object string

2018-06-10 Thread Florian Jetter (JIRA)
Florian Jetter created ARROW-2694: - Summary: [Python] ArrayValue string conversion returns the representation instead of the converted python object string Key: ARROW-2694 URL:

[jira] [Created] (ARROW-2646) [Python] Pandas roundtrip for date objects

2018-05-30 Thread Florian Jetter (JIRA)
Florian Jetter created ARROW-2646: - Summary: [Python] Pandas roundtrip for date objects Key: ARROW-2646 URL: https://issues.apache.org/jira/browse/ARROW-2646 Project: Apache Arrow Issue

[jira] [Created] (ARROW-2603) [Python] from pandas raises ArrowInvalid for date(time) subclasses

2018-05-17 Thread Florian Jetter (JIRA)
Florian Jetter created ARROW-2603: - Summary: [Python] from pandas raises ArrowInvalid for date(time) subclasses Key: ARROW-2603 URL: https://issues.apache.org/jira/browse/ARROW-2603 Project: Apache

[jira] [Created] (ARROW-2510) [Python] Segmentation fault when converting empty column as categorical

2018-04-25 Thread Florian Jetter (JIRA)
Florian Jetter created ARROW-2510: - Summary: [Python] Segmentation fault when converting empty column as categorical Key: ARROW-2510 URL: https://issues.apache.org/jira/browse/ARROW-2510 Project:

[jira] [Created] (ARROW-2443) [Python] Conversion from pandas of empty categorical fails with ArrowInvalid

2018-04-10 Thread Florian Jetter (JIRA)
Florian Jetter created ARROW-2443: - Summary: [Python] Conversion from pandas of empty categorical fails with ArrowInvalid Key: ARROW-2443 URL: https://issues.apache.org/jira/browse/ARROW-2443

[jira] [Created] (ARROW-2240) [Python] Array initialization with leading numpy nan fails with exception

2018-03-01 Thread Florian Jetter (JIRA)
Florian Jetter created ARROW-2240: - Summary: [Python] Array initialization with leading numpy nan fails with exception Key: ARROW-2240 URL: https://issues.apache.org/jira/browse/ARROW-2240 Project:

[jira] [Commented] (ARROW-2194) [Python] Pandas columns metadata incorrect for empty string columns

2018-03-01 Thread Florian Jetter (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-2194?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16381739#comment-16381739 ] Florian Jetter commented on ARROW-2194: --- I haven't checked the master but on `0.8.0` all other

[jira] [Created] (ARROW-2194) Pandas columns metadata incorrect for empty string columns

2018-02-21 Thread Florian Jetter (JIRA)
Florian Jetter created ARROW-2194: - Summary: Pandas columns metadata incorrect for empty string columns Key: ARROW-2194 URL: https://issues.apache.org/jira/browse/ARROW-2194 Project: Apache Arrow

[jira] [Commented] (ARROW-1555) PyArrow write_to_dataset on s3

2017-09-19 Thread Florian Jetter (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-1555?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16172076#comment-16172076 ] Florian Jetter commented on ARROW-1555: --- [~wesmckinn] Yes, it seems like some abstract methods of

[jira] [Assigned] (ARROW-1555) PyArrow write_to_dataset on s3

2017-09-19 Thread Florian Jetter (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-1555?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Florian Jetter reassigned ARROW-1555: - Assignee: Florian Jetter > PyArrow write_to_dataset on s3 >

[jira] [Commented] (ARROW-1455) [Python] Add Dockerfile for validating Dask integration outside of usual CI

2017-09-03 Thread Florian Jetter (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-1455?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16151988#comment-16151988 ] Florian Jetter commented on ARROW-1455: --- Maybe we could still add it to the CI but allow failures:

[jira] [Commented] (ARROW-1456) [Python] Run s3fs unit tests in Travis CI

2017-09-03 Thread Florian Jetter (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-1456?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16151987#comment-16151987 ] Florian Jetter commented on ARROW-1456: --- Mocking s3 may already be enough for the arrow unit tests.

[jira] [Assigned] (ARROW-1417) [Python] Allow more generic filesystem objects to be passed to ParquetDataset

2017-09-03 Thread Florian Jetter (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-1417?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Florian Jetter reassigned ARROW-1417: - Assignee: Florian Jetter > [Python] Allow more generic filesystem objects to be passed

[jira] [Assigned] (ARROW-1413) [C++] Add include-what-you-use configuration

2017-08-28 Thread Florian Jetter (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-1413?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Florian Jetter reassigned ARROW-1413: - Assignee: Florian Jetter > [C++] Add include-what-you-use configuration >

[jira] [Created] (ARROW-1417) [Python] Allow more generic filesystem objects to be passed to ParquetDataset

2017-08-27 Thread Florian Jetter (JIRA)
Florian Jetter created ARROW-1417: - Summary: [Python] Allow more generic filesystem objects to be passed to ParquetDataset Key: ARROW-1417 URL: https://issues.apache.org/jira/browse/ARROW-1417

[jira] [Created] (ARROW-1328) [Python] pyarrow.Table.from_pandas option timestamps_to_ms changes column values

2017-08-03 Thread Florian Jetter (JIRA)
Florian Jetter created ARROW-1328: - Summary: [Python] pyarrow.Table.from_pandas option timestamps_to_ms changes column values Key: ARROW-1328 URL: https://issues.apache.org/jira/browse/ARROW-1328

[jira] [Assigned] (ARROW-439) [Python] Add option in "to_pandas" conversions to yield Categorical from String/Binary arrays

2017-07-24 Thread Florian Jetter (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-439?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Florian Jetter reassigned ARROW-439: Assignee: Florian Jetter > [Python] Add option in "to_pandas" conversions to yield