[jira] [Created] (ARROW-15716) [Dataset][Python] Parse a list of fragment paths to gather filters

2022-02-17 Thread Lance Dacey (Jira)
Lance Dacey created ARROW-15716: --- Summary: [Dataset][Python] Parse a list of fragment paths to gather filters Key: ARROW-15716 URL: https://issues.apache.org/jira/browse/ARROW-15716 Project: Apache Arro

[jira] [Created] (ARROW-15474) [Python] Possibility of a table.drop_duplicates() function?

2022-01-26 Thread Lance Dacey (Jira)
Lance Dacey created ARROW-15474: --- Summary: [Python] Possibility of a table.drop_duplicates() function? Key: ARROW-15474 URL: https://issues.apache.org/jira/browse/ARROW-15474 Project: Apache Arrow

[jira] [Created] (ARROW-12365) [Python] [Dataset] Add partition_filename_cb to ds.write_dataset()

2021-04-13 Thread Lance Dacey (Jira)
Lance Dacey created ARROW-12365: --- Summary: [Python] [Dataset] Add partition_filename_cb to ds.write_dataset() Key: ARROW-12365 URL: https://issues.apache.org/jira/browse/ARROW-12365 Project: Apache Arro

[jira] [Created] (ARROW-12364) [Python] [Dataset] Add metadata_collector option to ds.write_dataset()

2021-04-13 Thread Lance Dacey (Jira)
Lance Dacey created ARROW-12364: --- Summary: [Python] [Dataset] Add metadata_collector option to ds.write_dataset() Key: ARROW-12364 URL: https://issues.apache.org/jira/browse/ARROW-12364 Project: Apache

[jira] [Created] (ARROW-11453) [Python] [Dataset] Unable to use write_dataset() to Azure Blob with adlfs 0.6.0

2021-02-01 Thread Lance Dacey (Jira)
Lance Dacey created ARROW-11453: --- Summary: [Python] [Dataset] Unable to use write_dataset() to Azure Blob with adlfs 0.6.0 Key: ARROW-11453 URL: https://issues.apache.org/jira/browse/ARROW-11453 Project

[jira] [Created] (ARROW-11390) [Python] pyarrow 3.0 issues with turbodbc

2021-01-26 Thread Lance Dacey (Jira)
Lance Dacey created ARROW-11390: --- Summary: [Python] pyarrow 3.0 issues with turbodbc Key: ARROW-11390 URL: https://issues.apache.org/jira/browse/ARROW-11390 Project: Apache Arrow Issue Type: Bu

[jira] [Created] (ARROW-11250) [Python] Inconsistent behavior calling ds.dataset()

2021-01-14 Thread Lance Dacey (Jira)
Lance Dacey created ARROW-11250: --- Summary: [Python] Inconsistent behavior calling ds.dataset() Key: ARROW-11250 URL: https://issues.apache.org/jira/browse/ARROW-11250 Project: Apache Arrow Issu

[jira] [Created] (ARROW-10694) [Python] ds.write_dataset() generates empty files for each final partition

2020-11-23 Thread Lance Dacey (Jira)
Lance Dacey created ARROW-10694: --- Summary: [Python] ds.write_dataset() generates empty files for each final partition Key: ARROW-10694 URL: https://issues.apache.org/jira/browse/ARROW-10694 Project: Apa

[jira] [Created] (ARROW-10517) [Python] Unable to read/write Parquet datasets with fsspec on Azure Blob

2020-11-08 Thread Lance Dacey (Jira)
Lance Dacey created ARROW-10517: --- Summary: [Python] Unable to read/write Parquet datasets with fsspec on Azure Blob Key: ARROW-10517 URL: https://issues.apache.org/jira/browse/ARROW-10517 Project: Apach

[jira] [Created] (ARROW-9682) [Python] Unable to specify the partition style with pq.write_to_dataset

2020-08-10 Thread Lance Dacey (Jira)
Lance Dacey created ARROW-9682: -- Summary: [Python] Unable to specify the partition style with pq.write_to_dataset Key: ARROW-9682 URL: https://issues.apache.org/jira/browse/ARROW-9682 Project: Apache Arr

[jira] [Created] (ARROW-9514) The new Dataset API will not work with files on Azure Blob (pq.read_table() does work and so does Dask)

2020-07-17 Thread Lance Dacey (Jira)
Lance Dacey created ARROW-9514: -- Summary: The new Dataset API will not work with files on Azure Blob (pq.read_table() does work and so does Dask) Key: ARROW-9514 URL: https://issues.apache.org/jira/browse/ARROW-9514