[jira] [Updated] (ARROW-16240) [Python] Support row_group_size/chunk_size keyword in pq.write_to_dataset with use_legacy_dataset=False

Joris Van den Bossche (Jira) Thu, 21 Apr 2022 11:33:05 -0700


     [ 
https://issues.apache.org/jira/browse/ARROW-16240?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]


Joris Van den Bossche updated ARROW-16240:
------------------------------------------
    Summary: [Python] Support row_group_size/chunk_size keyword in 
pq.write_to_dataset with use_legacy_dataset=False  (was: [Python] 
_create_dataset_for_fragments() helper function needs to be updated)

> [Python] Support row_group_size/chunk_size keyword in pq.write_to_dataset 
> with use_legacy_dataset=False
> -------------------------------------------------------------------------------------------------------
>
>                 Key: ARROW-16240
>                 URL: https://issues.apache.org/jira/browse/ARROW-16240
>             Project: Apache Arrow
>          Issue Type: Improvement
>          Components: Python
>            Reporter: Alenka Frim
>            Priority: Major
>
> {{_create_dataset_for_fragments() }}helper function in test_dataset.py needs 
> to be updated to reflect the changes in the {{write_to_dataset}} in 
> ARROW-16122 : The default for {{use_legacy_dataset}} keyword will be set to 
> False but the {{dataset.write_dataset(..)}} doesn't support the parquet 
> {{row_group_size}} keyword.
> See discussion: 
> [https://github.com/apache/arrow/pull/12811#discussion_r845304218]



--
This message was sent by Atlassian Jira
(v8.20.7#820007)

[jira] [Updated] (ARROW-16240) [Python] Support row_group_size/chunk_size keyword in pq.write_to_dataset with use_legacy_dataset=False

Reply via email to