[jira] [Commented] (ARROW-15183) [Python][Docs] Add Missing Dataset Write Options

Will Jones (Jira) Wed, 22 Dec 2021 07:15:05 -0800


    [ 
https://issues.apache.org/jira/browse/ARROW-15183?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17463892#comment-17463892
 ]


Will Jones commented on ARROW-15183:
------------------------------------

As far as I know, those parameters don't often need to be changed from their 
defaults, but there may be some extreme cases where it's worth tuning them. If 
we know circumstances where we recommend setting those, we can write some 
guidance in the datasets guide.

If we end up getting a lot of questions about this, we might have more 
information about this. That's how we ended up writing the partitioning 
guidance for ARROW-15150.

> [Python][Docs] Add Missing Dataset Write Options 
> -------------------------------------------------
>
>                 Key: ARROW-15183
>                 URL: https://issues.apache.org/jira/browse/ARROW-15183
>             Project: Apache Arrow
>          Issue Type: Improvement
>          Components: Documentation, Python
>            Reporter: Vibhatha Lakmal Abeykoon
>            Assignee: Vibhatha Lakmal Abeykoon
>            Priority: Major
>
> Recently the write options `max_open_files`, `max_rows_per_file`, 
> `min_rows_per_group` and `max_rows_per_group` were included to the Python 
> bindings. But these are not documented here: 
> [https://arrow.apache.org/docs/python/dataset.html#writing-datasets.] 



--
This message was sent by Atlassian Jira
(v8.20.1#820001)

[jira] [Commented] (ARROW-15183) [Python][Docs] Add Missing Dataset Write Options

Reply via email to