[
https://issues.apache.org/jira/browse/ARROW-15183?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17463892#comment-17463892
]
Will Jones commented on ARROW-15183:
------------------------------------
As far as I know, those parameters don't often need to be changed from their
defaults, but there may be some extreme cases where it's worth tuning them. If
we know circumstances where we recommend setting those, we can write some
guidance in the datasets guide.
If we end up getting a lot of questions about this, we might have more
information about this. That's how we ended up writing the partitioning
guidance for ARROW-15150.
> [Python][Docs] Add Missing Dataset Write Options
> -------------------------------------------------
>
> Key: ARROW-15183
> URL: https://issues.apache.org/jira/browse/ARROW-15183
> Project: Apache Arrow
> Issue Type: Improvement
> Components: Documentation, Python
> Reporter: Vibhatha Lakmal Abeykoon
> Assignee: Vibhatha Lakmal Abeykoon
> Priority: Major
>
> Recently the write options `max_open_files`, `max_rows_per_file`,
> `min_rows_per_group` and `max_rows_per_group` were included to the Python
> bindings. But these are not documented here:
> [https://arrow.apache.org/docs/python/dataset.html#writing-datasets.]
--
This message was sent by Atlassian Jira
(v8.20.1#820001)