Jonathan Keane created ARROW-14464:
--------------------------------------
Summary: [R] Change write_parquet()'s default chunk_size from all rows
Key: ARROW-14464
URL: https://issues.apache.org/jira/browse/ARROW-14464
Project: Apache Arrow
Issue Type: Improvement
Components: R
Reporter: Jonathan Keane
Assignee: Jonathan Keane
Currently, when {{chunk_size}} is not given, {{write_parquet()}} defaults to a
single chunk containing all of the rows in the table. That is fine for small
tables, but for tables with many rows we want files that contain a reasonable
number of row groups rather than one very large one.
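To make the difference concrete, here is a rough sketch of how the number of row groups depends on {{chunk_size}}. It is a stand-alone model in Python, not Arrow's implementation: the function name {{row_group_sizes}} and the row counts are hypothetical, and it assumes each chunk is written as one row group.

```python
from typing import List, Optional


def row_group_sizes(n_rows: int, chunk_size: Optional[int] = None) -> List[int]:
    """Hypothetical model: sizes of the row groups written for n_rows rows.

    chunk_size=None mimics the default described above, where all rows
    go into a single chunk.
    """
    if n_rows <= 0:
        return []
    if chunk_size is None:
        # Default under discussion: one chunk holding every row.
        chunk_size = n_rows
    if chunk_size <= 0:
        raise ValueError("chunk_size must be a positive integer")
    full_chunks, remainder = divmod(n_rows, chunk_size)
    # Full chunks first, then one smaller trailing chunk if rows are left over.
    return [chunk_size] * full_chunks + ([remainder] if remainder else [])


# Illustrative numbers only: compare the default with an explicit chunk size.
print(len(row_group_sizes(10_000_000)))
print(len(row_group_sizes(10_000_000, chunk_size=1_000_000)))
```

With the default, the count stays at one no matter how large the table is. With an explicit {{chunk_size}}, it grows with the number of rows.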
It looks like this was added in https://github.com/apache/arrow/pull/5451 and
wasn't discussed there, so this default might not be intentional.