Jonathan Keane created ARROW-14464:
--------------------------------------

             Summary: [R] Change write_parquet()'s default chunk_size from all 
rows
                 Key: ARROW-14464
                 URL: https://issues.apache.org/jira/browse/ARROW-14464
             Project: Apache Arrow
          Issue Type: Improvement
          Components: R
            Reporter: Jonathan Keane
            Assignee: Jonathan Keane


Currently, when {{chunk_size}} is not given in {{write_parquet()}} it defaults 
to 1 chunk of all of the rows in the table. This could be fine for smallish 
numbers of rows, but when we have lots of rows, we want files that contain a 
decent number of row groups.

It looks like this was added in https://github.com/apache/arrow/pull/5451 and 
wasn't discussed there, so this default might not be intentional.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

Reply via email to