[GitHub] [arrow] jonkeane commented on a change in pull request #11681: MINOR: [R][Doc] Update phrasing of docs for chunk_size argument to better reflect what it means

GitBox Fri, 12 Nov 2021 08:20:43 -0800


jonkeane commented on a change in pull request #11681:
URL: https://github.com/apache/arrow/pull/11681#discussion_r748414843




##########
File path: r/R/parquet.R
##########
@@ -82,7 +82,7 @@ read_parquet <- function(file,
 #' @param x `data.frame`, [RecordBatch], or [Table]
 #' @param sink A string file path, URI, or [OutputStream], or path in a file
 #' system (`SubTreeFileSystem`)
-#' @param chunk_size chunk size in number of rows. If NULL, the total number of rows is used.
+#' @param chunk_size how many rows of data to write to disk at once. If NULL, the total number of rows is used.

Review comment:
```suggestion
#' @param chunk_size how many rows of data to write to disk at once. This directly corresponds to how many rows will be in each row group in parquet. If NULL, a best guess will be made for optimal size (based on the number of columns and number of rows), or the total number of rows is used if the data has fewer than 250 million cells (rows x cols).
```
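
To illustrate the row-group correspondence described in the suggestion, here is a minimal sketch. It assumes the `arrow` R package is installed; the data frame, temporary file, and `chunk_size = 4` are made up for illustration and are not part of this PR.

```r
library(arrow)

# Hypothetical 10-row data frame, used only for this illustration.
df <- data.frame(x = 1:10, y = letters[1:10])
tf <- tempfile(fileext = ".parquet")

# Ask for 4 rows to be written at a time.
write_parquet(df, tf, chunk_size = 4)

# Inspect the file metadata: how many row groups were written?
reader <- ParquetFileReader$create(tf)
n_groups <- reader$num_row_groups
print(n_groups)
```

With 10 rows and `chunk_size = 4`, the file should contain row groups of 4, 4, and 2 rows.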



