This is an automated email from the ASF dual-hosted git repository.
kou pushed a commit to branch main
in repository https://gitbox.apache.org/repos/asf/arrow.git
The following commit(s) were added to refs/heads/main by this push:
new 1f8cec331e MINOR: [C++][Parquet][Docs] Increase chunk_size in docs
(#40705)
1f8cec331e is described below
commit 1f8cec331ecf757ac58d41e8915db0860f1de29d
Author: Shadab Zafar <[email protected]>
AuthorDate: Wed Nov 19 06:31:57 2025 +0530
MINOR: [C++][Parquet][Docs] Increase chunk_size in docs (#40705)
### Rationale for this change
Is there a reason we're using a low value here. All other examples use
`128*1024` or `64*1024`.
I was stumped by this as I used it without really reading about the
parameter and spent a day figuring out why my parquet writes were so slow.
### What changes are included in this PR?
Increase `chunk_size` to `64*1024`
### Are these changes tested?
Yes.
### Are there any user-facing changes?
Yes.
Authored-by: Shadab Zafar <[email protected]>
Signed-off-by: Sutou Kouhei <[email protected]>
---
cpp/examples/arrow/parquet_read_write.cc | 2 +-
1 file changed, 1 insertion(+), 1 deletion(-)
diff --git a/cpp/examples/arrow/parquet_read_write.cc
b/cpp/examples/arrow/parquet_read_write.cc
index 2465018966..b0765e1da6 100644
--- a/cpp/examples/arrow/parquet_read_write.cc
+++ b/cpp/examples/arrow/parquet_read_write.cc
@@ -120,7 +120,7 @@ arrow::Status WriteFullFile(std::string path_to_file) {
ARROW_RETURN_NOT_OK(parquet::arrow::WriteTable(*table.get(),
arrow::default_memory_pool(),
outfile,
- /*chunk_size=*/3, props,
arrow_props));
+ /*chunk_size=*/64*1024,
props, arrow_props));
return arrow::Status::OK();
}