This is an automated email from the ASF dual-hosted git repository.

kou pushed a commit to branch main
in repository https://gitbox.apache.org/repos/asf/arrow.git


The following commit(s) were added to refs/heads/main by this push:
     new 1f8cec331e MINOR: [C++][Parquet][Docs] Increase chunk_size in docs 
(#40705)
1f8cec331e is described below

commit 1f8cec331ecf757ac58d41e8915db0860f1de29d
Author: Shadab Zafar <[email protected]>
AuthorDate: Wed Nov 19 06:31:57 2025 +0530

    MINOR: [C++][Parquet][Docs] Increase chunk_size in docs (#40705)
    
    ### Rationale for this change
    
    Is there a reason we're using a low value here. All other examples use 
`128*1024` or `64*1024`.
    
    I was stumped by this as I used it without really reading about the 
parameter and spent a day figuring out why my parquet writes were so slow.
    
    ### What changes are included in this PR?
    
    Increase `chunk_size` to `64*1024`
    
    ### Are these changes tested?
    
    Yes.
    
    ### Are there any user-facing changes?
    
    Yes.
    
    Authored-by: Shadab Zafar <[email protected]>
    Signed-off-by: Sutou Kouhei <[email protected]>
---
 cpp/examples/arrow/parquet_read_write.cc | 2 +-
 1 file changed, 1 insertion(+), 1 deletion(-)

diff --git a/cpp/examples/arrow/parquet_read_write.cc 
b/cpp/examples/arrow/parquet_read_write.cc
index 2465018966..b0765e1da6 100644
--- a/cpp/examples/arrow/parquet_read_write.cc
+++ b/cpp/examples/arrow/parquet_read_write.cc
@@ -120,7 +120,7 @@ arrow::Status WriteFullFile(std::string path_to_file) {
 
   ARROW_RETURN_NOT_OK(parquet::arrow::WriteTable(*table.get(),
                                                  arrow::default_memory_pool(), 
outfile,
-                                                 /*chunk_size=*/3, props, 
arrow_props));
+                                                 /*chunk_size=*/64*1024, 
props, arrow_props));
   return arrow::Status::OK();
 }
 

Reply via email to