devinjdangelo opened a new issue, #7691:
URL: https://github.com/apache/arrow-datafusion/issues/7691

   ### Is your feature request related to a problem or challenge?
   
   While working on benchmarking for 
https://github.com/apache/arrow-datafusion/pull/7655 and starting to compare to 
serialize/write performance of other systems, I realized our current default 
parquet compression codec is Uncompressed. This is surprising as one almost 
always wants compression when using parquet.
   
   The default is currently inherited up from `arrow-rs`
   
   
https://github.com/apache/arrow-rs/blob/471f6dd2911d8328ca56efe2f685e08c0a3fb8c8/parquet/src/file/properties.rs#L33-L34
   
   ### Describe the solution you'd like
   
   Set a sensible default compression such as Zstd level 3.
   
   ### Describe alternatives you've considered
   
   The default of Uncompressed is coming from the `arrow-rs` crate. We could 
instead update the default there. 
   
   ### Additional context
   
   _No response_


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: [email protected]

For queries about this service, please contact Infrastructure at:
[email protected]

Reply via email to