andygrove opened a new pull request #8978: URL: https://github.com/apache/arrow/pull/8978
DataFusion changes in this PR: - `ExecutionContext.write_parquet` now accepts an optional Parquet `WriterProperties` so that calls can choose Parquet writer options, such as which compression format to use. Benchmark changes in this PR: - The TPC-H `convert` subcommand now accepts a compression format when writing Parquet files and it defauilts to Snappy - A bug has been fixed which was causing a "file already exists" error now that we have multiple files and not just lineitem - Progress information is output to stdout along with timings for better UX ---------------------------------------------------------------- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. For queries about this service, please contact Infrastructure at: [email protected]
