xieqi commented on pull request #8229: URL: https://github.com/apache/arrow/pull/8229#issuecomment-697120706
@pitrou For Parquet writes, the end user still selects standard GZip as the compression codec. We add a `compression_plugin` API to the Parquet `WriterProperties::Builder`, and the end user can enable the plugin with the following snippet:

`parquet::WriterProperties::Builder builder;`
`builder.compression(parquet::Compression::GZIP);`
`builder.compression_plugin("libGzipPlugin.so");`

The writer then uses the plugin to compress the data and records a plugin hint in the ColumnMetaData's key_value_metadata.

For Parquet reads, the reader first checks whether the ColumnMetaData's key_value_metadata contains plugin information. If it does, the reader calls the plugin to decompress the data; otherwise it calls standard GZip. The plugin is therefore transparent to the end user on the read side.

----------------------------------------------------------------
This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at: us...@infra.apache.org
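
The read-side check described in the comment can be illustrated with a minimal, self-contained sketch. It does not use the real Parquet or Arrow types: `KeyValueMetadata` is a plain `std::map` stand-in, and the key name `compression_plugin` as well as the helpers `FindPluginHint` and `SelectDecompressor` are hypothetical, since the comment does not say which key the writer records or how the reader is structured.

```cpp
#include <map>
#include <string>

// Stand-in for a column chunk's key_value_metadata (assumption: not the real Parquet type).
using KeyValueMetadata = std::map<std::string, std::string>;

// Hypothetical key name; the PR comment does not state which key the writer uses.
const std::string kPluginKey = "compression_plugin";

// Returns the plugin library name recorded by the writer, or an empty string if none.
std::string FindPluginHint(const KeyValueMetadata& metadata) {
  auto it = metadata.find(kPluginKey);
  if (it == metadata.end()) {
    return std::string();
  }
  return it->second;
}

// Names the decompression path a reader would take for this column chunk:
// the plugin when a hint is present, standard GZip otherwise.
std::string SelectDecompressor(const KeyValueMetadata& metadata) {
  const std::string plugin = FindPluginHint(metadata);
  if (!plugin.empty()) {
    return "plugin:" + plugin;
  }
  return "standard-gzip";
}
```

Under these assumptions, a column chunk written without the plugin carries no hint and resolves to `standard-gzip`, while one written with `libGzipPlugin.so` resolves to `plugin:libGzipPlugin.so`. The caller never has to specify which path to use.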