xieqi commented on pull request #8229:
URL: https://github.com/apache/arrow/pull/8229#issuecomment-697120706


   @pitrou 
   For Parquet write, the end user still uses standard GZip as the 
compression codec. We add a compression_plugin API to the parquet 
WriterProperties Builder; the end user can enable the plugin with the 
following code snippet:
   `parquet::WriterProperties::Builder builder;`
   `builder.compression(parquet::Compression::GZIP);`
   `builder.compression_plugin("libGzipPlugin.so");`
   The writer will then use the plugin to compress the data and write a 
plugin hint into ColumnMetaData's key_value_metadata.
   
   For Parquet read, the reader first checks whether the Parquet 
ColumnMetaData's key_value_metadata contains plugin information. If such 
metadata is present, it calls the plugin to decompress the data; otherwise 
it calls standard GZip. So the plugin is transparent to the end user on the 
Parquet read side.
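
   To make the read-side check concrete, here is a rough sketch of the dispatch described above. It is not the code in this PR: it models key_value_metadata as a plain `std::map`, and the key name `compression_plugin` and the helpers `RecordPluginHint`, `FindPluginHint` and `ChooseDecompressPath` are hypothetical names used only for illustration.

```cpp
#include <map>
#include <string>

// Stand-in for ColumnMetaData's key_value_metadata (assumption: a simple
// string-to-string map is enough for this illustration).
using KeyValueMetadata = std::map<std::string, std::string>;

// Hypothetical metadata key; the actual key written by the PR is not
// stated in this comment.
static const char kPluginKey[] = "compression_plugin";

// Write side: record which plugin library compressed the column chunk.
void RecordPluginHint(KeyValueMetadata& metadata,
                      const std::string& plugin_library) {
  metadata[kPluginKey] = plugin_library;
}

// Read side: return a pointer to the plugin library name if a hint is
// present, or nullptr if the column chunk carries no plugin information.
const std::string* FindPluginHint(const KeyValueMetadata& metadata) {
  auto it = metadata.find(kPluginKey);
  return it == metadata.end() ? nullptr : &it->second;
}

enum class DecompressPath { kPlugin, kStandardGzip };

// Use the plugin only when the hint exists; otherwise fall back to the
// standard GZip codec, so files written without a plugin read as before.
DecompressPath ChooseDecompressPath(const KeyValueMetadata& metadata) {
  return FindPluginHint(metadata) != nullptr ? DecompressPath::kPlugin
                                             : DecompressPath::kStandardGzip;
}
```

   With this shape, a file written without the plugin has no hint and takes the standard GZip path, while a file written with `libGzipPlugin.so` carries the hint and takes the plugin path.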
   



