Claire McGinty created PARQUET-2350:
---------------------------------------

             Summary: Create Configuration key for enabling Byte Stream Split 
Encoding in ParquetWRiter
                 Key: PARQUET-2350
                 URL: https://issues.apache.org/jira/browse/PARQUET-2350
             Project: Parquet
          Issue Type: Improvement
            Reporter: Claire McGinty


All of the properties in 
[ParquetWriter|https://github.com/apache/parquet-mr/blob/master/parquet-hadoop/src/main/java/org/apache/parquet/hadoop/ParquetWriter.java]
 have an associated Configuration key (for example, 
[ParquetOutputFormat.DICTIONARY_PAGE_SIZE|https://github.com/apache/parquet-mr/blob/910bcc4edc2d707670e02e9ceadd98dacd9f08d2/parquet-hadoop/src/main/java/org/apache/parquet/hadoop/ParquetOutputFormat.java#L140]
 corresponds to ParquetWriter#withDictionaryPageSize), except for 
`ParquetWriter#withByteStreamSplitEncoding`.

 

Can we add a Configuration key for this? Happy to make a PR, given some input 
on naming convention (`parquet.encoding.bytestreamsplit.enabled` maybe?)

 

 

 



--
This message was sent by Atlassian Jira
(v8.20.10#820010)

Reply via email to