[ 
https://issues.apache.org/jira/browse/PARQUET-2176?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

patchwork01 updated PARQUET-2176:
---------------------------------
    Description: 
ParquetWriter does not expose any way to set the properties for column index or 
statistics truncation.

With ParquetOutputFormat those can be set with 
parquet.columnindex.truncate.length and parquet.statistics.truncate.length. 
These are not applied for ParquetWriter.

These properties are documented here: 
[https://github.com/apache/parquet-mr/blob/master/parquet-hadoop/README.md]

  was:
ParquetWriter does not expose any way to set the properties for column index or 
statistics truncation.

With ParquetOutputFormat those can be set with 
parquet.columnindex.truncate.length and parquet.statistics.truncate.length. 
These are not applied for ParquetWriter.

See also: 
https://github.com/apache/parquet-mr/blob/master/parquet-hadoop/README.md


> Parquet writers should allow for configurable index/statistics truncation
> -------------------------------------------------------------------------
>
>                 Key: PARQUET-2176
>                 URL: https://issues.apache.org/jira/browse/PARQUET-2176
>             Project: Parquet
>          Issue Type: Improvement
>          Components: parquet-mr
>    Affects Versions: 1.12.3
>            Reporter: patchwork01
>            Priority: Major
>
> ParquetWriter does not expose any way to set the properties for column index 
> or statistics truncation.
> With ParquetOutputFormat those can be set with 
> parquet.columnindex.truncate.length and parquet.statistics.truncate.length. 
> These are not applied for ParquetWriter.
> These properties are documented here: 
> [https://github.com/apache/parquet-mr/blob/master/parquet-hadoop/README.md]



--
This message was sent by Atlassian Jira
(v8.20.10#820010)

Reply via email to