[
https://issues.apache.org/jira/browse/PARQUET-1059?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16089519#comment-16089519
]
Dapeng Sun commented on PARQUET-1059:
-
Hi [~wesmckinn], thank you for your comments, how about create a new write
version, such as {{PARQUET_3_0}} or {{PARQUET_2_1}} , I think this
optimization would be easy put into a new WRITE_VERSION.
> Improve the RLE encoding for Parquet Dictionary IDs
> ---
>
> Key: PARQUET-1059
> URL: https://issues.apache.org/jira/browse/PARQUET-1059
> Project: Parquet
> Issue Type: Improvement
>Reporter: Dapeng Sun
>
> The IDs of Parquet Dictionary encoding is using
> {{RunLengthBitPackingHybridEncoder}}.
> RunLengthBitPackingHybridEncoder handles encoding with {{repeat}} and
> {{bitpacking}}, we should improve it with the method likes
> {{DeltaBinaryPackingWriter}}
--
This message was sent by Atlassian JIRA
(v6.4.14#64029)