[ 
https://issues.apache.org/jira/browse/PARQUET-1059?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16089519#comment-16089519
 ] 

Dapeng Sun commented on PARQUET-1059:
-------------------------------------

Hi [~wesmckinn], thank you for your comments. How about creating a new write 
version, such as {{PARQUET_3_0}} or {{PARQUET_2_1}}? I think this 
optimization would be easy to put into a new WRITE_VERSION.

> Improve the RLE encoding for Parquet Dictionary IDs
> ---------------------------------------------------
>
>                 Key: PARQUET-1059
>                 URL: https://issues.apache.org/jira/browse/PARQUET-1059
>             Project: Parquet
>          Issue Type: Improvement
>            Reporter: Dapeng Sun
>
> The IDs of Parquet Dictionary encoding are written using 
> {{RunLengthBitPackingHybridEncoder}}.
> RunLengthBitPackingHybridEncoder handles encoding with {{repeat}} and 
> {{bitpacking}}; we should improve it with a method like 
> {{DeltaBinaryPackingWriter}}.
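
As a rough illustration of the idea in the description above, the sketch below compares the per-value bit width that plain bit-packing would need for a run of dictionary IDs with the width needed after delta encoding. It is a simplified model, not the parquet-mr implementation: the function names are hypothetical, and the delta variant only mimics the "subtract the minimum delta, then bit-pack" step, ignoring headers, blocks, and miniblocks.

```python
from typing import List


def bit_width(value: int) -> int:
    """Number of bits needed to store a non-negative integer."""
    return value.bit_length()


def bitpacked_width(ids: List[int]) -> int:
    """Bits per value when the raw IDs are bit-packed (hypothetical helper)."""
    return bit_width(max(ids))


def delta_width(ids: List[int]) -> int:
    """Bits per value when consecutive differences are packed instead.

    Assumption: deltas are stored relative to the smallest delta, so
    every packed value is non-negative.
    """
    deltas = [b - a for a, b in zip(ids, ids[1:])]
    if not deltas:
        return 0
    min_delta = min(deltas)
    return bit_width(max(d - min_delta for d in deltas))


if __name__ == "__main__":
    increasing = list(range(100, 108))  # IDs that grow by a fixed step
    scattered = [5, 900, 3, 512]        # IDs with no ordering
    print(bitpacked_width(increasing), delta_width(increasing))
    print(bitpacked_width(scattered), delta_width(scattered))
```

Under these assumptions, steadily increasing IDs need far fewer bits per value after delta encoding, while scattered IDs can need more, so any gain would depend on how the dictionary IDs are distributed.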



--
This message was sent by Atlassian JIRA
(v6.4.14#64029)
