[ 
https://issues.apache.org/jira/browse/PARQUET-1059?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Dapeng Sun updated PARQUET-1059:
--------------------------------
    Description: 
The IDs in Parquet dictionary encoding are written with 
{{RunLengthBitPackingHybridEncoder}}.
{{RunLengthBitPackingHybridEncoder}} encodes values with {{repeat}} and 
{{bitpacking}}; we should improve it with a method like 
{{DeltaBinaryPackingWriter}}.

  was:
The IDs of Parquet Dictionary encoding is using 
RunLengthBitPackingHybridEncoder.
RunLengthBitPackingHybridEncoder handle encoding with {{repeat}} and 
{{bitpacking}}, we should improve it with the method likes 
{{DeltaBinaryPackingWriter}}


> Improve the RLE encoding for Parquet Dictionary IDs
> ---------------------------------------------------
>
>                 Key: PARQUET-1059
>                 URL: https://issues.apache.org/jira/browse/PARQUET-1059
>             Project: Parquet
>          Issue Type: Improvement
>            Reporter: Dapeng Sun
>
> The IDs in Parquet dictionary encoding are written with 
> {{RunLengthBitPackingHybridEncoder}}.
> {{RunLengthBitPackingHybridEncoder}} encodes values with {{repeat}} and 
> {{bitpacking}}; we should improve it with a method like 
> {{DeltaBinaryPackingWriter}}.
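
To illustrate the kind of case this proposal seems to target, here is a minimal Python sketch. It is not the parquet-mr implementation: the helper names (bit_width, rle_runs, deltas, delta_pack_width) are hypothetical, and it leaves out the block and miniblock layout of the real delta format. It only compares how many bits per value a plain bit-packed run would need against a delta-style packing of the same dictionary IDs.

```python
from typing import List, Tuple


def bit_width(max_value: int) -> int:
    """Bits needed to represent any value in 0..max_value."""
    return max_value.bit_length()


def rle_runs(ids: List[int]) -> List[Tuple[int, int]]:
    """Collapse consecutive equal IDs into (value, run_length) pairs."""
    runs: List[List[int]] = []
    for v in ids:
        if runs and runs[-1][0] == v:
            runs[-1][1] += 1
        else:
            runs.append([v, 1])
    return [(v, n) for v, n in runs]


def deltas(ids: List[int]) -> List[int]:
    """Differences between consecutive IDs."""
    return [b - a for a, b in zip(ids, ids[1:])]


def delta_pack_width(ids: List[int]) -> int:
    """Bits per value after subtracting the minimum delta (simplified)."""
    d = deltas(ids)
    if not d:
        return 0
    return bit_width(max(d) - min(d))


# Hypothetical dictionary IDs that increase steadily and never repeat.
ids = list(range(100, 108))
print(len(rle_runs(ids)))     # number of runs found by the repeat path
print(bit_width(max(ids)))    # bits per value if bit-packed directly
print(delta_pack_width(ids))  # bits per value for the packed deltas
```

For steadily increasing IDs like these, the repeat path finds no runs longer than one and direct bit-packing pays the full width of the largest ID, whereas the deltas are constant and pack into almost nothing. Whether real dictionary ID streams look like this often enough to justify the change is an assumption that would need measuring.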



--
This message was sent by Atlassian JIRA
(v6.4.14#64029)
