[
https://issues.apache.org/jira/browse/PARQUET-1059?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Dapeng Sun updated PARQUET-1059:
--------------------------------
Description:
The IDs of Parquet Dictionary encoding is using
{{RunLengthBitPackingHybridEncoder}}.
RunLengthBitPackingHybridEncoder handles encoding with {{repeat}} and
{{bitpacking}}, we should improve it with the method likes
{{DeltaBinaryPackingWriter}}
was:
The IDs of Parquet Dictionary encoding is using
RunLengthBitPackingHybridEncoder.
RunLengthBitPackingHybridEncoder handle encoding with {{repeat}} and
{{bitpacking}}, we should improve it with the method likes
{{DeltaBinaryPackingWriter}}
> Improve the RLE encoding for Parquet Dictionary IDs
> ---------------------------------------------------
>
> Key: PARQUET-1059
> URL: https://issues.apache.org/jira/browse/PARQUET-1059
> Project: Parquet
> Issue Type: Improvement
> Reporter: Dapeng Sun
>
> The IDs of Parquet Dictionary encoding is using
> {{RunLengthBitPackingHybridEncoder}}.
> RunLengthBitPackingHybridEncoder handles encoding with {{repeat}} and
> {{bitpacking}}, we should improve it with the method likes
> {{DeltaBinaryPackingWriter}}
--
This message was sent by Atlassian JIRA
(v6.4.14#64029)