[ 
https://issues.apache.org/jira/browse/PARQUET-467?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Wes McKinney updated PARQUET-467:
---------------------------------
    Description: 
While RLE is the preferred encoding format (and BIT_PACKED is deprecated in 
Parquet 2.0), we will need to support this encoding format for legacy Parquet 
files that use it. As part of this JIRA we will verify round-tripping levels to 
this encoding format.

See also PARQUET-462

  was:This is implemented in Impala, but unclear how much data in the wild is 
encoded in this format (deprecating according to parquet-format) with RLE as 
the preferred encoding (for repetition/definition levels). At minimum we should 
raise an exception if this encoding is encoutnered.


> Implement and test BIT_PACKED level encoding / decoding 
> --------------------------------------------------------
>
>                 Key: PARQUET-467
>                 URL: https://issues.apache.org/jira/browse/PARQUET-467
>             Project: Parquet
>          Issue Type: Bug
>          Components: parquet-cpp
>            Reporter: Wes McKinney
>
> While RLE is the preferred encoding format (and BIT_PACKED is deprecated in 
> Parquet 2.0), we will need to support this encoding format for legacy Parquet 
> files that use it. As part of this JIRA we will verify round-tripping levels 
> to this encoding format.
> See also PARQUET-462



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

Reply via email to