choi woo cheol created PARQUET-407:
--------------------------------------
Summary: Incorrect delta-encoding example
Key: PARQUET-407
URL: https://issues.apache.org/jira/browse/PARQUET-407
Project: Parquet
Issue Type: Improvement
Components: parquet-format
Reporter: choi woo cheol
Priority: Trivial
The minimum and the number of bits are incorrect at delta encoding Example 2 In
{{Encodings.md}}.
In the example,
{code}
Example 2
7, 5, 3, 1, 2, 3, 4, 5, the deltas would be
-2, -2, -2, 1, 1, 1, 1
The minimum is -2, so the relative deltas are:
0, 0, 0, 3, 3, 3, 3
The encoded data is
header: 8 (block size), 1 (miniblock count), 8 (value count), 7 (first value)
block 0 (minimum delta), 2 (bitwidth), 000000111111b (0,0,0,3,3,3 packed on 2
bits)
{code}
The minimum is -2 and the relative deltas are 0, 0, 0, 3, 3, 3, 3. So, this
should be corrected as below:
{code}
block -2 (minimum delta), 2 (bitwidth), 00000011111111b (0,0,0,3,3,3,3 packed
on 2 bits)
{code}
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)