Liya Fan created ARROW-6307:
-------------------------------
Summary: [Java] Provide RLE vector
Key: ARROW-6307
URL: https://issues.apache.org/jira/browse/ARROW-6307
Project: Apache Arrow
Issue Type: New Feature
Components: Java
Reporter: Liya Fan
Assignee: Liya Fan
RLE (run length encoding) is a widely used encoding/decoding technique.
Compared with other encoding/decoding techniques, it is easier to work with the
encoded data.
We want to provide an RLE vector implementation in Arrow. The design details
include:
1. RleVector implements ValueVector.
2. the data structure of RleVector includes an inner vector, plus a repetition
buffer.
3. we do not provide random access over the RleVector
4. In the future, we will provide iterators to access the vector in sequence.
5. RleVector does not support update, but supports appending.
6. In the future, we will provide encoder/decoder to efficiently transform
encoded/decoded vectors.
--
This message was sent by Atlassian Jira
(v8.3.2#803003)