Liya Fan created ARROW-6307: ------------------------------- Summary: [Java] Provide RLE vector Key: ARROW-6307 URL: https://issues.apache.org/jira/browse/ARROW-6307 Project: Apache Arrow Issue Type: New Feature Components: Java Reporter: Liya Fan Assignee: Liya Fan
RLE (run length encoding) is a widely used encoding/decoding technique. Compared with other encoding/decoding techniques, it is easier to work with the encoded data. We want to provide an RLE vector implementation in Arrow. The design details include: 1. RleVector implements ValueVector. 2. the data structure of RleVector includes an inner vector, plus a repetition buffer. 3. we do not provide random access over the RleVector 4. In the future, we will provide iterators to access the vector in sequence. 5. RleVector does not support update, but supports appending. 6. In the future, we will provide encoder/decoder to efficiently transform encoded/decoded vectors. -- This message was sent by Atlassian Jira (v8.3.2#803003)