jorgecarleitao commented on a change in pull request #170:
URL: https://github.com/apache/parquet-format/pull/170#discussion_r606336848



##########
File path: rle-bitpacked.md
##########
@@ -0,0 +1,78 @@
+# RLE-Bitpacked hybrid encoder
+
+The RLE-Bitpacked hybrid encoder is a parquet-specific encoder that combines 
two well known encoding strategies,
+[RLE](https://en.wikipedia.org/wiki/Run-length_encoding) and bitpacking. Note 
that "combine" here means this encoder allows both encodings within the same 
stream, and, during encoding, it can switch between them.
+
+This encoder is only used to encode integer values that may either represent 
definition levels, representation levels or ids of dictionary-encoded pages. 
Note that this encoder supports integers that can be represented in less than 8 
bits.

Review comment:
       I admit it is the first time I am going through the parquet format, and 
I am hopping for some help here.
   
   I based this on `Encoding.md`:
   
   ```
   Note that the RLE encoding method is only supported for the following types 
of data:
   
       Repetition and definition levels
       Dictionary indices
       Boolean values in data pages, as an alternative to PLAIN encoding
   ```
   which I can see I forgot the boolean data. Maybe this requires updating the 
`Encoding.md` also?




-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
[email protected]


Reply via email to