wjones127 commented on code in PR #189: URL: https://github.com/apache/parquet-format/pull/189#discussion_r1081899568
########## Encodings.md: ########## @@ -280,16 +280,19 @@ concatenated back to back. The expected savings is from the cost of encoding the and possibly better compression in the data (it is no longer interleaved with the lengths). The data stream looks like: - +``` <Delta Encoded Lengths> <Byte Array Data> +``` -For example, if the data was "Hello", "World", "Foobar", "ABCDEF": +For example, if the data was "Hello", "World", "Foobar", "ABCDEF" -The encoded data would be DeltaEncoding(5, 5, 6, 6) "HelloWorldFoobarABCDEF" +The encoded data would be comprised of the following segments: Review Comment: ```suggestion then the encoded data would be comprised of the following segments: ``` -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: dev-unsubscr...@parquet.apache.org For queries about this service, please contact Infrastructure at: us...@infra.apache.org