pitrou commented on code in PR #13333: URL: https://github.com/apache/arrow/pull/13333#discussion_r1048719263
########## docs/source/format/Columnar.rst: ########## @@ -765,6 +765,68 @@ application. We discuss dictionary encoding as it relates to serialization further below. +.. _run-length-encoded-layout: + +Run-Length Encoded Layout +------------------------- + +Run-Length is a data representation that represents data as sequences of the +same value, called runs. Each run is represented as a value, and an integer +describing how often this value is repeated. + +Any array can be run-length encoded. A run-length encoded array has no buffers +by itself, but has two child arrays. The first one holds a signed integer +called a "run end" for each run. The run ends array can hold either 16, 32, or +64-bit integers. The actual values of each run are held +the second child array. Review Comment: Ah, darn, these are run ends, not run lengths. Scratch that... -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: [email protected] For queries about this service, please contact Infrastructure at: [email protected]
