Improved encodings for arrays
-----------------------------

                 Key: AVRO-679
                 URL: https://issues.apache.org/jira/browse/AVRO-679
             Project: Avro
          Issue Type: New Feature
          Components: spec
            Reporter: Stu Hood
            Priority: Minor


There are better ways to encode arrays of varints [1] which are faster to 
decode, and more space efficient than encoding varints independently.

Extending the idea to other types of variable length data like 'bytes' and 
'string', you could encode the entries for an array block as an array of 
lengths, followed by contiguous byte/utf8 data.

[1] group varint encoding: slides 57-63 of 
http://static.googleusercontent.com/external_content/untrusted_dlcp/research.google.com/en/us/people/jeff/WSDM09-keynote.pdf

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.

Reply via email to