[ https://issues.apache.org/jira/browse/AVRO-1259?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Doug Cutting updated AVRO-1259: ------------------------------- Attachment: AVRO-1259.patch Here's a patch that implements this. No new tests are required, since TestShredder already writes and reads various sequences of zero lengths. > improve Trevni encoding of sparse fields > ---------------------------------------- > > Key: AVRO-1259 > URL: https://issues.apache.org/jira/browse/AVRO-1259 > Project: Avro > Issue Type: Improvement > Components: trevni > Reporter: Doug Cutting > Attachments: AVRO-1259.patch > > > If in most records a field is null, Trevni writes a null byte (zero length) > for that record in that column. This might be optimized by instead using a > run-length encoding for lengths. The length is signed, so negative lengths > might be used to indicate the number of zero-lengths before the next non-zero > value. This could thus be back-compatible. -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators For more information on JIRA, see: http://www.atlassian.com/software/jira