[
https://issues.apache.org/jira/browse/AVRO-806?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13022255#comment-13022255
]
Douglas Creager commented on AVRO-806:
--------------------------------------
And we could handle the union index in the same way as we'd handle array and
map counts. Each branch of the union can be seen as a container that can only
have 0 or 1 elements. Since we'll already need some kind of repetition count
to keep track of how many elements are in an array type, we can use the same
mechanism to keep track of how many elements are in each union branch.
> add a column-major codec for data files
> ---------------------------------------
>
> Key: AVRO-806
> URL: https://issues.apache.org/jira/browse/AVRO-806
> Project: Avro
> Issue Type: New Feature
> Components: java, spec
> Reporter: Doug Cutting
>
> Define a codec that, when a data file's schema is a record schema, writes
> blocks within the file in column-major order. This would permit better
> compression and also permit efficient skipping of fields that are not of
> interest.
--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira