[
https://issues.apache.org/jira/browse/AVRO-806?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13058098#comment-13058098
]
Doug Cutting commented on AVRO-806:
-----------------------------------
> I think we should make unions columnar as well.
That would be nice, but I'd rather we have something useful sooner than
something perfect later. We can extend it later in a backward-compatible
manner. It would not be forward compatible, but that might be acceptable as
long as there's only a single implementation (Java).
> If we want to avoid decompressing columns that are not accessed [ ... ]
I think the advantage of a columnar format is to avoid touching data that's not
needed, and avoiding decompression is consistent with that.
> add a column-major codec for data files
> ---------------------------------------
>
> Key: AVRO-806
> URL: https://issues.apache.org/jira/browse/AVRO-806
> Project: Avro
> Issue Type: New Feature
> Components: java, spec
> Reporter: Doug Cutting
> Assignee: Doug Cutting
> Attachments: AVRO-806-v2.patch, AVRO-806.patch, avro-file-columnar.pdf
>
>
> Define a codec that, when a data file's schema is a record schema, writes
> blocks within the file in column-major order. This would permit better
> compression and also permit efficient skipping of fields that are not of
> interest.
--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira