corwinjoy commented on code in PR #242:
URL: https://github.com/apache/parquet-format/pull/242#discussion_r1609213158
##########
src/main/thrift/parquet.thrift:
##########
@@ -1165,6 +1317,62 @@ struct FileMetaData {
9: optional binary footer_signing_key_metadata
}
+/** Metadata for a column in this file. */
+struct FileColumnMetadataV3 {
+ /** All column chunks in this file (one per row group) **/
+ 1: required list<ColumnChunkV3> columns
Review Comment:
@emkornfield One additional point in favor of a set of bytes may be random
access for subsets of the metadata. One big problem with the current thrift
structure is that we don't really know the (variable) size of a column chunk
and have to thrift decode everything. A byte array with either a fixed size or
a length attribute would be one way to get random access to column chunks.
--
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
To unsubscribe, e-mail: [email protected]
For queries about this service, please contact Infrastructure at:
[email protected]
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]