corwinjoy commented on code in PR #242:
URL: https://github.com/apache/parquet-format/pull/242#discussion_r1604031528
##########
src/main/thrift/parquet.thrift:
##########
@@ -1165,6 +1317,62 @@ struct FileMetaData {
9: optional binary footer_signing_key_metadata
}
+/** Metadata for a column in this file. */
+struct FileColumnMetadataV3 {
+ /** All column chunks in this file (one per row group) **/
+ 1: required list<ColumnChunkV3> columns
Review Comment:
@emkornfield Thanks for that comment emkornfield! This is the bottleneck we
are struggling with, as per previous PR, is getting fast access random rowgroup
reads. Having the columns (and rowgroups below) stored in a fashion that allows
random access would be a huge improvement here!
--
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
To unsubscribe, e-mail: [email protected]
For queries about this service, please contact Infrastructure at:
[email protected]
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]