zombee0 commented on code in PR #63:
URL: https://github.com/apache/parquet-format/pull/63#discussion_r1452022578
##########
src/main/thrift/parquet.thrift:
##########
@@ -595,6 +607,43 @@ union ColumnOrder {
1: TypeDefinedOrder TYPE_ORDER;
}
+struct PageLocation {
+/** Offset of the page in the file **/
+ 1: required i64 offset
+
+/** Size of the page, including header. The same as
PageHeader.compressed_page_size **/
+ 2: required i32 compressed_page_size
+
+/** Index within the RowGroup of the first row of the page **/
Review Comment:
@lekv @rdblue have we achieve this for nested column? i mean split on record
boundaries to ensure that no records are split across pages, thanks a lot!
--
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
To unsubscribe, e-mail: [email protected]
For queries about this service, please contact Infrastructure at:
[email protected]
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]