[ https://issues.apache.org/jira/browse/PARQUET-2261?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17789310#comment-17789310 ]
ASF GitHub Bot commented on PARQUET-2261: ----------------------------------------- wgtmac commented on code in PR #1177: URL: https://github.com/apache/parquet-mr/pull/1177#discussion_r1403906914 ########## parquet-column/src/main/java/org/apache/parquet/internal/column/columnindex/ColumnIndex.java: ########## @@ -57,4 +57,16 @@ public interface ColumnIndex extends Visitor<PrimitiveIterator.OfInt> { */ public List<ByteBuffer> getMaxValues(); + /** + * @return the unmodifiable list of the repetition level histograms for each page concatenated together; used for + * converting to the related thrift object + */ + public List<Long> getRepetitionLevelHistogram(); Review Comment: Yes, I don't want to add outliers here ... > [Format] Add statistics that reflect decoded size to metadata > ------------------------------------------------------------- > > Key: PARQUET-2261 > URL: https://issues.apache.org/jira/browse/PARQUET-2261 > Project: Parquet > Issue Type: New Feature > Components: parquet-format > Reporter: Micah Kornfield > Assignee: Micah Kornfield > Priority: Major > Fix For: format-2.10.0 > > -- This message was sent by Atlassian Jira (v8.20.10#820010)