raunaqmorarka commented on code in PR #216:
URL: https://github.com/apache/parquet-format/pull/216#discussion_r1333328796


##########
src/main/thrift/parquet.thrift:
##########
@@ -216,7 +216,12 @@ struct Statistics {
    /** count of distinct values occurring */
    4: optional i64 distinct_count;
    /**
-    * Min and max values for the column, determined by its ColumnOrder.
+    * lower and upper bound values for the column, determined by its 
ColumnOrder.

Review Comment:
   Given the feature in https://issues.apache.org/jira/browse/PARQUET-1685, I 
want to assume that all existing stats are truncated. Going forward we should 
have a flag to explicitly indicate whether or not truncation took place and 
applications should perform aggregation pushdown only if that flag is found to 
indicate no truncation. But I think adding that flag can be tackled separately 
as a follow-up.



-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: dev-unsubscr...@parquet.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org

Reply via email to