emkornfield commented on code in PR #252:
URL: https://github.com/apache/parquet-format/pull/252#discussion_r1622698456


##########
src/main/thrift/parquet.thrift:
##########
@@ -243,42 +243,39 @@ struct SizeStatistics {
  */
 struct Statistics {
    /**
-    * DEPRECATED: min and max value of the column. Use min_value and max_value.
+    * max/min: deprecated, used to encoded signed orderable values, ignoring
+    *   the columns ColumnOrder
+    * max_value/min_value: PLAIN encoded values, sans length prefix if varlen
+    * max8/min8: up to 8-bytes:
+    *   FLOAT, DOUBLE: bitcasted to INT32 and INT64, respectively

Review Comment:
   I think we might want to be more specific here about values less then 8 
bytes are translated into 8 bytes.  In practice it doesn't make a difference 
for readers but it would be good to limit ambiguity.  I assume we do a normal 
cast from 1/4 integer byte values to 8 bytes values rather then just embedding 
them?



-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: [email protected]

For queries about this service, please contact Infrastructure at:
[email protected]


---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

Reply via email to