szehon-ho commented on code in PR #4945:
URL: https://github.com/apache/iceberg/pull/4945#discussion_r893980855


##########
format/spec.md:
##########
@@ -496,6 +496,7 @@ A snapshot consists of the following fields:
 | _optional_ |            | **`manifests`**          | A list of manifest file 
locations. Must be omitted if `manifest-list` is present |
 | _optional_ | _required_ | **`summary`**            | A string map that 
summarizes the snapshot changes, including `operation` (see below) |
 | _optional_ | _optional_ | **`schema-id`**          | ID of the table's 
current schema when the snapshot was created |
+|            | _optional_ | **`statistics`**         | A list of [statistics 
files' metadata](#statistics-file). The field should be retained by writers, 
unless writer updates the statistics, or knows they became obsolete. |

Review Comment:
   Trying to understand this (sorry if missed previous discussion), does a 
puffin file or list of puffin files have statistics for all data referenced by 
a snapshot?  We will not support statistics per data file?



##########
format/spec.md:
##########
@@ -496,6 +496,7 @@ A snapshot consists of the following fields:
 | _optional_ |            | **`manifests`**          | A list of manifest file 
locations. Must be omitted if `manifest-list` is present |
 | _optional_ | _required_ | **`summary`**            | A string map that 
summarizes the snapshot changes, including `operation` (see below) |
 | _optional_ | _optional_ | **`schema-id`**          | ID of the table's 
current schema when the snapshot was created |
+|            | _optional_ | **`statistics`**         | A list of [statistics 
files' metadata](#statistics-file). The field should be retained by writers, 
unless writer updates the statistics, or knows they became obsolete. |

Review Comment:
   Trying to understand this (sorry if missed previous discussion), does a 
puffin file or list of puffin files have statistics for all data referenced by 
a snapshot?  We will not support statistics per data file?



-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: [email protected]

For queries about this service, please contact Infrastructure at:
[email protected]


---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

Reply via email to