huleilei commented on PR #4460:
URL: https://github.com/apache/iceberg/pull/4460#issuecomment-1087627662
>
@rdblue
The function of `write-id` is not only to track file change when data files
are compact. The old name of `write-id` is file-sequence-name. We just want to
distinguish the name of `sequence-number`. The `write-id` is fixed for a data
file, because the data file is unchanged. For example in the insert operation,
the info in manifest-entry is:
```
{"status":1,"snapshot_id":{"long":4470921710704596635},"sequence_number":null,"writer_id":{"long":1},"data_file":{"content":0,"file_path":"file:/var/folders/lw/pf18s0dj1lv67sthrh4089lh0000gp/T/hive5259352165618126566/table/data/c2=foo/00009-11-a8106282-5ddb-400b-a4c1-bdd10889fc31-00001.parquet","file_format":"PARQUET","partition":{"c2":{"string":"foo"}},"record_count":1,"file_size_in_bytes":841,"column_sizes":{"array":[{"key":1,"value":51},{"key":2,"value":54},{"key":3,"value":49}]},"value_counts":{"array":[{"key":1,"value":1},{"key":2,"value":1},{"key":3,"value":1}]},"null_value_counts":{"array":[{"key":1,"value":0},{"key":2,"value":0},{"key":3,"value":1}]},"nan_value_counts":{"array":[]},"lower_bounds":{"array":[{"key":1,"value":"\u0001\u0000\u0000\u0000"},{"key":2,"value":"foo"}]},"upper_bounds":{"array":[{"key":1,"value":"\u0001\u0000\u0000\u0000"},{"key":2,"value":"foo"}]},"key_metadata":null,"split_offsets":{"array":[4]},"equality_ids":null,"sort_order_id":{"int":0}}}
```
After performing update and delete operations on the table, the value of
`write-id` in the data
file(`00009-11-a8106282-5ddb-400b-a4c1-bdd10889fc31-00001.parquet`) is still 1.
--
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
To unsubscribe, e-mail: [email protected]
For queries about this service, please contact Infrastructure at:
[email protected]
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]