deepeye opened a new issue #3447:
URL: https://github.com/apache/iceberg/issues/3447
Within 10 hours, the number of data files has grown fourfold and the
file size has grown fivefold, even though I have run both the expire
snapshots and the compact data files operations.
I do not know why. Please help!
Iceberg table definition:
```
CREATE TABLE iceberg_user(
user_id STRING,
`name` STRING,
PRIMARY KEY (`user_id`) NOT ENFORCED
)
WITH(
'connector'='iceberg',
'format-version' = '2',
'engine.hive.enabled' = 'true',
'write.distribution-mode'='hash',
'write.upsert.enable'='true',
'write.metadata.delete-after-commit.enabled' = 'true',
'write.metadata.previous-versions-max' = '6',
'history.expire.max-snapshot-age-ms' = '180000',
'flink.rewrite.enable' = 'true',
'flink.rewrite.parallelism' = '1',
'flink.rewrite.target-file-size-bytes' = '134217728',
'flink.rewrite.max-files-count' = '50',
'catalog-name'='hive_prod',
'catalog-database'='hch_data_center',
'catalog-table'='iceberg_user',
'uri'='thrift://flink001:9083',
'io-impl'='org.apache.iceberg.aliyun.oss.OSSFileIO',
'oss.endpoint'='',
'oss.access.key.id'='',
'oss.access.key.secret'='',
'warehouse'='oss://bucket/iceberg/warehouse'
);
```
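For readers checking the settings, the two raw numeric values in the definition above can be converted to friendlier units. This is only a minimal sketch: the helper names `ms_to_minutes` and `bytes_to_mib` are made up for illustration, and the code only converts units. It says nothing about how the properties are interpreted at runtime.

```python
# Numeric values copied verbatim from the table properties above.
settings = {
    "history.expire.max-snapshot-age-ms": 180000,
    "flink.rewrite.target-file-size-bytes": 134217728,
}


def ms_to_minutes(ms: int) -> float:
    """Convert milliseconds to minutes (hypothetical helper)."""
    return ms / 1000 / 60


def bytes_to_mib(n: int) -> float:
    """Convert bytes to mebibytes (hypothetical helper)."""
    return n / (1024 * 1024)


snapshot_age_min = ms_to_minutes(settings["history.expire.max-snapshot-age-ms"])
target_size_mib = bytes_to_mib(settings["flink.rewrite.target-file-size-bytes"])

print(f"snapshot max age: {snapshot_age_min:g} minutes")
print(f"rewrite target file size: {target_size_mib:g} MiB")
```

So the configured snapshot age is 3 minutes and the rewrite target file size is 128 MiB.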