deepeye opened a new issue #3447:
URL: https://github.com/apache/iceberg/issues/3447


   Within 10 hours, the number of data files has increased fourfold and the
total file size has increased fivefold, even though I have run both the
expire snapshots and compact data files operations.
   I do not know why. Please help me!
   
   Iceberg table definition:
   ```
   CREATE TABLE iceberg_user(
     user_id STRING,
     `name` STRING,
     PRIMARY KEY (`user_id`) NOT ENFORCED
   )
   WITH(
       'connector'='iceberg',
       'format-version' = '2',
       'engine.hive.enabled' = 'true',
       'write.distribution-mode'='hash',
       'write.upsert.enable'='true',
       'write.metadata.delete-after-commit.enabled' = 'true',
       'write.metadata.previous-versions-max' = '6',
       'history.expire.max-snapshot-age-ms' = '180000',
       'flink.rewrite.enable' = 'true',
       'flink.rewrite.parallelism' = '1',
       'flink.rewrite.target-file-size-bytes' = '134217728',
       'flink.rewrite.max-files-count' = '50',
       'catalog-name'='hive_prod',
       'catalog-database'='hch_data_center',
       'catalog-table'='iceberg_user',
       'uri'='thrift://flink001:9083',
       'io-impl'='org.apache.iceberg.aliyun.oss.OSSFileIO',
       'oss.endpoint'='',
       'oss.access.key.id'='',
       'oss.access.key.secret'='',
       'warehouse'='oss://bucket/iceberg/warehouse'
   );
   ```
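
For anyone reading the settings, the numeric values in the table definition can be converted to more readable units. The following is only a unit-conversion sketch in Python; it copies the property strings from the statement above and does not call any Iceberg or Flink API. The variable names are made up for illustration.

```python
# Property values copied verbatim from the CREATE TABLE statement above.
props = {
    "history.expire.max-snapshot-age-ms": "180000",
    "flink.rewrite.target-file-size-bytes": "134217728",
    "flink.rewrite.max-files-count": "50",
    "write.metadata.previous-versions-max": "6",
}

# Milliseconds -> minutes (60,000 ms per minute).
snapshot_age_minutes = int(props["history.expire.max-snapshot-age-ms"]) / 60_000

# Bytes -> MiB (1024 * 1024 bytes per MiB).
target_file_mib = int(props["flink.rewrite.target-file-size-bytes"]) / (1024 * 1024)

print(f"snapshot max age: {snapshot_age_minutes:g} minutes")
print(f"rewrite target file size: {target_file_mib:g} MiB")
```

So the snapshot age limit is 3 minutes and the rewrite target file size is 128 MiB.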
   
   


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: [email protected]

For queries about this service, please contact Infrastructure at:
[email protected]


