mannerni commented on issue #8427: URL: https://github.com/apache/iceberg/issues/8427#issuecomment-3913381913
Hi, we have hit a similar error while using Trino (479) with an Iceberg REST catalog (1.10.1). It may be related to Trino, as mentioned above. I suspect the issue is with our optimization commands. We run optimize table by size (2/4/40/128MB) -> optimize manifest -> remove-snapshots -> remove-orphan-files. These four commands run back to back, with no wait between them. The command that appears to trigger the problem is remove-orphan-files. The severe part is that we cannot read the affected Parquet file at all. We need reliable handling and a recommendation for avoiding file corruption.

Error detail: (Error opening Iceberg split s3://xxxxx-us-east-1-ts-zbc/t001/tablename/data/time_month=2022-03/id_bucket=3/20260209_035531_00064_t2j6q-856320fd-41db-4f68-acd1-317788e9238e.parquet (offset=14926945, length=3745233): s3://xxxxx-us-east-1-ts-zbc/t001/tablename/data/time_month=2022-03/id_bucket=3/20260209_035531_00064_t2j6q-856320fd-41db-4f68-acd1-317788e9238e.parquet)
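
For anyone trying to reproduce this, the maintenance sequence described above might be expressed roughly as follows. This is only a sketch under assumptions: it uses the Trino Iceberg connector's `ALTER TABLE ... EXECUTE` procedures, it assumes "remove-snapshots" corresponds to `expire_snapshots`, and the helper name `build_maintenance_statements`, the catalog-qualified table name, and the `7d` retention value are hypothetical rather than taken from the actual job. It only builds the SQL strings and does not connect to Trino.

```python
from typing import List, Sequence


def build_maintenance_statements(
    table: str,
    size_thresholds: Sequence[str] = ("2MB", "4MB", "40MB", "128MB"),
    retention: str = "7d",  # hypothetical value, not taken from the report
) -> List[str]:
    """Return the four-stage maintenance sequence as Trino SQL strings."""
    # Stage 1: one optimize pass per file-size threshold, smallest first.
    statements = [
        f"ALTER TABLE {table} EXECUTE optimize(file_size_threshold => '{size}')"
        for size in size_thresholds
    ]
    # Stage 2: rewrite manifests.
    statements.append(f"ALTER TABLE {table} EXECUTE optimize_manifests")
    # Stage 3: expire snapshots older than the retention threshold.
    statements.append(
        f"ALTER TABLE {table} EXECUTE "
        f"expire_snapshots(retention_threshold => '{retention}')"
    )
    # Stage 4: delete unreferenced files older than the retention threshold.
    statements.append(
        f"ALTER TABLE {table} EXECUTE "
        f"remove_orphan_files(retention_threshold => '{retention}')"
    )
    return statements


if __name__ == "__main__":
    # Hypothetical fully qualified table name, used for illustration only.
    for sql in build_maintenance_statements("iceberg.t001.tablename"):
        print(sql)
```

The `retention_threshold` argument limits `remove_orphan_files` to files older than the given age. Whether a longer retention, or a pause between the stages, avoids the error reported here is not established.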
