hguercan commented on issue #13763:
URL: https://github.com/apache/iceberg/issues/13763#issuecomment-3210443481

   Thanks to @yogevyuval, who drew our attention to the high number of 
"added-data-files" in the commits where the duplicate file path references 
occur. We compared against the earlier commits, and the number is far higher 
than usual. For the other commits it is mostly two to three digits, 
occasionally a few thousand, but never as large as "46015" or "236825". The 
"added-records" and "added-files-size" values behave the same way. 
   
   We are not sure how such a large number could be explained. The operations 
we see before the commits containing the failures/duplicates were maintenance 
jobs (operation type "replace"). 
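
   For anyone who wants to run the same check, here is a rough sketch of how the snapshot summaries could be scanned for unusually large "added-data-files" values. It assumes the snapshots have already been loaded as dictionaries shaped like the "snapshots" entries of a table metadata JSON file. The function name, the factor-of-10 threshold, and the sample data are hypothetical. Only the two large counts are taken from the numbers above; the snapshot IDs, operations, and remaining counts are made up.

```python
from statistics import median


def flag_outlier_snapshots(snapshots, factor=10):
    """Return (snapshot-id, operation, added-data-files) for snapshots whose
    added-data-files count exceeds `factor` times the median count.

    The median-based threshold is an illustrative heuristic, not an Iceberg rule.
    """
    # Summary values are stored as strings in the metadata, so convert to int.
    counts = [int(s["summary"].get("added-data-files", 0)) for s in snapshots]
    baseline = max(median(counts), 1)  # avoid a zero threshold
    flagged = []
    for snap, count in zip(snapshots, counts):
        if count > factor * baseline:
            flagged.append(
                (snap["snapshot-id"], snap["summary"].get("operation"), count)
            )
    return flagged


# Hypothetical sample data: IDs, operations, and the small counts are invented.
snapshots = [
    {"snapshot-id": 1, "summary": {"operation": "append", "added-data-files": "42"}},
    {"snapshot-id": 2, "summary": {"operation": "append", "added-data-files": "310"}},
    {"snapshot-id": 3, "summary": {"operation": "replace", "added-data-files": "12"}},
    {"snapshot-id": 4, "summary": {"operation": "append", "added-data-files": "46015"}},
    {"snapshot-id": 5, "summary": {"operation": "append", "added-data-files": "2500"}},
    {"snapshot-id": 6, "summary": {"operation": "append", "added-data-files": "236825"}},
]

for snapshot_id, operation, added in flag_outlier_snapshots(snapshots):
    print(snapshot_id, operation, added)
```

   Listing the operation next to each flagged snapshot makes it easier to see which kind of commit preceded it in the history.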


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: [email protected]

For queries about this service, please contact Infrastructure at:
[email protected]

