[
https://issues.apache.org/jira/browse/HUDI-3840?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
sivabalan narayanan updated HUDI-3840:
--------------------------------------
Fix Version/s: 0.12.0
> Warn logs about not able to read replace commit metadata
> ---------------------------------------------------------
>
> Key: HUDI-3840
> URL: https://issues.apache.org/jira/browse/HUDI-3840
> Project: Apache Hudi
> Issue Type: Task
> Components: spark
> Reporter: sivabalan narayanan
> Priority: Major
> Fix For: 0.12.0
>
>
> I was trying out spark streaming sink w/ hudi and saw warn logs as below.
> {code:java}
> 22/04/09 15:54:16 WARN AbstractTableFileSystemView: Could not read commit
> details from
> /tmp/hudi_streaming_kafka/COPY_ON_WRITE/.hoodie/20220409154917240.replacecommit
> 22/04/09 15:54:16 WARN AbstractTableFileSystemView: Could not read commit
> details from
> /tmp/hudi_streaming_kafka/COPY_ON_WRITE/.hoodie/20220409155011647.replacecommit
> {code}
> But ran some validations and ensured data was intact. Further investigation
> revealed that, this happens just after archival, where in the replace commit
> shown above were part of the list of instants that got archived. So, may be
> active timeline reloading is missed somewhere. Since its a warn log and does
> not cause any correctness issue, filing a low priority ticket.
>
> Steps to repo:
> spark streaming write to Hudi COW table w/ async clustering. make archival
> aggressive and you should see these logs at some point
>
>
--
This message was sent by Atlassian Jira
(v8.20.1#820001)