[ 
https://issues.apache.org/jira/browse/HDDS-12377?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Arafat Khan resolved HDDS-12377.
--------------------------------
    Fix Version/s: 2.0.0
       Resolution: Fixed

> Improve error handling of OM background tasks processing in case of abrupt 
> crash of Recon
> -----------------------------------------------------------------------------------------
>
>                 Key: HDDS-12377
>                 URL: https://issues.apache.org/jira/browse/HDDS-12377
>             Project: Apache Ozone
>          Issue Type: Task
>          Components: Ozone Recon
>            Reporter: Devesh Kumar Singh
>            Assignee: Devesh Kumar Singh
>            Priority: Major
>              Labels: pull-request-available
>             Fix For: 2.0.0
>
>
> If Recon has applied incremental DB updates and just before consuming those 
> events, if Recon crashed due to some unexpected error or CU restarted the 
> Recon during that time, then on restart of Recon again, recon will not try to 
> consume those events again and due to this edge case, OM DB updates will be 
> missed, So there are 2 solutions to fix this gap:
>  * On restart, check if incremental DB update task lastSequence number not 
> matching with lastUpdatedSeq number of underlying task, then just run 
> reprocess for such tasks.
>  * Another way, maintain lastUpdatedSequence number with each event 
> consumption and then start applying from there on restart, but this may not 
> be worth to implement the complex handling for this edge case.



--
This message was sent by Atlassian Jira
(v8.20.10#820010)

---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

Reply via email to