[ 
https://issues.apache.org/jira/browse/HDDS-8345?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Duong resolved HDDS-8345.
-------------------------
    Resolution: Fixed

PR merged. Thanks [~hemantk] for the contribution. 

> [snapshot] OM process crash on restart due to Snapshot Chain corruption
> -----------------------------------------------------------------------
>
>                 Key: HDDS-8345
>                 URL: https://issues.apache.org/jira/browse/HDDS-8345
>             Project: Apache Ozone
>          Issue Type: Bug
>          Components: Ozone Manager, Snapshot
>            Reporter: Jyotirmoy Sinha
>            Assignee: Hemant Kumar
>            Priority: Major
>              Labels: ozone-snapshot, pull-request-available
>
> Scenario: Create 13k+ snapshots on a cluster and then restart the Ozone 
> services.
> The OM process crashed with a Snapshot Chain corruption exception because it 
> was not able to find any of the SST files.
> Stack trace:
> {code:java}
> 2023-03-31 07:43:34,951 ERROR org.apache.hadoop.ozone.om.OzoneManagerStarter: OM start failed with exception
> java.io.IOException: Snapshot Chain corruption:  previous snapshotID given but no associated snapshot found in snapshot chain: SnapshotID 1197e0c1-99d1-43b9-9b33-424a6c09b35a
>         at org.apache.hadoop.ozone.om.SnapshotChainManager.addSnapshotGlobal(SnapshotChainManager.java:86)
>         at org.apache.hadoop.ozone.om.SnapshotChainManager.addSnapshot(SnapshotChainManager.java:288)
>         at org.apache.hadoop.ozone.om.SnapshotChainManager.loadFromSnapshotInfoTable(SnapshotChainManager.java:279)
>         at org.apache.hadoop.ozone.om.SnapshotChainManager.<init>(SnapshotChainManager.java:63)
>         at org.apache.hadoop.ozone.om.OmMetadataManagerImpl.start(OmMetadataManagerImpl.java:481)
>         at org.apache.hadoop.ozone.om.OmMetadataManagerImpl.<init>(OmMetadataManagerImpl.java:320)
>         at org.apache.hadoop.ozone.om.OzoneManager.instantiateServices(OzoneManager.java:747)
>         at org.apache.hadoop.ozone.om.OzoneManager.<init>(OzoneManager.java:627)
>         at org.apache.hadoop.ozone.om.OzoneManager.createOm(OzoneManager.java:712)
>         at org.apache.hadoop.ozone.om.OzoneManagerStarter$OMStarterHelper.start(OzoneManagerStarter.java:189)
>         at org.apache.hadoop.ozone.om.OzoneManagerStarter.startOm(OzoneManagerStarter.java:86)
>         at org.apache.hadoop.ozone.om.OzoneManagerStarter.call(OzoneManagerStarter.java:74)
>         at org.apache.hadoop.hdds.cli.GenericCli.call(GenericCli.java:38)
>         at picocli.CommandLine.executeUserObject(CommandLine.java:1953)
>         at picocli.CommandLine.access$1300(CommandLine.java:145)
>         at picocli.CommandLine$RunLast.executeUserObjectOfLastSubcommandWithSameParent(CommandLine.java:2352)
>         at picocli.CommandLine$RunLast.handle(CommandLine.java:2346)
>         at picocli.CommandLine$RunLast.handle(CommandLine.java:2311)
>         at picocli.CommandLine$AbstractParseResultHandler.execute(CommandLine.java:2179)
>         at picocli.CommandLine.execute(CommandLine.java:2078)
>         at org.apache.hadoop.hdds.cli.GenericCli.execute(GenericCli.java:100)
>         at org.apache.hadoop.hdds.cli.GenericCli.run(GenericCli.java:91)
>         at org.apache.hadoop.ozone.om.OzoneManagerStarter.main(OzoneManagerStarter.java:58)
> 2023-03-31 07:43:34,955 INFO org.apache.hadoop.ozone.om.OzoneManagerStarter: SHUTDOWN_MSG:
> /************************************************************
> SHUTDOWN_MSG: Shutting down OzoneManager at jspriv02-8.jspriv02.root.hwx.site/172.27.115.2
> ************************************************************/  {code}
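
For readers unfamiliar with this failure mode, the following is a minimal sketch of the kind of check that produces the message in the stack trace above. It is not Ozone's SnapshotChainManager: the class SnapshotChainSketch, its methods, and the snapshot IDs are hypothetical, and this message does not state the root cause that the merged PR fixed. The sketch only shows that a chain which links each snapshot to its predecessor rejects an entry whose predecessor has not been added yet, for example when entries are loaded in an order other than creation order.

```java
import java.io.IOException;
import java.util.LinkedHashMap;
import java.util.Map;

// Hypothetical, simplified stand-in for a snapshot chain; not Ozone code.
final class SnapshotChainSketch {
  // snapshotId -> previous snapshotId (null for the first snapshot).
  private final Map<String, String> previousById = new LinkedHashMap<>();

  // Adds a snapshot and fails if its predecessor is not yet in the chain.
  void addSnapshot(String snapshotId, String previousId) throws IOException {
    if (previousId != null && !previousById.containsKey(previousId)) {
      // Message text mirrors the log above; which ID the real code prints
      // here is an assumption.
      throw new IOException("Snapshot Chain corruption: previous snapshotID "
          + "given but no associated snapshot found in snapshot chain: "
          + "SnapshotID " + previousId);
    }
    previousById.put(snapshotId, previousId);
  }

  int size() {
    return previousById.size();
  }

  public static void main(String[] args) throws IOException {
    // Loading in creation order succeeds.
    SnapshotChainSketch inOrder = new SnapshotChainSketch();
    inOrder.addSnapshot("snap-1", null);
    inOrder.addSnapshot("snap-2", "snap-1");
    System.out.println("loaded in chain order: " + inOrder.size());

    // Loading a successor before its predecessor fails.
    SnapshotChainSketch outOfOrder = new SnapshotChainSketch();
    try {
      outOfOrder.addSnapshot("snap-2", "snap-1");
    } catch (IOException e) {
      System.out.println("load failed: " + e.getMessage());
    }
  }
}
```

In this sketch the exception is raised at load time, which matches where the trace shows the failure (during loadFromSnapshotInfoTable on OM start) but says nothing about why the predecessor was missing in the reported cluster.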



