Jyotirmoy Sinha created HDDS-10421:
--------------------------------------
Summary: [snapshot-LR] OM corruption due to NullPointerException
in an LR setup
Key: HDDS-10421
URL: https://issues.apache.org/jira/browse/HDDS-10421
Project: Apache Ozone
Issue Type: Bug
Components: Snapshot
Reporter: Jyotirmoy Sinha
Scenario :
* Generate data over parallel threads over various volume/buckets
* Perform parallel snapshot create/delete/list operations over above buckets
* Perform parallel snapdiff operations over each bucket
* Perform parallel read operations of snapshot contents
Nullpointer exception in OM logs -
{code:java}
2024-02-14 21:50:59,574 [OM StateMachine ApplyTransaction Thread - 0] WARN
org.apache.hadoop.ozone.om.snapshot.SnapshotCache: Key:
'/vol-yu9w1/buck-kmkp4/snap1707922796' does not exist in cache.
2024-02-14 21:50:59,907 [OM StateMachine ApplyTransaction Thread - 0] ERROR
org.apache.hadoop.ozone.om.ratis.OzoneManagerStateMachine: Terminating with
exit status 1: Request cmdType: SnapshotPurge
clientId: "client-62D7790F8E6C"
SnapshotPurgeRequest {
snapshotDBKeys: "/vol-yu9w1/buck-kmkp4/snap1707922796"
}
failed with exception
java.lang.NullPointerException
at
org.apache.hadoop.ozone.om.snapshot.SnapshotUtils.getNextActiveSnapshot(SnapshotUtils.java:152)
at
org.apache.hadoop.ozone.om.request.snapshot.OMSnapshotPurgeRequest.validateAndUpdateCache(OMSnapshotPurgeRequest.java:99)
at
org.apache.hadoop.ozone.protocolPB.OzoneManagerRequestHandler.handleWriteRequest(OzoneManagerRequestHandler.java:375)
at
org.apache.hadoop.ozone.om.ratis.OzoneManagerStateMachine.runCommand(OzoneManagerStateMachine.java:568)
at
org.apache.hadoop.ozone.om.ratis.OzoneManagerStateMachine.lambda$1(OzoneManagerStateMachine.java:359)
at
java.util.concurrent.CompletableFuture$AsyncSupply.run(CompletableFuture.java:1604)
at
java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1149)
at
java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:624)
at java.lang.Thread.run(Thread.java:748)
2024-02-14 21:50:59,912 [shutdown-hook-0] INFO
org.apache.ranger.audit.provider.AuditProviderFactory: ==>
JVMShutdownHook.run() {code}
--
This message was sent by Atlassian Jira
(v8.20.10#820010)
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]