Rakesh Radhakrishnan created HDDS-3897:
------------------------------------------
Summary: OM startup failing to replay ratis log entries when
configuring a different segment size
Key: HDDS-3897
URL: https://issues.apache.org/jira/browse/HDDS-3897
Project: Hadoop Distributed Data Store
Issue Type: Bug
Reporter: Rakesh Radhakrishnan
OM is not able to read log entries which was created with a different log
segment size.
More details and the steps to re-produce the issue:-
*1)* Configured OM with a {{ozone.om.ratis.segment.size=16KB}} and
{{ozone.om.ratis.segment.preallocated.size=16KB}}. Then perform user ops like
createFile, deleteFile.
*2)* Stop OM and reconfigure the segment size to 16MB like,
{{ozone.om.ratis.segment.size=16MB}} and
{{ozone.om.ratis.segment.preallocated.size=16MB}}.
*3)* Now OM startup is failing to replay all these transactions and throwing
following exception.
*Ozone-om.log*
{code:java}
2020-06-28 22:54:31,468 INFO org.eclipse.jetty.server.Server:
jetty-9.4.26.v20200117; built: 2020-01-17T12:35:33.676Z; git:
7b38981d25d14afb4a12ff1f2596756144edf695; jvm 1.8.0_232-b09
2020-06-28 22:54:31,480 ERROR
org.apache.hadoop.ozone.om.request.key.OMKeyDeleteRequest: Key delete failed.
Volume:vol2, Bucket:bucket2,
Keyfsperf-Jun-27-2020/dir0/dir2/dir2/ve1320.halxg.cloudera.com8296f35232-2ed6-4d3b-8392-db848f707dda.
Exception:{}
KEY_NOT_FOUND org.apache.hadoop.ozone.om.exceptions.OMException: Key not found
at
org.apache.hadoop.ozone.om.request.key.OMKeyDeleteRequest.validateAndUpdateCache(OMKeyDeleteRequest.java:135)
at
org.apache.hadoop.ozone.protocolPB.OzoneManagerRequestHandler.handleWriteRequest(OzoneManagerRequestHandler.java:240)
at
org.apache.hadoop.ozone.om.ratis.OzoneManagerStateMachine.runCommand(OzoneManagerStateMachine.java:418)
at
org.apache.hadoop.ozone.om.ratis.OzoneManagerStateMachine.lambda$applyTransaction$1(OzoneManagerStateMachine.java:236)
at
java.util.concurrent.CompletableFuture$AsyncSupply.run(CompletableFuture.java:1604)
at
java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1149)
at
java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:624)
at java.lang.Thread.run(Thread.java:748)
2020-06-28 22:54:31,481 ERROR
org.apache.hadoop.ozone.om.request.key.OMKeyDeleteRequest: Key delete failed.
Volume:vol2, Bucket:bucket2,
Keyfsperf-Jun-27-2020/dir0/dir3/dir0/ve1320.halxg.cloudera.com96bac30bc0-332c-442e-9691-244cb96b7c90.
Exception:{}
KEY_NOT_FOUND org.apache.hadoop.ozone.om.exceptions.OMException: Key not found
at
org.apache.hadoop.ozone.om.request.key.OMKeyDeleteRequest.validateAndUpdateCache(OMKeyDeleteRequest.java:135)
at
org.apache.hadoop.ozone.protocolPB.OzoneManagerRequestHandler.handleWriteRequest(OzoneManagerRequestHandler.java:240)
at
org.apache.hadoop.ozone.om.ratis.OzoneManagerStateMachine.runCommand(OzoneManagerStateMachine.java:418)
at
org.apache.hadoop.ozone.om.ratis.OzoneManagerStateMachine.lambda$applyTransaction$1(OzoneManagerStateMachine.java:236)
at
java.util.concurrent.CompletableFuture$AsyncSupply.run(CompletableFuture.java:1604)
at
java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1149)
at
java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:624)
at java.lang.Thread.run(Thread.java:748)
{code}
--
This message was sent by Atlassian Jira
(v8.3.4#803005)
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]