[
https://issues.apache.org/jira/browse/HDDS-13551?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Bablu Raul updated HDDS-13551:
------------------------------
Description:
When I stop the SCM leader(i.e data-10:LEADER), the system correctly triggers a
leader election and successfully elects a new SCM leader. This can be verified
through the CLI, where the new leader is visible
{code:java}
data-17:FOLLOWER
data-1:FOLLOWER
data-10:LEADER{code}
{code:java}
data-17:FOLLOWER
data-1:LEADER
data-10:FOLLOWER {code}
{code:java}
2025-08-07 05:15:11,854|INFO|MainThread|machine.py:205 -
run()||GUID=51debc2b-956b-4e05-b036-ced4aa0547f4|at
org.apache.hadoop.hdds.scm.server.SCMClientProtocolServer.reconcileContainer(SCMClientProtocolServer.java:1542)
2025-08-07 05:15:11,854|INFO|MainThread|machine.py:205 -
run()||GUID=51debc2b-956b-4e05-b036-ced4aa0547f4|at
org.apache.hadoop.hdds.scm.protocol.StorageContainerLocationProtocolServerSideTranslatorPB.reconcileContainer(StorageContainerLocationProtocolServerSideTranslatorPB.java:1360)
2025-08-07 05:15:11,854|INFO|MainThread|machine.py:205 -
run()||GUID=51debc2b-956b-4e05-b036-ced4aa0547f4|at
org.apache.hadoop.hdds.scm.protocol.StorageContainerLocationProtocolServerSideTranslatorPB.processRequest(StorageContainerLocationProtocolServerSideTranslatorPB.java:739)
2025-08-07 05:15:11,854|INFO|MainThread|machine.py:205 -
run()||GUID=51debc2b-956b-4e05-b036-ced4aa0547f4|at
org.apache.hadoop.hdds.server.OzoneProtocolMessageDispatcher.processRequest(OzoneProtocolMessageDispatcher.java:89)
2025-08-07 05:15:11,855|INFO|MainThread|machine.py:205 -
run()||GUID=51debc2b-956b-4e05-b036-ced4aa0547f4|at
org.apache.hadoop.hdds.scm.protocol.StorageContainerLocationProtocolServerSideTranslatorPB.submitRequest(StorageContainerLocationProtocolServerSideTranslatorPB.java:235)
2025-08-07 05:15:11,855|INFO|MainThread|machine.py:205 -
run()||GUID=51debc2b-956b-4e05-b036-ced4aa0547f4|at
org.apache.hadoop.hdds.protocol.proto.StorageContainerLocationProtocolProtos$StorageContainerLocationProtocolService$2.callBlockingMethod(StorageContainerLocationProtocolProtos.java)
2025-08-07 05:15:11,855|INFO|MainThread|machine.py:205 -
run()||GUID=51debc2b-956b-4e05-b036-ced4aa0547f4|at
org.apache.hadoop.ipc.ProtobufRpcEngine$Server$ProtoBufRpcInvoker.call(ProtobufRpcEngine.java:533)
2025-08-07 05:15:11,855|INFO|MainThread|machine.py:205 -
run()||GUID=51debc2b-956b-4e05-b036-ced4aa0547f4|at
org.apache.hadoop.ipc.RPC$Server.call(RPC.java:1070) 2025-08-07
05:15:11,855|INFO|MainThread|machine.py:205 -
run()||GUID=51debc2b-956b-4e05-b036-ced4aa0547f4|at
org.apache.hadoop.ipc.Server$RpcCall.run(Server.java:994) 2025-08-07
05:15:11,856|INFO|MainThread|machine.py:205 -
run()||GUID=51debc2b-956b-4e05-b036-ced4aa0547f4|at
org.apache.hadoop.ipc.Server$RpcCall.run(Server.java:922) 2025-08-07
05:15:11,856|INFO|MainThread|machine.py:205 -
run()||GUID=51debc2b-956b-4e05-b036-ced4aa0547f4|at
java.security.AccessController.doPrivileged(Native Method) 2025-08-07
05:15:11,856|INFO|MainThread|machine.py:205 -
run()||GUID=51debc2b-956b-4e05-b036-ced4aa0547f4|at
javax.security.auth.Subject.doAs(Subject.java:422) 2025-08-07
05:15:11,856|INFO|MainThread|machine.py:205 -
run()||GUID=51debc2b-956b-4e05-b036-ced4aa0547f4|at
org.apache.hadoop.security.UserGroupInformation.doAs(UserGroupInformation.java:1910)
2025-08
{code}
was:
When I stop the SCM leader, the system correctly triggers a leader election and
successfully elects a new SCM leader. This can be verified through the CLI,
where the new leader is visible
{code:java}
data-17:FOLLOWER data-17:FOLLOWER{code}
> ServerNotLeaderException after SCM leader is stopped and new leader is
> elected
> -------------------------------------------------------------------------------
>
> Key: HDDS-13551
> URL: https://issues.apache.org/jira/browse/HDDS-13551
> Project: Apache Ozone
> Issue Type: Bug
> Reporter: Bablu Raul
> Priority: Major
>
> When I stop the SCM leader(i.e data-10:LEADER), the system correctly triggers
> a leader election and successfully elects a new SCM leader. This can be
> verified through the CLI, where the new leader is visible
> {code:java}
> data-17:FOLLOWER
> data-1:FOLLOWER
> data-10:LEADER{code}
> {code:java}
> data-17:FOLLOWER
> data-1:LEADER
> data-10:FOLLOWER {code}
> {code:java}
> 2025-08-07 05:15:11,854|INFO|MainThread|machine.py:205 -
> run()||GUID=51debc2b-956b-4e05-b036-ced4aa0547f4|at
> org.apache.hadoop.hdds.scm.server.SCMClientProtocolServer.reconcileContainer(SCMClientProtocolServer.java:1542)
> 2025-08-07 05:15:11,854|INFO|MainThread|machine.py:205 -
> run()||GUID=51debc2b-956b-4e05-b036-ced4aa0547f4|at
> org.apache.hadoop.hdds.scm.protocol.StorageContainerLocationProtocolServerSideTranslatorPB.reconcileContainer(StorageContainerLocationProtocolServerSideTranslatorPB.java:1360)
> 2025-08-07 05:15:11,854|INFO|MainThread|machine.py:205 -
> run()||GUID=51debc2b-956b-4e05-b036-ced4aa0547f4|at
> org.apache.hadoop.hdds.scm.protocol.StorageContainerLocationProtocolServerSideTranslatorPB.processRequest(StorageContainerLocationProtocolServerSideTranslatorPB.java:739)
> 2025-08-07 05:15:11,854|INFO|MainThread|machine.py:205 -
> run()||GUID=51debc2b-956b-4e05-b036-ced4aa0547f4|at
> org.apache.hadoop.hdds.server.OzoneProtocolMessageDispatcher.processRequest(OzoneProtocolMessageDispatcher.java:89)
> 2025-08-07 05:15:11,855|INFO|MainThread|machine.py:205 -
> run()||GUID=51debc2b-956b-4e05-b036-ced4aa0547f4|at
> org.apache.hadoop.hdds.scm.protocol.StorageContainerLocationProtocolServerSideTranslatorPB.submitRequest(StorageContainerLocationProtocolServerSideTranslatorPB.java:235)
> 2025-08-07 05:15:11,855|INFO|MainThread|machine.py:205 -
> run()||GUID=51debc2b-956b-4e05-b036-ced4aa0547f4|at
> org.apache.hadoop.hdds.protocol.proto.StorageContainerLocationProtocolProtos$StorageContainerLocationProtocolService$2.callBlockingMethod(StorageContainerLocationProtocolProtos.java)
> 2025-08-07 05:15:11,855|INFO|MainThread|machine.py:205 -
> run()||GUID=51debc2b-956b-4e05-b036-ced4aa0547f4|at
> org.apache.hadoop.ipc.ProtobufRpcEngine$Server$ProtoBufRpcInvoker.call(ProtobufRpcEngine.java:533)
> 2025-08-07 05:15:11,855|INFO|MainThread|machine.py:205 -
> run()||GUID=51debc2b-956b-4e05-b036-ced4aa0547f4|at
> org.apache.hadoop.ipc.RPC$Server.call(RPC.java:1070) 2025-08-07
> 05:15:11,855|INFO|MainThread|machine.py:205 -
> run()||GUID=51debc2b-956b-4e05-b036-ced4aa0547f4|at
> org.apache.hadoop.ipc.Server$RpcCall.run(Server.java:994) 2025-08-07
> 05:15:11,856|INFO|MainThread|machine.py:205 -
> run()||GUID=51debc2b-956b-4e05-b036-ced4aa0547f4|at
> org.apache.hadoop.ipc.Server$RpcCall.run(Server.java:922) 2025-08-07
> 05:15:11,856|INFO|MainThread|machine.py:205 -
> run()||GUID=51debc2b-956b-4e05-b036-ced4aa0547f4|at
> java.security.AccessController.doPrivileged(Native Method) 2025-08-07
> 05:15:11,856|INFO|MainThread|machine.py:205 -
> run()||GUID=51debc2b-956b-4e05-b036-ced4aa0547f4|at
> javax.security.auth.Subject.doAs(Subject.java:422) 2025-08-07
> 05:15:11,856|INFO|MainThread|machine.py:205 -
> run()||GUID=51debc2b-956b-4e05-b036-ced4aa0547f4|at
> org.apache.hadoop.security.UserGroupInformation.doAs(UserGroupInformation.java:1910)
> 2025-08
> {code}
--
This message was sent by Atlassian Jira
(v8.20.10#820010)
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]