[
https://issues.apache.org/jira/browse/YARN-11716?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
John Doe updated YARN-11716:
----------------------------
Description:
MemoryFederationStateStore.close() sets the membership map to null without
synchronization.
If a concurrent call to registerSubCluster() reads the membership reference
between the null assignment and its own map access, it will dereference a null
object and throw a NullPointerException, crashing the Federation state store
service.
was:
{code:java}
org.apache.hadoop.yarn.server.nodemanager.containermanager.application.ApplicationImpl:
Can't handle this event at current state
org.apache.hadoop.yarn.state.InvalidStateTransitionException: Invalid event:
FINISH_APPLICATION at NEW
at
org.apache.hadoop.yarn.state.StateMachineFactory.doTransition(StateMachineFactory.java:305)
at
org.apache.hadoop.yarn.state.StateMachineFactory.access$500(StateMachineFactory.java:46)
at
org.apache.hadoop.yarn.state.StateMachineFactory$InternalStateMachine.doTransition(StateMachineFactory.java:487)
at
org.apache.hadoop.yarn.server.nodemanager.containermanager.application.ApplicationImpl.handle(ApplicationImpl.java:627)
at
org.apache.hadoop.yarn.server.nodemanager.containermanager.application.ApplicationImpl.handle(ApplicationImpl.java:75)
at
org.apache.hadoop.yarn.server.nodemanager.containermanager.ContainerManagerImpl$ApplicationEventDispatcher.handle(ContainerManagerImpl.java:1508)
at
org.apache.hadoop.yarn.server.nodemanager.containermanager.ContainerManagerImpl$ApplicationEventDispatcher.handle(ContainerManagerImpl.java:1501)
at
org.apache.hadoop.yarn.event.AsyncDispatcher.dispatch(AsyncDispatcher.java:197)
at
org.apache.hadoop.yarn.event.AsyncDispatcher$1.run(AsyncDispatcher.java:126)
at java.lang.Thread.run(Thread.java:745 {code}
> NullPointerException in MemoryFederationStateStore when close() races with
> registerSubCluster()
> -----------------------------------------------------------------------------------------------
>
> Key: YARN-11716
> URL: https://issues.apache.org/jira/browse/YARN-11716
> Project: Hadoop YARN
> Issue Type: Bug
> Components: yarn
> Affects Versions: 2.8.0
> Reporter: John Doe
> Priority: Major
>
> MemoryFederationStateStore.close() sets the membership map to null without
> synchronization.
> If a concurrent call to registerSubCluster() reads the membership reference
> between the null assignment and its own map access, it will dereference a
> null object and throw a NullPointerException, crashing the Federation state
> store service.
--
This message was sent by Atlassian Jira
(v8.20.10#820010)
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]