[ 
https://issues.apache.org/jira/browse/MAPREDUCE-7123?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16546054#comment-16546054
 ] 

Amithsha edited comment on MAPREDUCE-7123 at 7/18/18 4:30 AM:
--------------------------------------------------------------

>From the stack trace Found that the error is from Mapreduce 2.7.1 Client code. 
 Where the Resource manager and nodemanager is running on 2.9.0.
 So the client code is 2.7.1 and error is from that client code which may not 
be handled in 2.9.0.


was (Author: amithsha):
>From the stack trace Found that the error is running from Mapreduce 2.7.1 
Where the Resource manager and nodemanager is running on 2.9.0.
So the client code is 2.7.1 and error is from that client code which may not be 
handled in 2.9.0.

> AM Failed with Communication error to RM
> ----------------------------------------
>
>                 Key: MAPREDUCE-7123
>                 URL: https://issues.apache.org/jira/browse/MAPREDUCE-7123
>             Project: Hadoop Map/Reduce
>          Issue Type: Bug
>          Components: mrv2
>    Affects Versions: 2.9.0
>            Reporter: Amithsha
>            Priority: Major
>
> During the restart of nodemanagers in 300 node cluster some jobs failed with 
> the following exceptions.
> But the nodes where the AM launched is not the part of cluster.
> FATAL [AsyncDispatcher event handler] 
> org.apache.hadoop.yarn.event.AsyncDispatcher: Error in dispatcher thread 
> java.lang.NullPointerException at 
> org.apache.hadoop.mapreduce.v2.app.job.impl.JobImpl$UpdatedNodesTransition.transition(JobImpl.java:2146)
>  at 
> org.apache.hadoop.mapreduce.v2.app.job.impl.JobImpl$UpdatedNodesTransition.transition(JobImpl.java:2139)
>  at 
> org.apache.hadoop.yarn.state.StateMachineFactory$SingleInternalArc.doTransition(StateMachineFactory.java:362)
>  at 
> org.apache.hadoop.yarn.state.StateMachineFactory.doTransition(StateMachineFactory.java:302)
>  at 
> org.apache.hadoop.yarn.state.StateMachineFactory.access$300(StateMachineFactory.java:46)
>  at 
> org.apache.hadoop.yarn.state.StateMachineFactory$InternalStateMachine.doTransition(StateMachineFactory.java:448)
>  at 
> org.apache.hadoop.mapreduce.v2.app.job.impl.JobImpl.handle(JobImpl.java:998) 
> at 
> org.apache.hadoop.mapreduce.v2.app.job.impl.JobImpl.handle(JobImpl.java:138) 
> at 
> org.apache.hadoop.mapreduce.v2.app.MRAppMaster$JobEventDispatcher.handle(MRAppMaster.java:1346)
>  at 
> org.apache.hadoop.mapreduce.v2.app.MRAppMaster$JobEventDispatcher.handle(MRAppMaster.java:1342)
>  at 
> org.apache.hadoop.yarn.event.AsyncDispatcher.dispatch(AsyncDispatcher.java:183)
>  at 
> org.apache.hadoop.yarn.event.AsyncDispatcher$1.run(AsyncDispatcher.java:109) 
> at java.lang.Thread.run(Thread.java:745) 2018-07-14 12:34:53,425 ERROR 
> [RMCommunicator Allocator] 
> org.apache.hadoop.mapreduce.v2.app.rm.RMCommunicator: ERROR IN CONTACTING RM. 
> java.lang.NullPointerException at 
> org.apache.hadoop.mapreduce.v2.app.rm.RMContainerAllocator.handleUpdatedNodes(RMContainerAllocator.java:875)
>  at 
> org.apache.hadoop.mapreduce.v2.app.rm.RMContainerAllocator.getResources(RMContainerAllocator.java:776)
>  at 
> org.apache.hadoop.mapreduce.v2.app.rm.RMContainerAllocator.heartbeat(RMContainerAllocator.java:256)
>  at 
> org.apache.hadoop.mapreduce.v2.app.rm.RMCommunicator$AllocatorRunnable.run(RMCommunicator.java:281)
>  at java.lang.Thread.run(Thread.java:745) 2018-07-14 12:34:53,427 INFO 
> [AsyncDispatcher ShutDown handler] 
> org.apache.hadoop.yarn.event.AsyncDispatcher: Exiting, bbye..



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)

---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

Reply via email to