[
https://issues.apache.org/jira/browse/MAPREDUCE-3030?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13107767#comment-13107767
]
Devaraj K commented on MAPREDUCE-3030:
--------------------------------------
This issue is coming due to the changes of MAPREDUCE-2965.
{code:title=NodeId.java |borderStyle=solid}
public boolean equals(Object obj) {
if (this == obj)
return true;
if (!super.equals(obj))
return false;
if (getClass() != obj.getClass())
return false;
NodeId other = (NodeId) obj;
if (!this.getHost().equals(other.getHost()))
return false;
if (this.getPort() != other.getPort())
return false;
return true;
}
{code}
Here the super.equals(obj) is always returning false, nodeId are becoming not
equal even if the states are equal.
RMNode rmNode = this.rmContext.getRMNodes().get(nodeId) in
ResourceTrackerService.java is always returning null, even if the node was
registered already and it is causing to log continuously in resource manager
logs. Due to this submitted application progress is always 0.
> RM is not processing heartbeat and continuously giving the message 'Node not
> found rebooting'
> ---------------------------------------------------------------------------------------------
>
> Key: MAPREDUCE-3030
> URL: https://issues.apache.org/jira/browse/MAPREDUCE-3030
> Project: Hadoop Map/Reduce
> Issue Type: Bug
> Components: resourcemanager
> Affects Versions: 0.24.0
> Reporter: Devaraj K
> Assignee: Devaraj K
> Priority: Blocker
>
> {code:title=Node Manager Logs|borderStyle=solid}
> 2011-09-19 13:39:29,816 INFO webapp.WebApps (WebApps.java:start(162)) -
> Registered webapp guice modules
> 2011-09-19 13:39:29,817 INFO service.AbstractService
> (AbstractService.java:start(61)) -
> Service:org.apache.hadoop.yarn.server.nodemanager.webapp.WebServer is started.
> 2011-09-19 13:39:29,818 INFO service.AbstractService
> (AbstractService.java:start(61)) - Service:Dispatcher is started.
> 2011-09-19 13:39:29,819 INFO nodemanager.NodeStatusUpdaterImpl
> (NodeStatusUpdaterImpl.java:start(133)) - Configured ContainerManager Address
> is 10.18.52.124:45454
> 2011-09-19 13:39:29,819 INFO ipc.YarnRPC (YarnRPC.java:create(47)) -
> Creating YarnRPC for org.apache.hadoop.yarn.ipc.HadoopYarnProtoRPC
> 2011-09-19 13:39:29,822 INFO ipc.HadoopYarnRPC
> (HadoopYarnProtoRPC.java:getProxy(49)) - Creating a HadoopYarnProtoRpc proxy
> for protocol interface org.apache.hadoop.yarn.server.api.ResourceTracker
> 2011-09-19 13:39:29,862 INFO nodemanager.NodeStatusUpdaterImpl
> (NodeStatusUpdaterImpl.java:registerWithRM(165)) - Connected to
> ResourceManager at 0.0.0.0:8025
> 2011-09-19 13:39:30,369 INFO nodemanager.NodeStatusUpdaterImpl
> (NodeStatusUpdaterImpl.java:registerWithRM(189)) - Registered with
> ResourceManager as 10.18.52.124:45454 with total resource of memory: 8192,
> 2011-09-19 13:39:30,369 INFO service.AbstractService
> (AbstractService.java:start(61)) -
> Service:org.apache.hadoop.yarn.server.nodemanager.NodeStatusUpdaterImpl is
> started.
> 2011-09-19 13:39:30,371 INFO service.AbstractService
> (AbstractService.java:start(61)) -
> Service:org.apache.hadoop.yarn.server.nodemanager.NodeManager is started.
> {code}
> {code:title=Resource Manager Logs|borderStyle=solid}
> 2011-09-19 14:01:03,238 INFO resourcemanager.ResourceTrackerService
> (ResourceTrackerService.java:nodeHeartbeat(201)) - Node not found rebooting
> 10.18.52.124:45454
> Call:
> protocol=org.apache.hadoop.yarn.proto.ResourceTracker$ResourceTrackerService$BlockingInterface,
> method=nodeHeartbeat
> 2011-09-19 14:01:04,240 INFO resourcemanager.ResourceTrackerService
> (ResourceTrackerService.java:nodeHeartbeat(201)) - Node not found rebooting
> 10.18.52.124:45454
> Call:
> protocol=org.apache.hadoop.yarn.proto.ResourceTracker$ResourceTrackerService$BlockingInterface,
> method=nodeHeartbeat
> 2011-09-19 14:01:05,242 INFO resourcemanager.ResourceTrackerService
> (ResourceTrackerService.java:nodeHeartbeat(201)) - Node not found rebooting
> 10.18.52.124:45454
> Call:
> protocol=org.apache.hadoop.yarn.proto.ResourceTracker$ResourceTrackerService$BlockingInterface,
> method=nodeHeartbeat
> 2011-09-19 14:01:06,244 INFO resourcemanager.ResourceTrackerService
> (ResourceTrackerService.java:nodeHeartbeat(201)) - Node not found rebooting
> 10.18.52.124:45454
> Call:
> protocol=org.apache.hadoop.yarn.proto.ResourceTracker$ResourceTrackerService$BlockingInterface,
> method=nodeHeartbeat
> 2011-09-19 14:01:07,246 INFO resourcemanager.ResourceTrackerService
> (ResourceTrackerService.java:nodeHeartbeat(201)) - Node not found rebooting
> 10.18.52.124:45454
> Call:
> protocol=org.apache.hadoop.yarn.proto.ResourceTracker$ResourceTrackerService$BlockingInterface,
> method=nodeHeartbeat
> 2011-09-19 14:01:08,247 INFO resourcemanager.ResourceTrackerService
> (ResourceTrackerService.java:nodeHeartbeat(201)) - Node not found rebooting
> 10.18.52.124:45454
> {code}
> Node Manager is registered with Resource manager and the for every heartbeat,
> it is printing the above message.
--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira