[ 
https://issues.apache.org/jira/browse/MAPREDUCE-3030?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13107767#comment-13107767
 ] 

Devaraj K commented on MAPREDUCE-3030:
--------------------------------------


This issue is coming due to the changes of MAPREDUCE-2965. 


{code:title=NodeId.java |borderStyle=solid}
public boolean equals(Object obj) {
    if (this == obj)
      return true;
    if (!super.equals(obj))
      return false;
    if (getClass() != obj.getClass())
      return false;
    NodeId other = (NodeId) obj;
    if (!this.getHost().equals(other.getHost()))
      return false;
    if (this.getPort() != other.getPort())
      return false;
    return true;
  }
{code}

Here the super.equals(obj) is always returning false, nodeId are becoming not 
equal even if the states are equal. 

RMNode rmNode = this.rmContext.getRMNodes().get(nodeId) in 
ResourceTrackerService.java is always returning null, even if the node was 
registered already and it is causing to log continuously in resource manager 
logs. Due to this submitted application progress is always 0.


> RM is not processing heartbeat and continuously giving the message 'Node not 
> found rebooting'
> ---------------------------------------------------------------------------------------------
>
>                 Key: MAPREDUCE-3030
>                 URL: https://issues.apache.org/jira/browse/MAPREDUCE-3030
>             Project: Hadoop Map/Reduce
>          Issue Type: Bug
>          Components: resourcemanager
>    Affects Versions: 0.24.0
>            Reporter: Devaraj K
>            Assignee: Devaraj K
>            Priority: Blocker
>
> {code:title=Node Manager Logs|borderStyle=solid}
> 2011-09-19 13:39:29,816 INFO  webapp.WebApps (WebApps.java:start(162)) - 
> Registered webapp guice modules
> 2011-09-19 13:39:29,817 INFO  service.AbstractService 
> (AbstractService.java:start(61)) - 
> Service:org.apache.hadoop.yarn.server.nodemanager.webapp.WebServer is started.
> 2011-09-19 13:39:29,818 INFO  service.AbstractService 
> (AbstractService.java:start(61)) - Service:Dispatcher is started.
> 2011-09-19 13:39:29,819 INFO  nodemanager.NodeStatusUpdaterImpl 
> (NodeStatusUpdaterImpl.java:start(133)) - Configured ContainerManager Address 
> is 10.18.52.124:45454
> 2011-09-19 13:39:29,819 INFO  ipc.YarnRPC (YarnRPC.java:create(47)) - 
> Creating YarnRPC for org.apache.hadoop.yarn.ipc.HadoopYarnProtoRPC
> 2011-09-19 13:39:29,822 INFO  ipc.HadoopYarnRPC 
> (HadoopYarnProtoRPC.java:getProxy(49)) - Creating a HadoopYarnProtoRpc proxy 
> for protocol interface org.apache.hadoop.yarn.server.api.ResourceTracker
> 2011-09-19 13:39:29,862 INFO  nodemanager.NodeStatusUpdaterImpl 
> (NodeStatusUpdaterImpl.java:registerWithRM(165)) - Connected to 
> ResourceManager at 0.0.0.0:8025
> 2011-09-19 13:39:30,369 INFO  nodemanager.NodeStatusUpdaterImpl 
> (NodeStatusUpdaterImpl.java:registerWithRM(189)) - Registered with 
> ResourceManager as 10.18.52.124:45454 with total resource of memory: 8192, 
> 2011-09-19 13:39:30,369 INFO  service.AbstractService 
> (AbstractService.java:start(61)) - 
> Service:org.apache.hadoop.yarn.server.nodemanager.NodeStatusUpdaterImpl is 
> started.
> 2011-09-19 13:39:30,371 INFO  service.AbstractService 
> (AbstractService.java:start(61)) - 
> Service:org.apache.hadoop.yarn.server.nodemanager.NodeManager is started.
> {code}
> {code:title=Resource Manager Logs|borderStyle=solid}
> 2011-09-19 14:01:03,238 INFO  resourcemanager.ResourceTrackerService 
> (ResourceTrackerService.java:nodeHeartbeat(201)) - Node not found rebooting 
> 10.18.52.124:45454
> Call: 
> protocol=org.apache.hadoop.yarn.proto.ResourceTracker$ResourceTrackerService$BlockingInterface,
>  method=nodeHeartbeat
> 2011-09-19 14:01:04,240 INFO  resourcemanager.ResourceTrackerService 
> (ResourceTrackerService.java:nodeHeartbeat(201)) - Node not found rebooting 
> 10.18.52.124:45454
> Call: 
> protocol=org.apache.hadoop.yarn.proto.ResourceTracker$ResourceTrackerService$BlockingInterface,
>  method=nodeHeartbeat
> 2011-09-19 14:01:05,242 INFO  resourcemanager.ResourceTrackerService 
> (ResourceTrackerService.java:nodeHeartbeat(201)) - Node not found rebooting 
> 10.18.52.124:45454
> Call: 
> protocol=org.apache.hadoop.yarn.proto.ResourceTracker$ResourceTrackerService$BlockingInterface,
>  method=nodeHeartbeat
> 2011-09-19 14:01:06,244 INFO  resourcemanager.ResourceTrackerService 
> (ResourceTrackerService.java:nodeHeartbeat(201)) - Node not found rebooting 
> 10.18.52.124:45454
> Call: 
> protocol=org.apache.hadoop.yarn.proto.ResourceTracker$ResourceTrackerService$BlockingInterface,
>  method=nodeHeartbeat
> 2011-09-19 14:01:07,246 INFO  resourcemanager.ResourceTrackerService 
> (ResourceTrackerService.java:nodeHeartbeat(201)) - Node not found rebooting 
> 10.18.52.124:45454
> Call: 
> protocol=org.apache.hadoop.yarn.proto.ResourceTracker$ResourceTrackerService$BlockingInterface,
>  method=nodeHeartbeat
> 2011-09-19 14:01:08,247 INFO  resourcemanager.ResourceTrackerService 
> (ResourceTrackerService.java:nodeHeartbeat(201)) - Node not found rebooting 
> 10.18.52.124:45454
> {code}
> Node Manager is registered with Resource manager and the for every heartbeat, 
> it is printing the above message.

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira

        

Reply via email to