[ 
https://issues.apache.org/jira/browse/YARN-3990?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14645567#comment-14645567
 ] 

Bibin A Chundatt commented on YARN-3990:
----------------------------------------

[~rohithsharma]

{code}
2015-07-29 19:39:03,409 | INFO  | ResourceManager Event Processor | Added node 
host-7:26009 clusterResource: <memory:178400, vCores:64> | 
CapacityScheduler.java:1358
2015-07-29 19:39:03,409 | INFO  | AsyncDispatcher event handler | Size of 
event-queue is 3000 | AsyncDispatcher.java:235
2015-07-29 19:39:03,409 | DEBUG | Socket Reader #1 for port 26003 |  got #2125 
| Server.java:1790
2015-07-29 19:39:03,409 | DEBUG | IPC Server handler 7 on 26003 | IPC Server 
handler 7 on 26003: 
org.apache.hadoop.yarn.server.api.ResourceTrackerPB.nodeHeartbeat from 
172.168.100.7:24999 Call#2125 Retry#0 for RpcKind RPC_PROTOCOL_BUFFER | 
Server.java:2058
2015-07-29 19:39:03,410 | DEBUG | IPC Server handler 7 on 26003 | 
PrivilegedAction as:mapred/hadoop.hadoop....@hadoop.com (auth:KERBEROS) 
from:org.apache.hadoop.ipc.Server$Handler.run(Server.java:2082) | 
UserGroupInformation.java:1696
2015-07-29 19:39:03,410 | INFO  | AsyncDispatcher event handler | Size of 
event-queue is 4000 | AsyncDispatcher.java:235
2015-07-29 19:39:03,410 | INFO  | AsyncDispatcher event handler | Size of 
event-queue is 5000 | AsyncDispatcher.java:235
2015-07-29 19:39:03,411 | INFO  | AsyncDispatcher event handler | Size of 
event-queue is 6000 | AsyncDispatcher.java:235
2015-07-29 19:39:03,412 | INFO  | AsyncDispatcher event handler | Size of 
event-queue is 7000 | AsyncDispatcher.java:235
2015-07-29 19:39:03,412 | INFO  | IPC Server handler 7 on 26003 | Size of 
event-queue is 7000 | AsyncDispatcher.java:235
2015-07-29 19:39:03,412 | INFO  | AsyncDispatcher event handler | Size of 
event-queue is 8000 | AsyncDispatcher.java:235
2015-07-29 19:39:03,413 | INFO  | AsyncDispatcher event handler | Size of 
event-queue is 9000 | AsyncDispatcher.java:235
2015-07-29 19:39:03,414 | INFO  | AsyncDispatcher event handler | Size of 
event-queue is 10000 | AsyncDispatcher.java:235
2015-07-29 19:39:03,414 | INFO  | AsyncDispatcher event handler | Size of 
event-queue is 11000 | AsyncDispatcher.java:235
2015-07-29 19:39:03,415 | DEBUG | IPC Server handler 7 on 26003 | Served: 
nodeHeartbeat queueTime= 1 procesingTime= 5 | ProtobufRpcEngine.java:631
2015-07-29 19:39:03,415 | INFO  | AsyncDispatcher event handler | Size of 
event-queue is 12000 | AsyncDispatcher.java:235
2015-07-29 19:39:03,416 | DEBUG | IPC Server handler 7 on 26003 | Adding 
saslServer wrapped token of size 100 as call response. | Server.java:2460
2015-07-29 19:39:03,416 | DEBUG | IPC Server handler 7 on 26003 | IPC Server 
handler 7 on 26003: responding to 
org.apache.hadoop.yarn.server.api.ResourceTrackerPB.nodeHeartbeat from 
172.168.100.7:24999 Call#2125 Retry#0 | Server.java:994
2015-07-29 19:39:03,416 | INFO  | AsyncDispatcher event handler | Size of 
event-queue is 13000 | AsyncDispatcher.java:235
2015-07-29 19:39:03,416 | DEBUG | IPC Server handler 7 on 26003 | IPC Server 
handler 7 on 26003: responding to 
org.apache.hadoop.yarn.server.api.ResourceTrackerPB.nodeHeartbeat from 
172.168.100.7:24999 Call#2125 Retry#0 Wrote 118 bytes. | Server.java:1013
2015-07-29 19:39:03,416 | INFO  | AsyncDispatcher event handler | Size of 
event-queue is 14000 | AsyncDispatcher.java:235
2015-07-29 19:39:03,417 | INFO  | AsyncDispatcher event handler | Size of 
event-queue is 15000 | AsyncDispatcher.java:235
2015-07-29 19:39:03,418 | INFO  | AsyncDispatcher event handler | Size of 
event-queue is 16000 | AsyncDispatcher.java:235
2015-07-29 19:39:03,419 | INFO  | AsyncDispatcher event handler | Size of 
event-queue is 17000 | AsyncDispatcher.java:235
2015-07-29 19:39:03,419 | INFO  | AsyncDispatcher event handler | Size of 
event-queue is 18000 | AsyncDispatcher.java:235
2015-07-29 19:39:03,420 | INFO  | AsyncDispatcher event handler | Size of 
event-queue is 19000 | AsyncDispatcher.java:235
2015-07-29 19:39:03,421 | INFO  | AsyncDispatcher event handler | Size of 
event-queue is 20000 | AsyncDispatcher.java:235
2015-07-29 19:39:03,421 | DEBUG | AsyncDispatcher event handler | Dispatching 
the event 
org.apache.hadoop.yarn.server.resourcemanager.rmapp.RMAppNodeUpdateEvent.EventType:
 NODE_UPDATE | AsyncDispatcher.java:166
2015-07-29 19:39:03,421 | DEBUG | AsyncDispatcher event handler | Processing 
event for application_1438101193238_224125 of type NODE_UPDATE | 
RMAppImpl.java:741
2015-07-29 19:39:03,421 | DEBUG | AsyncDispatcher event handler | Dispatching 
the event 
org.apache.hadoop.yarn.server.resourcemanager.rmapp.RMAppNodeUpdateEvent.EventType:
 NODE_UPDATE | AsyncDispatcher.java:166
2015-07-29 19:39:03,421 | DEBUG | AsyncDispatcher event handler | Processing 
event for application_1438101193238_224126 of type NODE_UPDATE | 
RMAppImpl.java:741
2015-07-29 19:39:03,422 | DEBUG | AsyncDispatcher event handler | Dispatching 
the event 
org.apache.hadoop.yarn.server.resourcemanager.rmapp.RMAppNodeUpdateEvent.EventType:
 NODE_UPDATE | AsyncDispatcher.

{code}
Was able to reproduce the same. Attaching logs

> AsyncDispatcher may overloaded with RMAppNodeUpdateEvent when Node is 
> connected/disconnected
> --------------------------------------------------------------------------------------------
>
>                 Key: YARN-3990
>                 URL: https://issues.apache.org/jira/browse/YARN-3990
>             Project: Hadoop YARN
>          Issue Type: Bug
>          Components: resourcemanager
>            Reporter: Rohith Sharma K S
>            Assignee: Bibin A Chundatt
>            Priority: Critical
>
> Whenever node is added or removed, NodeListManager sends RMAppNodeUpdateEvent 
> to all the applications that are in the rmcontext. But for 
> finished/killed/failed applications it is not required to send these events. 
> Additional check for wheather app is finished/killed/failed would minimizes 
> the unnecessary events
> {code}
>   public void handle(NodesListManagerEvent event) {
>     RMNode eventNode = event.getNode();
>     switch (event.getType()) {
>     case NODE_UNUSABLE:
>       LOG.debug(eventNode + " reported unusable");
>       unusableRMNodesConcurrentSet.add(eventNode);
>       for(RMApp app: rmContext.getRMApps().values()) {
>         this.rmContext
>             .getDispatcher()
>             .getEventHandler()
>             .handle(
>                 new RMAppNodeUpdateEvent(app.getApplicationId(), eventNode,
>                     RMAppNodeUpdateType.NODE_UNUSABLE));
>       }
>       break;
>     case NODE_USABLE:
>       if (unusableRMNodesConcurrentSet.contains(eventNode)) {
>         LOG.debug(eventNode + " reported usable");
>         unusableRMNodesConcurrentSet.remove(eventNode);
>       }
>       for (RMApp app : rmContext.getRMApps().values()) {
>         this.rmContext
>             .getDispatcher()
>             .getEventHandler()
>             .handle(
>                 new RMAppNodeUpdateEvent(app.getApplicationId(), eventNode,
>                     RMAppNodeUpdateType.NODE_USABLE));
>       }
>       break;
>     default:
>       LOG.error("Ignoring invalid eventtype " + event.getType());
>     }
>   }
> {code}



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

Reply via email to