Hey Guys,

My ApplicationMaster is being killed by the NodeManager because of memory 
consumption, and I don't understand why. I'm using -Xmx512M, and setting my 
resource request to 2048.


    .addCommand("java -Xmx512M -cp './package/*' kafka.yarn.ApplicationMaster " 
...

    ...

    private var memory = 2048

    resource.setMemory(memory)
    containerCtx.setResource(resource)
    containerCtx.setCommands(cmds.toList)
    containerCtx.setLocalResources(Collections.singletonMap("package", 
packageResource))
    appCtx.setApplicationId(appId)
    appCtx.setUser(user.getShortUserName)
    appCtx.setAMContainerSpec(containerCtx)
    request.setApplicationSubmissionContext(appCtx)
    applicationsManager.submitApplication(request)

When this runs, I see (in my NodeManager's logs):


2011-09-21 09:35:19,112 INFO  monitor.ContainersMonitorImpl 
(ContainersMonitorImpl.java:run(402)) - Memory usage of ProcessTree 28134 for 
container-id container_1316559026783_0003_01_000001 : Virtual 2260938752 bytes, 
limit : 2147483648 bytes; Physical 71540736 bytes, limit -1 bytes
2011-09-21 09:35:19,112 WARN  monitor.ContainersMonitorImpl 
(ContainersMonitorImpl.java:isProcessTreeOverLimit(289)) - Process tree for 
container: container_1316559026783_0003_01_000001 has processes older than 1 
iteration running over the configured limit. Limit=2147483648, current usage = 
2260938752
2011-09-21 09:35:19,113 WARN  monitor.ContainersMonitorImpl 
(ContainersMonitorImpl.java:run(453)) - Container 
[pid=28134,containerID=container_1316559026783_0003_01_000001] is running 
beyond memory-limits. Current usage : 2260938752bytes. Limit : 2147483648bytes. 
Killing container. 
Dump of the process-tree for container_1316559026783_0003_01_000001 : 
        |- PID PPID PGRPID SESSID CMD_NAME USER_MODE_TIME(MILLIS) 
SYSTEM_TIME(MILLIS) VMEM_USAGE(BYTES) RSSMEM_USAGE(PAGES) FULL_CMD_LINE
        |- 28134 25886 28134 28134 (bash) 0 0 108638208 303 /bin/bash -c java 
-Xmx512M -cp './package/*' kafka.yarn.ApplicationMaster 3 1 1316559026783 
com.linkedin.TODO 1 
1>/tmp/logs/application_1316559026783_0003/container_1316559026783_0003_01_000001/stdout
 
2>/tmp/logs/application_1316559026783_0003/container_1316559026783_0003_01_000001/stderr
  
        |- 28137 28134 28134 28134 (java) 92 3 2152300544 17163 java -Xmx512M 
-cp ./package/* kafka.yarn.ApplicationMaster 3 1 1316559026783 
com.linkedin.TODO 1 

2011-09-21 09:35:19,113 INFO  monitor.ContainersMonitorImpl 
(ContainersMonitorImpl.java:run(463)) - Removed ProcessTree with root 28134

It appears that YARN is honoring my 2048 command, yet my process is somehow 
taking 2260938752 bytes. I don't think that I'm using nearly that much in 
permgen, and my heap is limited to 512. I don't have any JNI stuff running 
(that I know of), so it's unclear to me what's going on here. The only thing 
that I can think of is that Java's Runtime exec is forking, and copying its 
entire JVM memory footprint for the fork.

Has anyone seen this? Am I doing something dumb?

Thanks!
Chris

Reply via email to