Prabhu Joseph created YARN-11457:
------------------------------------

             Summary: NodeManager Resource Leak when handling a container log 
with colon 
                 Key: YARN-11457
                 URL: https://issues.apache.org/jira/browse/YARN-11457
             Project: Hadoop YARN
          Issue Type: Bug
          Components: nodemanager
    Affects Versions: 3.3.3
            Reporter: Prabhu Joseph
            Assignee: Vineeth Naroju
         Attachments: Screenshot 2023-03-16 at 1.02.22 PM.png, Screenshot 
2023-03-16 at 1.02.45 PM.png, Screenshot 2023-03-16 at 1.02.57 PM.png

NodeManager Leaks the resources when handling a container log with colon. The 
Illegal file name is not handled and leads to resource leak at NodeManager side.

 
{code:java}
2023-03-14 11:03:53,390 WARN org.apache.hadoop.util.concurrent.ExecutorHelper 
(ContainersLauncher #2683): Caught exception in thread ContainersLauncher 
#2683: 
java.lang.IllegalArgumentException: java.net.URISyntaxException: Relative path 
in absolute URI: taskmanager.log.2023-03-14 09:44-1
        at org.apache.hadoop.fs.Path.initialize(Path.java:263)
        at org.apache.hadoop.fs.Path.<init>(Path.java:221)
        at org.apache.hadoop.fs.Path.<init>(Path.java:129)
        at org.apache.hadoop.fs.Globber.doGlob(Globber.java:270)
        at org.apache.hadoop.fs.Globber.glob(Globber.java:149)
        at org.apache.hadoop.fs.FileSystem.globStatus(FileSystem.java:2096)
        at org.apache.hadoop.fs.FileSystem.globStatus(FileSystem.java:2078)
        at 
org.apache.hadoop.yarn.server.nodemanager.containermanager.launcher.ContainerLaunch.handleContainerExitWithFailure(ContainerLaunch.java:653)
        at 
org.apache.hadoop.yarn.server.nodemanager.containermanager.launcher.ContainerLaunch.handleContainerExitCode(ContainerLaunch.java:593)
        at 
org.apache.hadoop.yarn.server.nodemanager.containermanager.launcher.ContainerLaunch.call(ContainerLaunch.java:337)
        at 
org.apache.hadoop.yarn.server.nodemanager.containermanager.launcher.ContainerLaunch.call(ContainerLaunch.java:101)
        at java.util.concurrent.FutureTask.run(FutureTask.java:266)
        at 
java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1149)
        at 
java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:624)
        at java.lang.Thread.run(Thread.java:750)
Caused by: java.net.URISyntaxException: Relative path in absolute URI: 
taskmanager.log.2023-03-14 09:44-1
        at java.net.URI.checkPath(URI.java:1823)
        at java.net.URI.<init>(URI.java:745)
        at org.apache.hadoop.fs.Path.initialize(Path.java:260)
        ... 14 more 
{code}

NodeManager status details shows Application stuck in FINISHING_CONTAINER_WAIT, 
Containers stuck in KILLING state.

 !Screenshot 2023-03-16 at 1.02.57 PM.png|height=100,width=250!

 !Screenshot 2023-03-16 at 1.02.45 PM.png|height=100,width=250!

 !Screenshot 2023-03-16 at 1.02.22 PM.png|height=250,width=250!




--
This message was sent by Atlassian Jira
(v8.20.10#820010)

---------------------------------------------------------------------
To unsubscribe, e-mail: yarn-issues-unsubscr...@hadoop.apache.org
For additional commands, e-mail: yarn-issues-h...@hadoop.apache.org

Reply via email to