Prabhu Joseph created YARN-11457:
------------------------------------
Summary: NodeManager Resource Leak when handling a container log
with colon
Key: YARN-11457
URL: https://issues.apache.org/jira/browse/YARN-11457
Project: Hadoop YARN
Issue Type: Bug
Components: nodemanager
Affects Versions: 3.3.3
Reporter: Prabhu Joseph
Assignee: Vineeth Naroju
Attachments: Screenshot 2023-03-16 at 1.02.22 PM.png, Screenshot
2023-03-16 at 1.02.45 PM.png, Screenshot 2023-03-16 at 1.02.57 PM.png
NodeManager leaks resources when handling a container log whose file name
contains a colon. The illegal file name is not handled, which leads to a
resource leak on the NodeManager side.
{code:java}
2023-03-14 11:03:53,390 WARN org.apache.hadoop.util.concurrent.ExecutorHelper (ContainersLauncher #2683): Caught exception in thread ContainersLauncher #2683:
java.lang.IllegalArgumentException: java.net.URISyntaxException: Relative path in absolute URI: taskmanager.log.2023-03-14 09:44-1
    at org.apache.hadoop.fs.Path.initialize(Path.java:263)
    at org.apache.hadoop.fs.Path.<init>(Path.java:221)
    at org.apache.hadoop.fs.Path.<init>(Path.java:129)
    at org.apache.hadoop.fs.Globber.doGlob(Globber.java:270)
    at org.apache.hadoop.fs.Globber.glob(Globber.java:149)
    at org.apache.hadoop.fs.FileSystem.globStatus(FileSystem.java:2096)
    at org.apache.hadoop.fs.FileSystem.globStatus(FileSystem.java:2078)
    at org.apache.hadoop.yarn.server.nodemanager.containermanager.launcher.ContainerLaunch.handleContainerExitWithFailure(ContainerLaunch.java:653)
    at org.apache.hadoop.yarn.server.nodemanager.containermanager.launcher.ContainerLaunch.handleContainerExitCode(ContainerLaunch.java:593)
    at org.apache.hadoop.yarn.server.nodemanager.containermanager.launcher.ContainerLaunch.call(ContainerLaunch.java:337)
    at org.apache.hadoop.yarn.server.nodemanager.containermanager.launcher.ContainerLaunch.call(ContainerLaunch.java:101)
    at java.util.concurrent.FutureTask.run(FutureTask.java:266)
    at java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1149)
    at java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:624)
    at java.lang.Thread.run(Thread.java:750)
Caused by: java.net.URISyntaxException: Relative path in absolute URI: taskmanager.log.2023-03-14 09:44-1
    at java.net.URI.checkPath(URI.java:1823)
    at java.net.URI.<init>(URI.java:745)
    at org.apache.hadoop.fs.Path.initialize(Path.java:260)
    ... 14 more
{code}
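The exception in the trace can be reproduced without Hadoop. The sketch below is a simplified stand-in for the path parsing, not the actual org.apache.hadoop.fs.Path code: it assumes that the text before the first colon is treated as a URI scheme when no slash precedes it, which leaves "44-1" as a relative path. The class and method names (ColonPathDemo, tryBuildUri) are hypothetical.

```java
import java.net.URI;
import java.net.URISyntaxException;

class ColonPathDemo {
    /**
     * Simplified imitation of scheme detection (an assumption, not Hadoop's code):
     * text before the first ':' becomes the scheme unless a '/' comes first.
     */
    static String tryBuildUri(String name) {
        String scheme = null;
        String path = name;
        int colon = name.indexOf(':');
        int slash = name.indexOf('/');
        if (colon != -1 && (slash == -1 || colon < slash)) {
            scheme = name.substring(0, colon);
            path = name.substring(colon + 1);
        }
        try {
            // java.net.URI rejects a scheme combined with a path that does not start with '/'.
            return new URI(scheme, null, path, null, null).toString();
        } catch (URISyntaxException e) {
            return "ERROR: " + e.getMessage();
        }
    }

    public static void main(String[] args) {
        // File name from the log above: the colon in "09:44" is taken as a scheme separator.
        System.out.println(tryBuildUri("taskmanager.log.2023-03-14 09:44-1"));
        // A name without a colon has no scheme and is accepted.
        System.out.println(tryBuildUri("taskmanager.log"));
    }
}
```

The first call reports a "Relative path in absolute URI" error, while the second returns the name unchanged.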
The NodeManager status details show the application stuck in
FINISHING_CONTAINER_WAIT and its containers stuck in the KILLING state.
!Screenshot 2023-03-16 at 1.02.57 PM.png|height=100,width=250!
!Screenshot 2023-03-16 at 1.02.45 PM.png|height=100,width=250!
!Screenshot 2023-03-16 at 1.02.22 PM.png|height=250,width=250!
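One possible way to keep a failure like this from blocking cleanup is to treat diagnostics collection as best effort. The following is only a sketch of that pattern, not the fix applied for this issue and not NodeManager code: ExitHandlingSketch, handleExit and readTail are hypothetical names, and the list of strings stands in for the container state transitions.

```java
import java.util.ArrayList;
import java.util.List;

class ExitHandlingSketch {
    /** Hypothetical reader that rejects names containing a colon, mimicking the failure. */
    static String readTail(String logName) {
        if (logName.indexOf(':') != -1) {
            throw new IllegalArgumentException("illegal log name: " + logName);
        }
        return "tail of " + logName;
    }

    /** Collects diagnostics on a best-effort basis and always records the cleanup step. */
    static List<String> handleExit(String logName) {
        List<String> events = new ArrayList<>();
        try {
            events.add("diagnostics: " + readTail(logName));
        } catch (IllegalArgumentException e) {
            // A bad file name only costs the diagnostics, not the cleanup.
            events.add("diagnostics skipped: " + e.getMessage());
        } finally {
            events.add("cleanup dispatched");
        }
        return events;
    }

    public static void main(String[] args) {
        System.out.println(handleExit("taskmanager.log.2023-03-14 09:44-1"));
        System.out.println(handleExit("taskmanager.log"));
    }
}
```

In both calls the last recorded event is the cleanup step, whether or not the log name could be read.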