Szilard Nemeth created YARN-9370:
------------------------------------
Summary: Better logging in recoverAssignedGpus in class
GpuResourceAllocator
Key: YARN-9370
URL: https://issues.apache.org/jira/browse/YARN-9370
Project: Hadoop YARN
Issue Type: Improvement
Reporter: Szilard Nemeth
The last line of
org.apache.hadoop.yarn.server.nodemanager.containermanager.linux.resources.gpu.GpuResourceAllocator#recoverAssignedGpus
is this:
{code:java}
usedDevices.put(gpuDevice, containerId);
{code}
We should have an info (or if not info, at least a debug) level log to indicate
that a container is allocated to a GPU device during recovery.
Please also check recovery related code, maybe there can be some room for
improvement around logging.
--
This message was sent by Atlassian JIRA
(v7.6.3#76005)
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]