Jason Lowe created YARN-3136:
Summary: getTransferredContainers can be a bottleneck during AM
Project: Hadoop YARN
Issue Type: Bug
Affects Versions: 2.6.0
Reporter: Jason Lowe
While examining RM stack traces on a busy cluster I noticed a pattern of AMs
stuck waiting for the scheduler lock trying to call getTransferredContainers.
The scheduler lock is highly contended, especially on a large cluster with many
nodes heartbeating, and it would be nice if we could find a way to eliminate
the need to grab this lock during this call. We've already done similar work
during AM allocate calls to make sure they don't needlessly grab the scheduler
lock, and it would be good to do so here as well, if possible.
This message was sent by Atlassian JIRA