Ahmed Hussein created YARN-9597:
-----------------------------------
Summary: Memory efficiency in speculator
Key: YARN-9597
URL: https://issues.apache.org/jira/browse/YARN-9597
Project: Hadoop YARN
Issue Type: Improvement
Reporter: Ahmed Hussein
The data structures in the speculator and runtime-estimator grow without bound.
Data elements such as taskID, TA-ID, task stats, tasks speculated, and tasks
finished are added to the concurrent maps but never removed.
For long-running jobs, this causes a couple of issues:
# memory leak: the speculator's memory usage increases over time.
# performance: keeping large structures in the heap hurts performance
because of poor locality and cache misses.
*Suggested Fixes:*
- When a TA transitions to {{MoveContainerToSucceededFinishingTransition}}, the
TA notifies the speculator. The speculator handles the event by cleaning up its
internal data structures accordingly.
- When a task transitions to failed/killed, the speculator is notified so that
it can clean up its internal data structures.
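
The cleanup suggested above could look roughly like the following. This is a
minimal sketch only, not the actual speculator code: the class name, the map
layout, and the method names ({{SpeculatorStateSketch}}, {{onAttemptFinished}},
{{onTaskFinished}}) are all hypothetical, and plain strings stand in for the
real task and task-attempt IDs.

```java
import java.util.Map;
import java.util.Set;
import java.util.concurrent.ConcurrentHashMap;

/** Hypothetical stand-in for the speculator's bookkeeping; names are illustrative. */
class SpeculatorStateSketch {
    // Attempt ID -> start time in milliseconds.
    private final Map<String, Long> attemptStartTimes = new ConcurrentHashMap<>();
    // Task ID -> IDs of the attempts that are still being tracked for it.
    private final Map<String, Set<String>> attemptsByTask = new ConcurrentHashMap<>();
    // Tasks for which a speculative attempt has been launched.
    private final Set<String> speculatedTasks = ConcurrentHashMap.newKeySet();

    void onAttemptStarted(String taskId, String attemptId, long startTimeMs) {
        attemptStartTimes.put(attemptId, startTimeMs);
        attemptsByTask
            .computeIfAbsent(taskId, k -> ConcurrentHashMap.newKeySet())
            .add(attemptId);
    }

    void markSpeculated(String taskId) {
        speculatedTasks.add(taskId);
    }

    /** Called when a single attempt reaches a terminal state. */
    void onAttemptFinished(String taskId, String attemptId) {
        attemptStartTimes.remove(attemptId);
        // Returning null from the remapping function removes the task entry
        // once its last tracked attempt is gone.
        attemptsByTask.computeIfPresent(taskId, (k, attempts) -> {
            attempts.remove(attemptId);
            return attempts.isEmpty() ? null : attempts;
        });
    }

    /** Called when a task succeeds, fails, or is killed. */
    void onTaskFinished(String taskId) {
        Set<String> attempts = attemptsByTask.remove(taskId);
        if (attempts != null) {
            for (String attemptId : attempts) {
                attemptStartTimes.remove(attemptId);
            }
        }
        speculatedTasks.remove(taskId);
    }

    int trackedAttempts() { return attemptStartTimes.size(); }
    int trackedTasks() { return attemptsByTask.size(); }
    int speculatedCount() { return speculatedTasks.size(); }
}
```

With removal tied to terminal events, the size of each map tracks the number of
live tasks and attempts rather than the total number ever seen by the job.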
--
This message was sent by Atlassian JIRA
(v7.6.3#76005)