Ahmed Hussein created YARN-9597:
-----------------------------------

             Summary: Memory efficiency in speculator 
                 Key: YARN-9597
                 URL: https://issues.apache.org/jira/browse/YARN-9597
             Project: Hadoop YARN
          Issue Type: Improvement
            Reporter: Ahmed Hussein


The data structures in speculator and runtime-estimator are bloating. Data 
elements such as (taskID, TA-ID, task stats, tasks speculated, tasks 
finished..etc) are added to the concurrent maps but never removed.

For long running jobs, there are couple of issues:
 # memory leakage: the speculator memory usage increases over time. 
 # performance: keeping large structures in the heap affects the performance 
due to locality and cache misses.

*Suggested Fixes:*

- When a TA transitions to {{MoveContainerToSucceededFinishingTransition}}, the 
TA notifies the speculator. The latter handles the event by cleaning the 
internal structure accordingly.
- When a task transitions is failed/killed, the speculator is notified to clean 
the internal data structure.

 



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)

---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

Reply via email to