Swapnil Daingade created MYRIAD-127:
---------------------------------------

             Summary: Recommend nodes to launch NodeManagers optimized for 
locality based on input data for Yarn jobs
                 Key: MYRIAD-127
                 URL: https://issues.apache.org/jira/browse/MYRIAD-127
             Project: Myriad
          Issue Type: New Feature
            Reporter: Swapnil Daingade
            Assignee: Swapnil Daingade


Hadoop/Yarn works on the principle of taking computation to the data. Thus data 
locality is important for getting optimal performance. When a yarn job is 
launched, the user specifies the dfs location of the data being operated on. 
Looking at all these locations (from various running yarn jobs) being operated 
on, we can try to predict the best location for launching NodeManagers 
optimized for locality.

For. e.g We have a 20 node mesos cluster and a user has a 5 node yarn cluster 
running a few jobs. Looking at the data being operated on by the yarn jobs, we 
can come up with a recommendation for which 5 nodes to launch NodeManagers on 
optimized for locality. 

In a more autonomous mode, we could flex down some of the existing NM's and 
flexup new ones on new nodes.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

Reply via email to