[ 
https://issues.apache.org/jira/browse/YARN-7399?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16261167#comment-16261167
 ] 

Billie Rinaldi commented on YARN-7399:
--------------------------------------

[~eyang], thanks for providing the image, it makes the goals very clear.

What would the ServiceTemplate look like? I assume the recommended templates 
are for displaying in the UI. How are recommended templates populated (that is, 
do we need a method for adding service templates for the recommended list)? 
Will all users have the same recommended templates?

For the update methods, will we provide a way to merge the 
Service/ServiceTemplate into an existing service, or will it just overwrite the 
existing one? If it is overwriting, perhaps put would be a better name for the 
methods. Would it make sense to support versioning of Service or 
ServiceTemplate objects? We don't do that now, so it could be left for future 
discussion/implementation.

Do the get/update/remove methods refer to one service instance? In this case 
they should have a user parameter as well?

> Yarn services metadata storage improvement
> ------------------------------------------
>
>                 Key: YARN-7399
>                 URL: https://issues.apache.org/jira/browse/YARN-7399
>             Project: Hadoop YARN
>          Issue Type: Improvement
>          Components: yarn-native-services
>            Reporter: Eric Yang
>            Assignee: Eric Yang
>         Attachments: YARN-7399.001.patch, YARN-7399.png
>
>
> In Slider, metadata is stored in user's home directory. Slider command line 
> interface interacts with HDFS directly to list deployed applications and 
> invoke YARN API or HDFS API to provide information to user. This design works 
> for a single user manage his/her own applications. When this design has been 
> ported to Yarn services, it becomes apparent that this design is difficult to 
> list all deployed applications on Hadoop cluster for administrator to manage 
> applications. Resource Manager needs to crawl through every user's home 
> directory to compile metadata about deployed applications. This can trigger 
> high load on namenode to list hundreds or thousands of list directory calls 
> owned by different users. Hence, it might be best to centralize the metadata 
> storage to Solr or HBase to reduce number of IO calls to namenode for manage 
> applications.
> In Slider, one application is composed of metainfo, specifications in json, 
> and payload of zip file that contains application code and deployment code. 
> Both meta information, and zip file payload are stored in the same 
> application directory in HDFS. This works well for distributed applications 
> without central application manager that oversee all application.
> In the next generation of application management, we like to centralize 
> metainfo and specifications in json to a centralized storage managed by YARN 
> user, and keep the payload zip file in user's home directory or in docker 
> registry. This arrangement can provide a faster lookup for metainfo when we 
> list all deployed applications and services on YARN dashboard.
> When we centralize metainfo to YARN user, we also need to build ACL to 
> enforce who can manage applications, and make update. The current proposal is:
> yarn.admin.acl - list of groups that can submit/reconfigure/pause/kill all 
> applications
> normal users - submit/reconfigure/pause/kill his/her own applications



--
This message was sent by Atlassian JIRA
(v6.4.14#64029)

---------------------------------------------------------------------
To unsubscribe, e-mail: yarn-issues-unsubscr...@hadoop.apache.org
For additional commands, e-mail: yarn-issues-h...@hadoop.apache.org

Reply via email to