[
https://issues.apache.org/jira/browse/LIVY-866?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17411686#comment-17411686
]
Akshat Bordia commented on LIVY-866:
------------------------------------
Create PR for review: [[LIVY-866] Optimizing Yarn GetApplications Query to
prevent additional load on Yarn and Livy by akshatb1 · Pull Request #327 ·
apache/incubator-livy
(github.com)|https://github.com/apache/incubator-livy/pull/327].
> Optimize Yarn GetApplications Query to prevent additional load on Yarn and
> Livy
> -------------------------------------------------------------------------------
>
> Key: LIVY-866
> URL: https://issues.apache.org/jira/browse/LIVY-866
> Project: Livy
> Issue Type: Improvement
> Components: Server
> Reporter: Akshat Bordia
> Priority: Major
> Time Spent: 10m
> Remaining Estimate: 0h
>
> Currently Livy queries Yarn applications by applicationType : Spark. This
> will put heavy load on Yarn clusters if there are thousands or more Spark
> application in all states (running, finished, failed, queued etc.).
> A better approach would be to query the applications by tags in addition to
> job type since Livy only needs to track application with certain application
> tags. However, YarnClient does not expose any API to query applications by
> tags.
>
> We can extend the YarnClientImpl which would allow us to use protected
> rmClient in YarnClient and we can implement getApplications method with
> GetApplicationsRequest where we can include tags.
--
This message was sent by Atlassian Jira
(v8.3.4#803005)