[jira] [Commented] (YARN-5889) Improve user-limit calculation in capacity scheduler

Wangda Tan (JIRA) Mon, 09 Jan 2017 08:39:15 -0800

    [ 
https://issues.apache.org/jira/browse/YARN-5889?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15812187#comment-15812187
 ]


Wangda Tan commented on YARN-5889:
----------------------------------

Thanks [~sunilg] for providing the patch and reviews from [~eepayne].

Quickly scanned the patch,

If everybody agrees with approach in my comment: 
https://issues.apache.org/jira/browse/YARN-5889?focusedCommentId=15749129&page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#comment-15749129

{code} 
active-user-limit = max(
   resource-used-by-active-users / #active-users,
   queue-capacity * MULP
)
{code} 

We can:
1) Update {{resource-used-by-active-users}} when any user enters/exits set of 
active users.
2) Update #active-users (using approaches like ActiveUsersManager approach)

Both of #1/#2 can be done within the same thread, and since we will have an 
identical UL for all the active users, I think this can be done without adding 
a new thread. Please correct me if I'm wrong.

It looks like Sunil and Eric have different implementation suggestions, I have 
looked at details of the two approaches. However I think the goals are:
a. Can always get up-to-date UL
b. Avoid re-computation of UL as much as possible.
Any approach can achieve above goals is good to me.

And in addition, we can add a UsersManager to LQ to manager all user-related 
information, such as user-limit / active-user-limit / #active-users. 
Ideally, LQ/PCPP should get activeUserLimit / userLimit from UsersManager

And for last, we can look at logics for #active-users per partition, now we 
have an identical #active-users for all partition, which causes some problems. 
Since we will make a major changes to logics around this, it could be a good 
chance to fix the problem as well.

> Improve user-limit calculation in capacity scheduler
> ----------------------------------------------------
>
>                 Key: YARN-5889
>                 URL: https://issues.apache.org/jira/browse/YARN-5889
>             Project: Hadoop YARN
>          Issue Type: Bug
>          Components: capacity scheduler
>            Reporter: Sunil G
>            Assignee: Sunil G
>         Attachments: YARN-5889.0001.patch, 
> YARN-5889.0001.suggested.patchnotes, YARN-5889.v0.patch, YARN-5889.v1.patch, 
> YARN-5889.v2.patch
>
>
> Currently user-limit is computed during every heartbeat allocation cycle with 
> a write lock. To improve performance, this tickets is focussing on moving 
> user-limit calculation out of heartbeat allocation flow.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

[jira] [Commented] (YARN-5889) Improve user-limit calculation in capacity scheduler

Reply via email to