[jira] [Commented] (YARN-8250) Create another implementation of ContainerScheduler to support NM overallocation

Arun Suresh (JIRA) Fri, 08 Jun 2018 14:48:14 -0700


    [ 
https://issues.apache.org/jira/browse/YARN-8250?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16506607#comment-16506607
 ]


Arun Suresh commented on YARN-8250:
-----------------------------------

Apologize for the late reply.

{quote}
3) Upon any container completed or finished event, do not try to launch any 
container.

4) Introduce a periodic check (in ContainersMonitor thread) that launches 
OPPORTUNISTIC container. Ideally, the period is configurable so that the 
latency to launch OPPORTUNISTIC containers can be reduced.
{quote}

My only issue is, what would 3) accomplish ? Again, point of opportunistic 
containers is to have it start up as fast as possible. If there is capacity at 
the time a container completes AND there are no G containers waiting to start, 
why not start the first O container in queue ?

Forgive me if my understanding is a bit off, but I am guessing the point of the 
JIRA is to ensure G container startup time is not impacted right ? Wouldnt a 
simple approach be: Check if container is opportunistic, and if container is to 
be killed and if over-allocation is turned on, assume 
{{sleep-delay-before-sigkill.ms}} == 0. This will ensure 'kill -9' is called 
immediately.

Please do not assume I am trying to 'block' via arbitrary argument :) I am just 
saying, we probably need a more data-driven approach while making changes. 
[~haibochen], can you provide some numbers to demonstrate the container 
start-time deterioration ? Just wanted to know how much of an issue it is. In 
our clusters, we pause containers and we've never had much of a problem with G 
container startup times, because of the queuing.

[~kkaranasos], thoughts ?

> Create another implementation of ContainerScheduler to support NM 
> overallocation
> --------------------------------------------------------------------------------
>
>                 Key: YARN-8250
>                 URL: https://issues.apache.org/jira/browse/YARN-8250
>             Project: Hadoop YARN
>          Issue Type: Sub-task
>            Reporter: Haibo Chen
>            Assignee: Haibo Chen
>            Priority: Major
>         Attachments: YARN-8250-YARN-1011.00.patch, 
> YARN-8250-YARN-1011.01.patch, YARN-8250-YARN-1011.02.patch
>
>
> YARN-6675 adds NM over-allocation support by modifying the existing 
> ContainerScheduler and providing a utilizationBased resource tracker.
> However, the implementation adds a lot of complexity to ContainerScheduler, 
> and future tweak of over-allocation strategy based on how much containers 
> have been launched is even more complicated.
> As such, this Jira proposes a new ContainerScheduler that always launch 
> guaranteed containers immediately and queues opportunistic containers. It 
> relies on a periodical check to launch opportunistic containers. 



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)

---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

[jira] [Commented] (YARN-8250) Create another implementation of ContainerScheduler to support NM overallocation

Reply via email to