Sanil Jain created SAMZA-2579:
---------------------------------

             Summary: Force Restart Container feature for Container Placements 
                 Key: SAMZA-2579
                 URL: https://issues.apache.org/jira/browse/SAMZA-2579
             Project: Samza
          Issue Type: New Feature
            Reporter: Sanil Jain
            Assignee: Sanil Jain


The current restart ability works in the following way:
1. Tries to fetch resources on a host
2. Stops the active container if resources are acquired
3. Tries to start the container on the acquired host
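
The three steps above can be sketched as a small model. This is only an
illustration, not Samza code: the Host class, the restart function and the
resource units are hypothetical names introduced here to show the ordering
(acquire first, then stop, then start).

```python
from dataclasses import dataclass, field


@dataclass
class Host:
    """Hypothetical model of a host: total capacity and per-container usage."""
    capacity: int
    running: dict = field(default_factory=dict)  # container id -> resource units

    def free(self) -> int:
        """Resource units not currently held by any running container."""
        return self.capacity - sum(self.running.values())


def restart(host: Host, container_id: str) -> bool:
    """Assumed model of the current flow: acquire, then stop, then start."""
    needed = host.running[container_id]
    # Step 1: try to fetch resources while the old container is still running.
    if host.free() < needed:
        return False  # request fails; the active container is left untouched
    # Step 2: stop the active container now that resources are acquired.
    del host.running[container_id]
    # Step 3: start the container on the acquired resources.
    host.running[container_id] = needed
    return True
```

In this model a host with no headroom rejects the restart at step 1, because
the old container still holds its resources while the new request is made.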

In production we have observed the following with ATC / concourse:
1. CDP jobs are configured with resources for peak load, which leaves no
headroom on the host for requesting additional resources
2. As a result, restart requests fail because resources cannot be obtained on
that host

A fix for this is to implement a force-restart utility for CDP. In this version
we will stop the container first and then acquire resources. The upside is that
we will at least free up the resources on the host before issuing the resource
request. The downside is that bringing that container back up on that host
becomes best effort.
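
A minimal sketch of the proposed ordering, under the same caveat: Host,
force_restart and the claimed_by_others parameter are hypothetical and only
model the trade-off described above. They are not the actual implementation.

```python
from dataclasses import dataclass, field


@dataclass
class Host:
    """Hypothetical model of a host: total capacity and per-container usage."""
    capacity: int
    running: dict = field(default_factory=dict)  # container id -> resource units

    def free(self) -> int:
        """Resource units not currently held by any running container."""
        return self.capacity - sum(self.running.values())


def force_restart(host: Host, container_id: str, claimed_by_others: int = 0) -> bool:
    """Assumed model of force restart: stop first, then acquire, then start.

    claimed_by_others models resources taken on the host by someone else
    between the stop and the resource request.
    """
    # Stop the container first, which frees its resources on the host.
    needed = host.running.pop(container_id)
    # Then request resources; this is best effort and may not succeed.
    if host.free() - claimed_by_others < needed:
        return False  # the container is down and was not brought back on this host
    host.running[container_id] = needed
    return True
```

With a fully packed host the request now succeeds because the stop frees the
needed resources, but if the freed resources are claimed elsewhere in the
meantime the container stays down on that host.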



--
This message was sent by Atlassian Jira
(v8.3.4#803005)
