Sanil15 opened a new pull request #1414:
URL: https://github.com/apache/samza/pull/1414


   **Changes**: The current restart ability for container placements works in 
the following way:
   1. Tries to fetch resources on a host
   2. Stops the active container if resources are accrued
   3. Tried to start the container on host accrued
   
   In production, we have seen the following observation at Linkedin
   1. Some jobs are configured to use resources for the peak which leads to no 
headroom left on a host for requesting additional resources
   2. This leads to restart requests failing due to not able to get resources 
on that host
   
   A fix to this is to implement a force-restart utility , in this version we 
will stop the container first and then accrue resources. The upside being we 
will at least free up the resources on the host before issuing resource 
request, the downside being it will be a best-effort scenario to bring that 
container back up on that host
   
   **API Changes:** Added new param values to destinationHost param for 
container placement request message
   
   LAST_SEEN: Tries to restart a container on last seen host with RESERVE -> 
STOP -> MOVE policy
   
   FORCE_RESTART_LAST_SEEN: Tries to restart a container on last seen host with 
STOP -> RESERVE -> MOVE policy
   
   **Upgrade Instructions:** None
   
   **Usage Instructions:** None
   


----------------------------------------------------------------
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
[email protected]


Reply via email to