Sanil15 opened a new pull request #1414: URL: https://github.com/apache/samza/pull/1414
**Changes**: The current restart ability for container placements works in the following way: 1. Tries to fetch resources on a host 2. Stops the active container if resources are accrued 3. Tried to start the container on host accrued In production, we have seen the following observation at Linkedin 1. Some jobs are configured to use resources for the peak which leads to no headroom left on a host for requesting additional resources 2. This leads to restart requests failing due to not able to get resources on that host A fix to this is to implement a force-restart utility , in this version we will stop the container first and then accrue resources. The upside being we will at least free up the resources on the host before issuing resource request, the downside being it will be a best-effort scenario to bring that container back up on that host **API Changes:** Added new param values to destinationHost param for container placement request message LAST_SEEN: Tries to restart a container on last seen host with RESERVE -> STOP -> MOVE policy FORCE_RESTART_LAST_SEEN: Tries to restart a container on last seen host with STOP -> RESERVE -> MOVE policy **Upgrade Instructions:** None **Usage Instructions:** None ---------------------------------------------------------------- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. For queries about this service, please contact Infrastructure at: [email protected]
