[
https://issues.apache.org/jira/browse/YARN-4205?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15525225#comment-15525225
]
Gour Saha commented on YARN-4205:
---------------------------------
YARN-4692 also needs total execution timeout of application. It does not need
queue or state-store timeouts. The only additional thing it needs, is the
monitor start time to provide options like LAUNCH_FIRST and LAUNCH_EVERYTIME on
top of SUBMISSION.
YARN-4692 is for long running or semi long-running services. Long running
services are meant to run forever. Semi long-running services are meant to run
for several hours or few days or weeks. For the semi long-running usecases, we
have applications like a CI (continuous-integration) app or a System Test app,
which typically needs several hours to finish its job. CI app or System Test
app owners are not going to be happy that their app did not run at all, even
though it was given a timeout of 10 hours because YARN got a chance to allocate
resource to it at the 9:59 hour mark since submission. That is why we need
LAUNCH_FIRST.
LAUNCH_EVERYTIME will cover scenarios like say a load test app needs to run
end-to-end for at least 2 days straight to certify a product. If such an app is
pre-empted at say the 47 hour mark, it needs a fresh 48 hour lifetime the next
time it is re-launched.
> Add a service for monitoring application life time out
> ------------------------------------------------------
>
> Key: YARN-4205
> URL: https://issues.apache.org/jira/browse/YARN-4205
> Project: Hadoop YARN
> Issue Type: Sub-task
> Components: scheduler
> Reporter: nijel
> Assignee: Rohith Sharma K S
> Attachments: 0001-YARN-4205.patch, 0002-YARN-4205.patch,
> 0003-YARN-4205.patch, 0004-YARN-4205.patch, 0005-YARN-4205.patch,
> 0006-YARN-4205.patch, YARN-4205_01.patch, YARN-4205_02.patch,
> YARN-4205_03.patch
>
>
> This JIRA intend to provide a lifetime monitor service.
> The service will monitor the applications where the life time is configured.
> If the application is running beyond the lifetime, it will be killed.
> The lifetime will be considered from the submit time.
> The thread monitoring interval is configurable.
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]