[
https://issues.apache.org/jira/browse/YARN-8122?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16450345#comment-16450345
]
Eric Yang commented on YARN-8122:
---------------------------------
By using patch 005, the attached log indicates there is 4 failures of
containers with exit code 7. There are 29 containers allocated within 400
seconds. The failure rate is 13%, and the test json set
yarn.service.container-health-threshold.percent at 90%. Health threshold
monitor never reported unhealthy in the first 480 second window.
> Component health threshold monitor
> ----------------------------------
>
> Key: YARN-8122
> URL: https://issues.apache.org/jira/browse/YARN-8122
> Project: Hadoop YARN
> Issue Type: Sub-task
> Reporter: Gour Saha
> Assignee: Gour Saha
> Priority: Major
> Attachments: YARN-8122.001.patch, YARN-8122.002.patch,
> YARN-8122.003.patch, YARN-8122.004.patch, YARN-8122.005.patch,
> YARN-8122.draft.patch, YARN-8122.test.json, YARN-8122.test.log
>
>
> Slider supported component health threshold monitoring with SLIDER-1246. It
> would be good to have this feature for YARN Service too.
--
This message was sent by Atlassian JIRA
(v7.6.3#76005)
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]