[
https://issues.apache.org/jira/browse/SLIDER-1216?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Gour Saha updated SLIDER-1216:
------------------------------
Description: This will track phase-2 of SLIDER-1185 where a significant
chunk of the work is already done. This will track a few of the remaining
sub-tasks which can come in the next release. (was: Today, the options to
debug a failing Slider application are painful. One option is to traverse
several links in RM UI starting from the application link and going all the way
down to the container logs. An app-owner might have access to a gateway, but
still logs might not be available until the app dies if the cluster is not
enabled with rolling log aggregation.
Slider provides capability to create apps with friendly names and hence to a
certain extent hides the YARN application id. It is not difficult to find the
YARN application id, but then app owners are more used to referring their apps
by their well-known names. All interactions from the command line using the
Slider client requires only the app name.
It would be great to provide container diagnostics (live and dead) like
absolute links to container logs in RM UI (links for live and dead containers
will be different), additional YARN-level diagnostics (specifically for
failed/killed containers), etc. With the absolute log links, an app owner can
directly jump to the container logs without having to hunt through RM UI. All
these information should be made available from Slider client so that
app-owners can query directly from cmd-line using app names. Consumers of
Slider client as an SDK will be able to call appropriate APIs and get these
diagnostic information. For example, Ambari Slider Views can now show these
diagnostics directly in the Ambari UI, relieving the app owners of the pain to
traverse the RM UI. Eventually, when the app dies/completes these container
diagnostics should be published to YARN status, such that debugging of a failed
application becomes easy as well.
At a high level, I am thinking of a cmd-line like -
slider diagnostics --name <app-name> --containers
)
> [Phase 2] Increase Slider applications (live/dead) debuggability by providing
> container (live and dead) diagnostics from cmd-line and YARN status
> -------------------------------------------------------------------------------------------------------------------------------------------------
>
> Key: SLIDER-1216
> URL: https://issues.apache.org/jira/browse/SLIDER-1216
> Project: Slider
> Issue Type: Bug
> Components: appmaster, client
> Affects Versions: Slider 0.91
> Reporter: Gour Saha
> Assignee: Gour Saha
> Fix For: Slider 1.0.0
>
>
> This will track phase-2 of SLIDER-1185 where a significant chunk of the work
> is already done. This will track a few of the remaining sub-tasks which can
> come in the next release.
--
This message was sent by Atlassian JIRA
(v6.3.15#6346)