[ https://issues.apache.org/jira/browse/EAGLE-920?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Jayesh updated EAGLE-920:
-------------------------
Fix Version/s: v0.5.1 (was: v0.5.0)
> MR failed job troubleshooting
> -----------------------------
>
> Key: EAGLE-920
> URL: https://issues.apache.org/jira/browse/EAGLE-920
> Project: Eagle
> Issue Type: Improvement
> Components: App::Job Performance Monitor
> Affects Versions: v0.5.0
> Reporter: wujinhu
> Assignee: wujinhu
> Fix For: v0.5.1
>
>
> We follow the steps below to troubleshoot a failed MapReduce (MR) job.
> 1. Get the error category distribution of the job via the API:
> query=TaskAttemptErrorCategoryService[@site="sandbox" and @jobId="job_1486726244016_162594"]<@errorCategory>{count}
> 2. Get the mapping from error category to error message, together with the list of failed task attempts:
> query=JobErrorMappingService[@site="sandbox" and @jobId="job_1486726244016_162594" and @errorCategory="java.lang.RuntimeException"]
> 3. Drill into a single task attempt:
> query=TaskAttemptExecutionService[@site="sandbox" and @taskAttemptId="attempt_1486726244016_162594_m_002451_1"]
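The three queries above share one shape: a service name, a bracketed list of `@field="value"` conditions joined by `and`, and (in step 1 only) a group-by field with an aggregate. A minimal Python sketch of assembling them could look like the following. `build_query` is a hypothetical helper, not part of Eagle, and the sketch only reproduces the syntax visible in this issue. The HTTP endpoint that accepts the `query` parameter is not given here, so the sketch stops at producing the URL-encoded parameter value.

```python
from urllib.parse import quote


def build_query(service, filters, group_by=None, aggregate=None):
    """Assemble a query string shaped like the examples in this issue.

    Hypothetical helper: it mirrors only the syntax shown above,
    Service[@k="v" and ...]<@groupField>{aggregate}.
    """
    conditions = " and ".join(f'@{key}="{value}"' for key, value in filters.items())
    query = f"{service}[{conditions}]"
    if group_by is not None:
        query += f"<@{group_by}>"
    if aggregate is not None:
        query += "{" + aggregate + "}"
    return query


SITE = "sandbox"
JOB_ID = "job_1486726244016_162594"

# Step 1: error category distribution for the job.
step1 = build_query(
    "TaskAttemptErrorCategoryService",
    {"site": SITE, "jobId": JOB_ID},
    group_by="errorCategory",
    aggregate="count",
)

# Step 2: error messages and failed task attempts for one category.
step2 = build_query(
    "JobErrorMappingService",
    {"site": SITE, "jobId": JOB_ID, "errorCategory": "java.lang.RuntimeException"},
)

# Step 3: details of a single task attempt.
step3 = build_query(
    "TaskAttemptExecutionService",
    {"site": SITE, "taskAttemptId": "attempt_1486726244016_162594_m_002451_1"},
)

# URL-encode each query so it can be passed as the value of a `query` parameter.
encoded = [quote(q, safe="") for q in (step1, step2, step3)]
for value in encoded:
    print(value)
```

In practice the error category used in step 2 would be taken from the step 1 result, and the task attempt ID used in step 3 from the step 2 result.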
--
This message was sent by Atlassian JIRA
(v6.4.14#64029)