[
https://issues.apache.org/jira/browse/YARN-1783?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13920105#comment-13920105
]
Arpit Gupta commented on YARN-1783:
-----------------------------------
application that took the longest is application_1393347856479_0014
{code}
14/02/25 17:41:01 INFO mapreduce.Job: Job job_1393347856479_0017 running in
uber mode : false
14/02/25 17:41:01 INFO mapreduce.Job: map 0% reduce 0%
2014-02-25 17:41:02,145|beaver.machine|INFO|RUNNING: /usr/bin/yarn application
-list -appStates NEW,NEW_SAVING,SUBMITTED,ACCEPTED,RUNNING
2014-02-25 17:41:03,419|beaver.machine|INFO|Total number of applications
(application-types: [] and states: [NEW, NEW_SAVING, SUBMITTED, ACCEPTED,
RUNNING]):6
2014-02-25 17:41:03,419|beaver.machine|INFO|Application-Id
Application-Name Application-Type User Queue
State Final-State Progress
Tracking-URL
2014-02-25 17:41:03,420|beaver.machine|INFO|application_1393347856479_0013
Sleep job MAPREDUCE hrt_qa default
ACCEPTED UNDEFINED 0%
N/A
2014-02-25 17:41:03,420|beaver.machine|INFO|application_1393347856479_0014
test_mapred_ha_pending_job_rm_1393349992-1 MAPREDUCE
hrt_qa default ACCEPTED UNDEFINED
0% null
2014-02-25 17:41:03,420|beaver.machine|INFO|application_1393347856479_0018
test_mapred_ha_pending_job_rm_1393349992-3 MAPREDUCE
hrt_qa default RUNNING UNDEFINED
5% http://hor12n10.gq1.ygridcore.net:41840
2014-02-25 17:41:03,420|beaver.machine|INFO|application_1393347856479_0017
test_mapred_ha_pending_job_rm_1393349992-2 MAPREDUCE
hrt_qa default RUNNING UNDEFINED
5% http://hor12n10.gq1.ygridcore.net:51732
2014-02-25 17:41:03,421|beaver.machine|INFO|application_1393347856479_0016
test_mapred_ha_pending_job_rm_1393349992-4 MAPREDUCE
hrt_qa default RUNNING UNDEFINED
5% http://hor12n08.gq1.ygridcore.net:50966
2014-02-25 17:41:03,421|beaver.machine|INFO|application_1393347856479_0015
test_mapred_ha_pending_job_rm_1393349992-0 MAPREDUCE
hrt_qa default RUNNING UNDEFINED
35.01% http://hor12n08.gq1.ygridcore.net:54998
{code}
and this is when it completed
{code}
2014-02-25 20:52:32,992|beaver.machine|INFO|Total number of applications
(application-types: [] and states: [NEW, NEW_SAVING, SUBMITTED, ACCEPTED,
RUNNING]):1
2014-02-25 20:52:32,993|beaver.machine|INFO|Application-Id
Application-Name Application-Type User Queue
State Final-State Progress
Tracking-URL
2014-02-25 20:52:32,993|beaver.machine|INFO|application_1393347856479_0014
test_mapred_ha_pending_job_rm_1393349992-1 MAPREDUCE
hrt_qa default RUNNING UNDEFINED
86.01% http://hor12n08.gq1.ygridcore.net:46622
14/02/25 20:52:35 INFO mapreduce.Job: map 100% reduce 100%
14/02/25 20:52:37 INFO mapreduce.Job: Job job_1393347856479_0014 completed
successfully
14/02/25 20:52:37 INFO mapreduce.Job: Counters: 49
File System Counters
{code}
> yarn application does not make any progress even when no other application is
> running when RM is being restarted in the background
> ----------------------------------------------------------------------------------------------------------------------------------
>
> Key: YARN-1783
> URL: https://issues.apache.org/jira/browse/YARN-1783
> Project: Hadoop YARN
> Issue Type: Bug
> Affects Versions: 2.4.0
> Reporter: Arpit Gupta
> Assignee: Jian He
> Priority: Critical
>
> Noticed that during HA tests some tests took over 3 hours to run when the
> test failed.
> Looking at the logs i see the application made no progress for a very long
> time. However if i look at application log from yarn it actually ran in 5 mins
> I am seeing same behavior when RM was being restarted in the background and
> when both RM and AM were being restarted.
--
This message was sent by Atlassian JIRA
(v6.2#6252)