Mostafa Mokhtar created TEZ-2263:
------------------------------------
Summary: Tez : Don't try to recover from a failed commit
Key: TEZ-2263
URL: https://issues.apache.org/jira/browse/TEZ-2263
Project: Apache Tez
Issue Type: Bug
Affects Versions: 0.5.0
Reporter: Mostafa Mokhtar
Fix For: 0.5.0
The DAG commit fails, then Tez tries to recover the DAG, and that recovery fails again.
{code}
5499174247-2015-04-01 16:23:20,600 INFO [main] app.RecoveryParser: Found
summary file in attempt directory,
summaryFile=hdfs://cn105-10.l42scl.hortonworks.com:8020/tmp/hive/mmokhtar/_tez_session_dir/8b149f3c-3947-4b34-a5e2-92657aa68e96/.tez/application_1426707664723_1086/recovery/1/summary,
path=hdfs://cn105-10.l42scl.hortonworks.com:8020/tmp/hive/mmokhtar/_tez_session_dir/8b149f3c-3947-4b34-a5e2-92657aa68e96/.tez/application_1426707664723_1086/recovery/1
5499174696-2015-04-01 16:23:20,600 INFO [main] app.RecoveryParser: Using
hdfs://cn105-10.l42scl.hortonworks.com:8020/tmp/hive/mmokhtar/_tez_session_dir/8b149f3c-3947-4b34-a5e2-92657aa68e96/.tez/application_1426707664723_1086/recovery/1
for recovering data from previous attempt
5499174963-2015-04-01 16:23:20,690 INFO [main] app.RecoveryParser: Parsing
summary file,
path=hdfs://cn105-10.l42scl.hortonworks.com:8020/tmp/hive/mmokhtar/_tez_session_dir/8b149f3c-3947-4b34-a5e2-92657aa68e96/.tez/application_1426707664723_1086/recovery/1/summary,
len=4024, lastModTime=1427919788998
5499175254-2015-04-01 16:23:20,786 INFO [main] app.RecoveryParser: Reached end
of summary stream
5499175340-2015-04-01 16:23:21,087 INFO [main] app.RecoveryParser: Checking if
DAG is in recoverable state, dagId=dag_1426707664723_1086_1
5499175468-2015-04-01 16:23:21,088 WARN [main] app.RecoveryParser: Found last
inProgress DAG but not recoverable: dagId=dag_1426707664723_1086_1,
dagCompleted=false
5499175622-2015-04-01 16:23:21,088 INFO [main] app.RecoveryParser: Trying to
recover dag from recovery file, dagId=dag_1426707664723_1086_1,
dataDir=hdfs://cn105-10.l42scl.hortonworks.com:8020/tmp/hive/mmokhtar/_tez_session_dir/8b149f3c-3947-4b34-a5e2-92657aa68e96/.tez/application_1426707664723_1086/recovery/1,
intoCurrentDir=hdfs://cn105-10.l42scl.hortonworks.com:8020/tmp/hive/mmokhtar/_tez_session_dir/8b149f3c-3947-4b34-a5e2-92657aa68e96/.tez/application_1426707664723_1086/recovery/2
5499176102-2015-04-01 16:23:21,091 INFO [main] app.RecoveryParser: Copying DAG
data into Current Attempt directory,
filePath=hdfs://cn105-10.l42scl.hortonworks.com:8020/tmp/hive/mmokhtar/_tez_session_dir/8b149f3c-3947-4b34-a5e2-92657aa68e96/.tez/application_1426707664723_1086/recovery/2/dag_1426707664723_1086_1.recovery
5499176413-2015-04-01 16:23:21,211 INFO [main] app.RecoveryParser: Recovering
from event, eventType=DAG_SUBMITTED, event=dagID=dag_1426707664723_1086_1,
submitTime=1427917169723
5499176580-2015-04-01 16:23:21,309 INFO [main] app.DAGAppMaster: Generating DAG
graphviz file, dagId=dag_1426707664723_1086_1,
filePath=/grid/0/cluster/yarn/log/application_1426707664723_1086/container_1426707664723_1086_02_000001/dag_1426707664723_1086_1.dot
5499176829-2015-04-01 16:23:21,347 INFO [main] app.DAGAppMaster: Writing DAG
plan to:
/grid/0/cluster/yarn/log/application_1426707664723_1086/container_1426707664723_1086_02_000001/dag_1426707664723_1086_1-tez-dag.pb.txt
5499177039-2015-04-01 16:23:22,576 INFO [main] app.RecoveryParser: Finished
copying data from previous attempt into current attempt
5499177160-2015-04-01 16:23:22,576 INFO [main] app.RecoveryParser: Trying to
create data recovered flag file,
filePath=hdfs://cn105-10.l42scl.hortonworks.com:8020/tmp/hive/mmokhtar/_tez_session_dir/8b149f3c-3947-4b34-a5e2-92657aa68e96/.tez/application_1426707664723_1086/recovery/2/dataRecovered
5499177445-2015-04-01 16:23:22,601 INFO [main] app.DAGAppMaster: In Session
mode. Waiting for DAG over RPC
5499177541-2015-04-01 16:23:22,601 INFO [main] app.DAGAppMaster: Found previous
DAG in completed or non-recoverable state, dagId=dag_1426707664723_1086_1,
isCompleted=false, isNonRecoverable=true, state=null, failureReason=DAG Commit
was in progress, not recoverable, dagId=dag_1426707664723_1086_1
5499177829-2015-04-01 16:23:22,601 INFO [main] common.TezUtilsInternal:
Redirecting log file based on addend: dag_1426707664723_1086_1
5499177953-
5499177954-LogType:syslog_dag_1426707664723_1086_1
5499177994-Log Upload Time:1-Apr-2015 16:24:30
5499178030-LogLength:521
5499178044-Log Contents:
5499178058-2015-04-01 16:23:22,604 INFO [main] impl.DAGImpl: Recovered DAG:
dag_1426707664723_1086_1 finished with state: FAILED
5499178176-2015-04-01 16:23:22,604 INFO [main] impl.DAGImpl:
dag_1426707664723_1086_1 transitioned from NEW to FAILED
5499178283-2015-04-01 16:23:22,604 INFO [AsyncDispatcher event handler]
app.DAGAppMaster: DAG completed, dagId=dag_1426707664723_1086_1, dagState=FAILED
5499178425-2015-04-01 16:23:22,605 INFO [AsyncDispatcher event handler]
common.TezUtilsInternal: Redirecting log file based on addend:
dag_1426707664723_1086_1_post
5499178579-
5499178580-LogType:syslog_dag_1426707664723_1086_1_post
5499178625-Log Upload Time:1-Apr-2015 16:24:30
5499178661-LogLength:4021
5499178676-Log Contents:
5499178690-2015-04-01 16:23:22,605 INFO [AsyncDispatcher event handler]
app.DAGAppMaster: Waiting for next DAG to be submitted.
5499178807-2015-04-01 16:24:01,681 INFO [IPC Server handler 0 on 53890]
client.DAGClientHandler: Received message to shutdown AM
5499178925-2015-04-01 16:24:01,682 INFO [IPC Server handler 0 on 53890]
rm.TaskSchedulerEventHandler: TaskScheduler notified that it should unregister
from RM
5499179073-2015-04-01 16:24:01,682 INFO [IPC Server handler 0 on 53890]
app.DAGAppMaster: No current running DAG, shutting down the AM
5499179197-2015-04-01 16:24:01,682 INFO [IPC Server handler 0 on 53890]
app.DAGAppMaster: DAGAppMasterShutdownHandler invoked
5499179312-2015-04-01 16:24:01,682 INFO [IPC Server handler 0 on 53890]
app.DAGAppMaster: Handling DAGAppMaster shutdown
5499179422-2015-04-01 16:24:01,683 INFO [AMShutdownThread] app.DAGAppMaster:
Sleeping for 5 seconds before shutting down
5499179532-2015-04-01 16:24:04,151 INFO [HistoryEventHandlingThread]
ats.ATSHistoryLoggingService: Event queue stats,
eventsProcessedSinceLastUpdate=2, eventQueueSize=0
5499179690-2015-04-01 16:24:06,683 INFO [AMShutdownThread] app.DAGAppMaster:
Calling stop for all the services
5499179790-2015-04-01 16:24:06,686 INFO [AMShutdownThread]
history.HistoryEventHandler: Stopping HistoryEventHandler
5499179896-2015-04-01 16:24:06,686 INFO [AMShutdownThread]
recovery.RecoveryService: Stopping RecoveryService
5499179995-2015-04-01 16:24:06,686 INFO [AMShutdownThread]
ats.ATSHistoryLoggingService: Stopping ATSService, eventQueueBacklog=0
5499180114-2015-04-01 16:24:06,686 INFO [RecoveryEventHandlingThread]
recovery.RecoveryService: EventQueue take interrupted. Returning
5499180238-2015-04-01 16:24:06,692 INFO [DelayedContainerManager]
rm.YarnTaskSchedulerService: AllocatedContainerManager Thread interrupted
5499180367-2015-04-01 16:24:06,697 INFO [AMShutdownThread]
rm.YarnTaskSchedulerService: Unregistering application from RM,
exitStatus=SUCCEEDED, exitMessage=Session stats:submittedDAGs=0,
successfulDAGs=0, failedDAGs=1, killedDAGs=0
5499180589-, trackingURL=
5499180604-2015-04-01 16:24:06,713 INFO [AMShutdownThread] impl.AMRMClientImpl:
Waiting for application to be successfully unregistered.
5499180730-2015-04-01 16:24:06,819 INFO [AMShutdownThread]
rm.YarnTaskSchedulerService: Successfully unregistered application from RM
5499180853-2015-04-01 16:24:06,821 INFO [AMShutdownThread] ipc.Server: Stopping
server on 50998
5499180938-2015-04-01 16:24:06,821 INFO [AMRM Callback Handler Thread]
impl.AMRMClientAsyncImpl: Interrupted while waiting for queue
5499181060:java.lang.InterruptedException
5499181091- at
java.util.concurrent.locks.AbstractQueuedSynchronizer$ConditionObject.reportInterruptAfterWait(AbstractQueuedSynchronizer.java:2017)
5499181228- at
java.util.concurrent.locks.AbstractQueuedSynchronizer$ConditionObject.await(AbstractQueuedSynchronizer.java:2052)
5499181346- at
java.util.concurrent.LinkedBlockingQueue.take(LinkedBlockingQueue.java:442)
5499181426- at
org.apache.hadoop.yarn.client.api.async.impl.AMRMClientAsyncImpl$CallbackHandlerThread.run(AMRMClientAsyncImpl.java:274)
5499181551-2015-04-01 16:24:06,823 INFO [IPC Server Responder] ipc.Server:
Stopping IPC Server Responder
5499181645-2015-04-01 16:24:06,822 INFO [IPC Server listener on 50998]
ipc.Server: Stopping IPC Server listener on 50998
5499181755-2015-04-01 16:24:06,822 INFO [AMShutdownThread] ipc.Server: Stopping
server on 53890
5499181840-2015-04-01 16:24:06,826 INFO [IPC Server listener on 53890]
ipc.Server: Stopping IPC Server listener on 53890
5499181950-2015-04-01 16:24:06,826 INFO [IPC Server Responder] ipc.Server:
Stopping IPC Server Responder
5499182044-2015-04-01 16:24:06,828 INFO [Thread-1] app.DAGAppMaster:
DAGAppMasterShutdownHook invoked
5499182135-2015-04-01 16:24:06,828 INFO [Thread-1] app.DAGAppMaster: The
shutdown handler is still running, waiting for it to complete
5499182259-2015-04-01 16:24:06,839 WARN [AMShutdownThread] app.DAGAppMaster:
Failed to delete tez scratch data dir,
path=hdfs://cn105-10.l42scl.hortonworks.com:8020/tmp/hive/mmokhtar/_tez_session_dir/8b149f3c-3947-4b34-a5e2-92657aa68e96/.tez/application_1426707664723_1086
5499182521-2015-04-01 16:24:06,839 INFO [AMShutdownThread] app.DAGAppMaster:
Exiting DAGAppMaster..GoodBye!
5499182618-2015-04-01 16:24:06,839 INFO [Thread-1] app.DAGAppMaster: The
shutdown handler has completed
5499182711-
5499182712-
5499182713-
5499182714-Container: container_1426707664723_1086_01_000304 on
cn110-10.l42scl.hortonworks.com_45454
5499182805-============================================================================================
5499182898-LogType:stderr
5499182913-Log Upload Time:1-Apr-2015 16:24:30
5499182949-LogLength:0
5499182961-Log Contents:
5499182975-
5499182976-LogType:stdout
5499182991-Log Upload Time:1-Apr-2015 16:24:30
5499183027-LogLength:81042
5499183043-Log Contents:
5499183057-9.076: [GC [PSYoungGen: 415194K->108031K(756736K)]
1463770K->1182454K(7674880K), 0.1355580 secs] [Times: user=0.59 sys=0.13,
real=0.13 secs]
5499183199-16.885: [GC [PSYoungGen: 446979K->70470K(756736K)]
1521402K->1144901K(7674880K), 0.1141880 secs] [Times: user=0.41 sys=0.08,
real=0.11 secs]
5499183341-32.610: [GC [PSYoungGen: 401811K->23054K(756736K)]
1476242K->1097493K(7674880K), 0.0198940 secs] [Times: user=0.26 sys=0.00,
real=0.02 secs]
5499183483-37.397: [GC [PSYoungGen: 354557K->108009K(756736K)]
2477572K->2246623K(7674880K), 0.0591560 secs] [Times: user=0.82 sys=0.06,
real=0.06 secs]
5499183626-42.607: [GC [PSYoungGen: 434535K->15928K(649216K)]
2573148K->2154549K(7567360K), 0.0639210 secs] [Times: user=0.45 sys=0.03,
real=0.06 secs]
5499183768-47.641: [GC [PSYoungGen: 553804K->22893K(689152K)]
3741001K->3210098K(7607296K), 0.0727270 secs] [Times: user=0.55 sys=0.06,
real=0.07 secs]
5499183910-53.107: [GC [PSYoungGen: 343874K->137457K(646656K)]
4579655K->4384105K(7564800K), 0.1012020 secs] [Times: user=0.50 sys=0.14,
real=0.10 secs]
5499184053-62.586: [GC [PSYoungGen: 484074K->89878K(676864K)]
4730722K->4336778K(7595008K), 0.0985300 secs] [Times: user=0.56 sys=0.08,
real=0.10 secs]
5499184195-76.005: [GC [PSYoungGen: 449294K->16827K(668672K)]
4696194K->4274117K(7586816K), 0.0315610 secs] [Times: user=0.36 sys=0.03,
real=0.03 secs]
5499184337-79.777: [GC [PSYoungGen: 211179K->19599K(680448K)]
5517045K->5332144K(7598592K), 0.0304100 secs] [Times: user=0.37 sys=0.02,
real=0.03 secs]
5499184479-81.315: [GC [PSYoungGen: 150008K->78090K(677888K)]
5462554K->5399597K(7596032K), 0.0293570 secs] [Times: user=0.39 sys=0.02,
real=0.03 secs]
5499184621-82.455: [GC [PSYoungGen: 237557K->512K(687616K)]
5559064K->5324779K(7605760K), 0.0384990 secs] [Times: user=0.34 sys=0.02,
real=0.04 secs]
5499184761-84.067: [GC [PSYoungGen: 210827K->14117K(687616K)]
6583671K->6387049K(7605760K), 0.0517180 secs] [Times: user=0.66 sys=0.01,
real=0.05 secs]
5499184903-88.416: [GC [PSYoungGen: 268920K->15351K(696320K)]
6641851K->6394989K(7614464K), 0.0787950 secs] [Times: user=0.51 sys=0.02,
real=0.08 secs]
5499185045-101.043: [GC [PSYoungGen: 376721K->448K(691200K)]
6756359K->6387846K(7609344K), 0.0282280 secs] [Times: user=0.44 sys=0.02,
real=0.03 secs]
5499185186-103.105: [GC [PSYoungGen: 82965K->13643K(705536K)]
6470363K->6401129K(7623680K), 0.1482160 secs] [Times: user=0.53 sys=0.00,
real=0.15 secs]
5499185328-103.253: [GC [PSYoungGen: 13643K->0K(699392K)]
6401129K->6400834K(7617536K), 0.0805200 secs] [Times: user=0.56 sys=0.02,
real=0.08 secs]
5499185466-103.334: [Full GC [PSYoungGen: 0K->0K(699392K)] [ParOldGen:
6400834K->26007K(6918144K)] 6400834K->26007K(7617536K) [PSPermGen:
33081K->33057K(66560K)], 0.4079400 secs] [Times: user=1.08 sys=0.17, real=0.41
secs]
5499185679-108.478: [GC [PSYoungGen: 279515K->23409K(718848K)]
1354098K->1097992K(7636992K), 0.0143960 secs] [Times: user=0.06 sys=0.00,
real=0.01 secs]
5499185822-121.280: [GC [PSYoungGen: 322641K->279K(709632K)]
1397224K->1082530K(7627776K), 0.0187460 secs] [Times: user=0.10 sys=0.00,
real=0.01 secs]
5499185963-126.595: [GC [PSYoungGen: 345663K->20073K(731648K)]
2476491K->2150964K(7649792K), 0.0305890 secs] [Times: user=0.19 sys=0.00,
real=0.03 secs]
5499186106-141.120: [GC [PSYoungGen: 596191K->14241K(723456K)]
3775658K->3200292K(7641600K), 0.0189990 secs] [Times: user=0.27 sys=0.00,
real=0.02 secs]
--
18043554745-2015-04-01 16:23:08,508 INFO [AsyncDispatcher event handler]
impl.DAGImpl: No output committers for vertex: Map 49
18043554860-2015-04-01 16:23:08,508 INFO [AsyncDispatcher event handler]
impl.DAGImpl: No output committers for vertex: Map 44
18043554975-2015-04-01 16:23:08,508 INFO [AsyncDispatcher event handler]
impl.DAGImpl: No output committers for vertex: Map 45
18043555090-2015-04-01 16:23:08,508 INFO [AsyncDispatcher event handler]
impl.DAGImpl: No output committers for vertex: Map 50
18043555205-2015-04-01 16:23:08,508 INFO [AsyncDispatcher event handler]
impl.DAGImpl: No output committers for vertex: Map 25
18043555320-2015-04-01 16:23:08,508 INFO [AsyncDispatcher event handler]
impl.DAGImpl: No output committers for vertex: Map 17
18043555435-2015-04-01 16:23:08,508 INFO [AsyncDispatcher event handler]
impl.DAGImpl: No output committers for vertex: Map 23
18043555550-2015-04-01 16:23:08,508 INFO [AsyncDispatcher event handler]
impl.DAGImpl: No output committers for vertex: Map 24
18043555665-2015-04-01 16:23:08,508 INFO [AsyncDispatcher event handler]
impl.DAGImpl: No output committers for vertex: Map 21
18043555780-2015-04-01 16:23:08,508 INFO [AsyncDispatcher event handler]
impl.DAGImpl: No output committers for vertex: Map 22
18043555895-2015-04-01 16:23:08,508 INFO [AsyncDispatcher event handler]
impl.DAGImpl: No output committers for vertex: Reducer 38
18043556014-2015-04-01 16:23:08,508 INFO [AsyncDispatcher event handler]
impl.DAGImpl: No output committers for vertex: Map 20
18043556129-2015-04-01 16:23:08,508 INFO [AsyncDispatcher event handler]
impl.DAGImpl: No output committers for vertex: Reducer 31
18043556248-2015-04-01 16:23:08,508 INFO [AsyncDispatcher event handler]
impl.DAGImpl: No output committers for vertex: Reducer 37
18043556367-2015-04-01 16:23:08,508 INFO [AsyncDispatcher event handler]
impl.DAGImpl: No output committers for vertex: Map 29
18043556482-2015-04-01 16:23:08,508 INFO [AsyncDispatcher event handler]
impl.DAGImpl: No output committers for vertex: Reducer 30
18043556601-2015-04-01 16:23:08,508 INFO [AsyncDispatcher event handler]
impl.DAGImpl: No output committers for vertex: Map 27
18043556716-2015-04-01 16:23:08,508 INFO [AsyncDispatcher event handler]
impl.DAGImpl: No output committers for vertex: Map 28
18043556831-2015-04-01 16:23:08,508 INFO [AsyncDispatcher event handler]
impl.DAGImpl: No output committers for vertex: Reducer 3
18043556949-2015-04-01 16:23:08,508 INFO [AsyncDispatcher event handler]
impl.DAGImpl: No output committers for vertex: Map 26
18043557064-2015-04-01 16:23:08,508 INFO [AsyncDispatcher event handler]
impl.DAGImpl: No output committers for vertex: Reducer 2
18043557182-2015-04-01 16:23:08,508 INFO [AsyncDispatcher event handler]
impl.DAGImpl: No output committers for vertex: Map 32
18043557297-2015-04-01 16:23:08,508 INFO [AsyncDispatcher event handler]
impl.DAGImpl: No output committers for vertex: Reducer 36
18043557416-2015-04-01 16:23:08,508 INFO [AsyncDispatcher event handler]
impl.DAGImpl: No output committers for vertex: Reducer 35
18043557535-2015-04-01 16:23:08,508 INFO [AsyncDispatcher event handler]
impl.DAGImpl: No output committers for vertex: Reducer 34
18043557654-2015-04-01 16:23:08,508 INFO [AsyncDispatcher event handler]
impl.DAGImpl: No output committers for vertex: Map 33
18043557769-2015-04-01 16:23:08,508 INFO [AsyncDispatcher event handler]
impl.DAGImpl: No output committers for vertex: Map 1
18043557883-2015-04-01 16:23:08,508 INFO [AsyncDispatcher event handler]
impl.DAGImpl: No output committers for vertex: Map 39
18043557998-2015-04-01 16:23:08,508 INFO [AsyncDispatcher event handler]
impl.DAGImpl: No output committers for vertex: Map 5
18043558112-2015-04-01 16:23:08,508 INFO [AsyncDispatcher event handler]
impl.DAGImpl: No output committers for vertex: Map 4
18043558226-2015-04-01 16:23:08,508 INFO [AsyncDispatcher event handler]
impl.DAGImpl: No output committers for vertex: Map 43
18043558341-2015-04-01 16:23:08,508 INFO [AsyncDispatcher event handler]
impl.DAGImpl: No output committers for vertex: Map 53
18043558456-2015-04-01 16:23:08,508 INFO [AsyncDispatcher event handler]
impl.DAGImpl: No output committers for vertex: Map 42
18043558571-2015-04-01 16:23:08,508 INFO [AsyncDispatcher event handler]
impl.DAGImpl: No output committers for vertex: Map 54
18043558686-2015-04-01 16:23:08,508 INFO [AsyncDispatcher event handler]
impl.DAGImpl: No output committers for vertex: Map 41
18043558801-2015-04-01 16:23:08,508 INFO [AsyncDispatcher event handler]
impl.DAGImpl: No output committers for vertex: Map 51
18043558916-2015-04-01 16:23:08,508 INFO [AsyncDispatcher event handler]
impl.DAGImpl: No output committers for vertex: Map 14
18043559031-2015-04-01 16:23:08,508 INFO [AsyncDispatcher event handler]
impl.DAGImpl: No output committers for vertex: Reducer 7
18043559149-2015-04-01 16:23:08,508 INFO [AsyncDispatcher event handler]
impl.DAGImpl: No output committers for vertex: Map 13
18043559264-2015-04-01 16:23:08,508 INFO [AsyncDispatcher event handler]
impl.DAGImpl: No output committers for vertex: Reducer 6
18043559382-2015-04-01 16:23:08,508 INFO [AsyncDispatcher event handler]
impl.DAGImpl: No output committers for vertex: Map 16
18043559497-2015-04-01 16:23:08,508 INFO [AsyncDispatcher event handler]
impl.DAGImpl: No output committers for vertex: Map 19
18043559612-2015-04-01 16:23:08,508 INFO [AsyncDispatcher event handler]
impl.DAGImpl: No output committers for vertex: Map 18
18043559727-2015-04-01 16:23:08,508 INFO [AsyncDispatcher event handler]
impl.DAGImpl: No output committers for vertex: Map 15
18043559842-2015-04-01 16:23:08,509 INFO [AsyncDispatcher event handler]
impl.DAGImpl: No exclusive output committers for vertex: Reducer 12
18043559971-2015-04-01 16:23:08,509 INFO [AsyncDispatcher event handler]
impl.DAGImpl: No output committers for vertex: Reducer 10
18043560090-2015-04-01 16:23:08,509 INFO [AsyncDispatcher event handler]
impl.DAGImpl: No output committers for vertex: Reducer 11
18043560209-2015-04-01 16:23:08,509 INFO [AsyncDispatcher event handler]
impl.DAGImpl: No output committers for vertex: Reducer 8
18043560327-2015-04-01 16:23:08,509 INFO [AsyncDispatcher event handler]
impl.DAGImpl: No output committers for vertex: Reducer 9
18043560445-2015-04-01 16:23:08,857 FATAL [AsyncDispatcher event handler]
event.AsyncDispatcher: Error in dispatcher thread
18043560557:org.apache.tez.common.counters.LimitExceededException: Too many
counters: 1201 max=1200
18043560645- at
org.apache.tez.common.counters.Limits.checkCounters(Limits.java:87)
18043560717- at
org.apache.tez.common.counters.Limits.incrCounters(Limits.java:94)
18043560788- at
org.apache.tez.common.counters.AbstractCounterGroup.addCounter(AbstractCounterGroup.java:75)
18043560885- at
org.apache.tez.common.counters.AbstractCounterGroup.addCounterImpl(AbstractCounterGroup.java:92)
18043560986- at
org.apache.tez.common.counters.AbstractCounterGroup.findCounter(AbstractCounterGroup.java:103)
18043561085- at
org.apache.tez.common.counters.AbstractCounterGroup.incrAllCounters(AbstractCounterGroup.java:198)
18043561188- at
org.apache.tez.common.counters.AbstractCounters.incrAllCounters(AbstractCounters.java:363)
18043561283- at
org.apache.tez.dag.app.dag.impl.DAGImpl.incrTaskCounters(DAGImpl.java:598)
18043561362- at
org.apache.tez.dag.app.dag.impl.DAGImpl.getAllCounters(DAGImpl.java:588)
18043561439- at
org.apache.tez.dag.app.dag.impl.DAGImpl.logJobHistoryFinishedEvent(DAGImpl.java:994)
18043561528- at
org.apache.tez.dag.app.dag.impl.DAGImpl.finished(DAGImpl.java:1135)
18043561600- at
org.apache.tez.dag.app.dag.impl.DAGImpl.checkDAGForCompletion(DAGImpl.java:1048)
18043561685- at
org.apache.tez.dag.app.dag.impl.DAGImpl$VertexCompletedTransition.transition(DAGImpl.java:1708)
18043561785- at
org.apache.tez.dag.app.dag.impl.DAGImpl$VertexCompletedTransition.transition(DAGImpl.java:1665)
18043561885- at
org.apache.hadoop.yarn.state.StateMachineFactory$MultipleInternalArc.doTransition(StateMachineFactory.java:385)
18043562001- at
org.apache.hadoop.yarn.state.StateMachineFactory.doTransition(StateMachineFactory.java:302)
18043562097- at
org.apache.hadoop.yarn.state.StateMachineFactory.access$300(StateMachineFactory.java:46)
18043562190- at
org.apache.hadoop.yarn.state.StateMachineFactory$InternalStateMachine.doTransition(StateMachineFactory.java:448)
18043562307- at
org.apache.tez.dag.app.dag.impl.DAGImpl.handle(DAGImpl.java:944)
18043562376- at
org.apache.tez.dag.app.dag.impl.DAGImpl.handle(DAGImpl.java:126)
18043562445- at
org.apache.tez.dag.app.DAGAppMaster$DagEventDispatcher.handle(DAGAppMaster.java:1686)
18043562535- at
org.apache.tez.dag.app.DAGAppMaster$DagEventDispatcher.handle(DAGAppMaster.java:1677)
18043562625- at
org.apache.hadoop.yarn.event.AsyncDispatcher.dispatch(AsyncDispatcher.java:173)
18043562709- at
org.apache.hadoop.yarn.event.AsyncDispatcher$1.run(AsyncDispatcher.java:106)
18043562790- at java.lang.Thread.run(Thread.java:745)
18043562832-2015-04-01 16:23:08,882 INFO [AsyncDispatcher event handler]
event.AsyncDispatcher: Exiting, bbye..
18043562932-2015-04-01 16:23:08,885 INFO [Thread-1] app.DAGAppMaster:
DAGAppMasterShutdownHook invoked
18043563023-2015-04-01 16:23:08,885 INFO [Thread-1] app.DAGAppMaster:
DAGAppMaster received a signal. Signaling TaskScheduler
18043563137-2015-04-01 16:23:08,885 INFO [Thread-1]
rm.TaskSchedulerEventHandler: TaskScheduler notified that iSignalled was : true
18043563257-2015-04-01 16:23:08,899 INFO [Thread-1]
history.HistoryEventHandler: Stopping HistoryEventHandler
18043563355-2015-04-01 16:23:08,900 INFO [Thread-1] recovery.RecoveryService:
Stopping RecoveryService
18043563446-2015-04-01 16:23:08,900 INFO [Thread-1] recovery.RecoveryService:
Closing Summary Stream
18043563535-2015-04-01 16:23:08,900 INFO [RecoveryEventHandlingThread]
recovery.RecoveryService: EventQueue take interrupted. Returning
18043563659-2015-04-01 16:23:09,033 INFO [Thread-1] recovery.RecoveryService:
Closing Output Stream for DAG dag_1426707664723_1086_1
18043563780-2015-04-01 16:23:09,062 INFO [Thread-1]
ats.ATSHistoryLoggingService: Stopping ATSService, eventQueueBacklog=0
18043563891-2015-04-01 16:23:09,064 INFO [Thread-1]
launcher.ContainerLauncherImpl: Sending a stop request to the NM for
ContainerId: container_1426707664723_1086_01_000319
18043564052-2015-04-01 16:23:09,064 INFO [Thread-1]
impl.ContainerManagementProtocolProxy: Opening proxy :
cn113-10.l42scl.hortonworks.com:45454
18043564185-2015-04-01 16:23:09,097 INFO [Thread-1]
launcher.ContainerLauncherImpl: Sending a stop request to the NM for
ContainerId: container_1426707664723_1086_01_000047
18043564346-2015-04-01 16:23:09,097 INFO [Thread-1]
impl.ContainerManagementProtocolProxy: Opening proxy :
cn122-10.l42scl.hortonworks.com:45454
18043564479-2015-04-01 16:23:09,114 INFO [Thread-1]
launcher.ContainerLauncherImpl: Sending a stop request to the NM for
ContainerId: container_1426707664723_1086_01_000306
18043564640-2015-04-01 16:23:09,114 INFO [Thread-1]
impl.ContainerManagementProtocolProxy: Opening proxy :
cn113-10.l42scl.hortonworks.com:45454
18043564773-2015-04-01 16:23:09,120 INFO [Thread-1]
launcher.ContainerLauncherImpl: Sending a stop request to the NM for
ContainerId: container_1426707664723_1086_01_000104
18043564934-2015-04-01 16:23:09,120 INFO [Thread-1]
impl.ContainerManagementProtocolProxy: Opening proxy :
cn111-10.l42scl.hortonworks.com:45454
18043565067-2015-04-01 16:23:09,145 INFO [Thread-1]
launcher.ContainerLauncherImpl: Sending a stop request to the NM for
ContainerId: container_1426707664723_1086_01_000140
18043565228-2015-04-01 16:23:09,145 INFO [Thread-1]
impl.ContainerManagementProtocolProxy: Opening proxy :
cn120-10.l42scl.hortonworks.com:45454
18043565361-2015-04-01 16:23:09,152 INFO [Thread-1]
launcher.ContainerLauncherImpl: Sending a stop request to the NM for
ContainerId: container_1426707664723_1086_01_000236
18043565522-2015-04-01 16:23:09,152 INFO [Thread-1]
impl.ContainerManagementProtocolProxy: Opening proxy :
cn107-10.l42scl.hortonworks.com:45454
18043565655-2015-04-01 16:23:09,159 INFO [Thread-1]
launcher.ContainerLauncherImpl: Sending a stop request to the NM for
ContainerId: container_1426707664723_1086_01_000255
18043565816-2015-04-01 16:23:09,159 INFO [Thread-1]
impl.ContainerManagementProtocolProxy: Opening proxy :
cn116-10.l42scl.hortonworks.com:45454
18043565949-2015-04-01 16:23:09,182 INFO [Thread-1]
launcher.ContainerLauncherImpl: Sending a stop request to the NM for
ContainerId: container_1426707664723_1086_01_000074
{code}
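The behaviour asked for in the summary could be sketched roughly as follows. This is a minimal illustration and not Tez code: `RecoveryDecision`, `DagRecoveryState`, and `shouldAttemptRecovery` are hypothetical names invented for this example. The only details taken from the log above are the `isCompleted=false` flag and the "DAG Commit was in progress, not recoverable" failure reason.

```java
// Hypothetical sketch, not the Tez implementation: decide up front whether a
// new AM attempt should try to recover the previous DAG at all.
public class RecoveryDecision {

    // Assumed summary of what was read back from the previous attempt's
    // recovery data; the field names are illustrative only.
    public static final class DagRecoveryState {
        public final boolean completed;        // DAG already reached a final state
        public final boolean commitInProgress; // previous attempt died mid-commit

        public DagRecoveryState(boolean completed, boolean commitInProgress) {
            this.completed = completed;
            this.commitInProgress = commitInProgress;
        }
    }

    // Returns true only when replaying the DAG is both needed and safe.
    public static boolean shouldAttemptRecovery(DagRecoveryState state) {
        if (state.completed) {
            return false; // nothing left to recover
        }
        if (state.commitInProgress) {
            // A commit that was interrupted or failed may have left partial
            // output, so skip recovery and fail the DAG directly.
            return false;
        }
        return true;
    }

    public static void main(String[] args) {
        // Mirrors the state in the log: isCompleted=false, commit in progress.
        DagRecoveryState fromLog = new DagRecoveryState(false, true);
        System.out.println(shouldAttemptRecovery(fromLog)); // prints "false"
    }
}
```

With a check of this kind, the second attempt would report the DAG as failed without first copying recovery data into the new attempt directory.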