[ https://issues.apache.org/jira/browse/HIVE-6638?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13948801#comment-13948801 ]
Mohammad Kamrul Islam commented on HIVE-6638: --------------------------------------------- In case, anyone is interested. The testing is an involved process and choreographed. I tested it as follows: set mapred.map.tasks.speculative.execution=false; set mapred.job.map.memory.mb=4096; set hive.merge.mapfiles=false; set hive.input.format=org.apache.hadoop.hive.ql.io.HiveInputFormat; create table load_overwrite (key string, value string) stored as textfile; load data local inpath '/tmp/data/' into table load_overwrite; select key from load_overwrite where length(key) > 0 ; Assuming /tmp/data has four copies of kv1.txt. Tested against Hadoop 2.3 in single node Mac machine. The four tasks will run kind of sequentially. Important: When to kill MRAM? I killed the MRAM when the second one finished. It could be anytime before the last one finished. Command used: "jps |grep MRAppMaster |cut -d' ' -f1|xargs kill" I was monitoring in two ways: 1. cd HADOOP_LOG_DIR/userlogs/<app-id> and ran "grep -R "New Final Path" *". This will show what tasks are completed with file written to HDFS. 2. run hadoop fs -lsr hdfs://localhost:9000/tmp/hive-<ID>/. It will show all the tasks' output during the execution. At the end , it is cleaned up. Anyway, if you can kill MRAM during the execution, you should see there are only 4 output files . More importantly, you will see the completed (before MRAM was killed) task never rerun. Also you get the correct result. > Hive needs to implement recovery for Application Master restart > ---------------------------------------------------------------- > > Key: HIVE-6638 > URL: https://issues.apache.org/jira/browse/HIVE-6638 > Project: Hive > Issue Type: Improvement > Components: Query Processor > Affects Versions: 0.11.0, 0.12.0, 0.13.0 > Reporter: Ashutosh Chauhan > Assignee: Mohammad Kamrul Islam > Attachments: HIVE-6638.1.patch > > > Currently, if AM restarts, whole job is restarted. Although, job and > subsequently query would still finish to completion, it would be nice if Hive > don't need to redo all the work done under previous AM. -- This message was sent by Atlassian JIRA (v6.2#6252)