insert overwrite directory leaves behind uncommitted/tmp files from failed tasks
--------------------------------------------------------------------------------

                 Key: HIVE-131
                 URL: https://issues.apache.org/jira/browse/HIVE-131
             Project: Hadoop Hive
          Issue Type: Bug
          Components: Query Processor
            Reporter: Joydeep Sen Sarma
            Priority: Critical


_tmp files are getting left behind on insert overwrite directory:

/user/jssarma/ctst1/40422_m_000195_0.deflate  <r 3> 13285 2008-12-07 01:47  
rw-r--r-- jssarma supergroup
/user/jssarma/ctst1/40422_m_000196_0.deflate  <r 3> 3055  2008-12-07 01:46  
rw-r--r-- jssarma supergroup
/user/jssarma/ctst1/_tmp.40422_m_000033_0 <r 3> 0 2008-12-07 01:53  rw-r--r-- 
jssarma supergroup
/user/jssarma/ctst1/_tmp.40422_m_000037_1 <r 3> 0 2008-12-07 01:53  rw-r--r-- 
jssarma supergroup


this happened with speculative execution. the code looks good (in fact in this 
case many speculative tasks were launched - and only a couple caused problems). 
Almost seems like these files did not appear in the namespace until after the 
map-reduce job finished and the movetask did a listing of the output dir ..

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.

Reply via email to