All,

I have a Hive 1.3 cluster running in production, and a question came up about
INSERT OVERWRITE queries on tables. If I run an INSERT OVERWRITE query on a
table and the query fails halfway through, will the old data still exist in
the table? I'm not completely clear on the exact process that INSERT OVERWRITE
follows, but I believe it writes the data into a staging folder and then does
a remove and move, which should be safe. However, it could also just wipe the
folder before the query starts, which would lose the old data if the query
itself failed. Can someone give me a definitive answer on this? Pointers to the
source code or documentation that explains this would be even better.
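
To make the two possibilities concrete, here is a rough sketch in Python
against a local directory. It only illustrates the two behaviors I am asking
about and is not Hive's actual code; the function names and the write_rows
callback are made up for this example.

```python
import shutil
import tempfile
from pathlib import Path


def overwrite_via_staging(table_dir: Path, write_rows) -> None:
    """Possibility 1 (hypothetical): write to a staging dir, then remove and move."""
    staging = Path(tempfile.mkdtemp(prefix=".staging_", dir=table_dir.parent))
    try:
        write_rows(staging)  # the "query"; may raise partway through
    except Exception:
        # Failure only discards the staging output; table_dir is untouched.
        shutil.rmtree(staging, ignore_errors=True)
        raise
    # Old data is removed only after the new data is fully written.
    shutil.rmtree(table_dir)
    staging.rename(table_dir)


def overwrite_wipe_first(table_dir: Path, write_rows) -> None:
    """Possibility 2 (hypothetical): wipe the folder, then write in place."""
    shutil.rmtree(table_dir)
    table_dir.mkdir()
    write_rows(table_dir)  # a failure here leaves the old data already gone
```

With a write_rows that raises partway through, the first function leaves the
old files in place, while the second leaves only whatever partial output was
written before the failure.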

Thanks!

David McGinnis
