The benefit of using the partitioned approach is really nicely described in the 
oreilly book "Programming Hive". (Thanks for writing it Edward)
For me the ability to drop a single partition if there's any doubt about the 
quality of the data of just one job is a large benefit.

From: Edward Capriolo [mailto:edlinuxg...@gmail.com]
Sent: Thursday, January 24, 2013 3:52 PM
To: user@hive.apache.org
Subject: Re: Loading a Hive table simultaneously from 2 different sources

Partition the table and load the data into different partitions. That or build 
the data outside he table and then use scripting to move the data in using LOAD 
DATA INPATH or copying.
On Thu, Jan 24, 2013 at 9:44 AM, Krishnan K 
<kkrishna...@gmail.com<mailto:kkrishna...@gmail.com>> wrote:
Hi All,

Could you please let me know what would happen if we try to load a table from 2 
different sources at the same time ?

I had tried this earlier and got an error for 1 load job and while the other 
job loaded the data successfully into the table..

I guess it was because of lock acquired on the table by the first load process.

Is there anyway to handle this ?

Please give your insights.

Regards,
Krishnan



Reply via email to