insert - Hadoop vs. Hive

2011-03-30 Thread David Zonsheine
Hi,

I'm trying to compare two ways of getting files into HDFS for use with Hive:
writing them with Hive INSERT statements vs. adding them to HDFS directly and
then querying them with Hive.

Any comments or blog posts about this?

Thanks a lot,

David Zonsheine


Re: insert - Hadoop vs. Hive

2011-03-30 Thread Ashish Thusoo
If the data is already in the right format, you should use the LOAD DATA syntax
in Hive. This basically moves the files into HDFS (so it should be no less
performant than writing to HDFS directly). If the data is not in the correct
format, or it needs to be transformed, then an INSERT statement needs to be
used.
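
As a rough illustration of the two paths, here is a minimal Python sketch that
only builds the HiveQL text for each case. The helper names, table names, and
the staging path are hypothetical and chosen for the example; the statements
follow the usual LOAD DATA INPATH and INSERT OVERWRITE TABLE ... SELECT forms,
and would still need to be run through the Hive CLI or another client.

```python
def load_statement(hdfs_path: str, table: str) -> str:
    """Build a LOAD DATA statement (hypothetical helper).

    Suitable when the files are already in the table's storage format:
    Hive moves them into the table's location without rewriting them.
    """
    return f"LOAD DATA INPATH '{hdfs_path}' INTO TABLE {table}"


def insert_statement(source_table: str, target_table: str, columns: str) -> str:
    """Build an INSERT ... SELECT statement (hypothetical helper).

    Suitable when the data must be converted or transformed: Hive runs a
    query over the source table and writes the result into the target table.
    """
    return (
        f"INSERT OVERWRITE TABLE {target_table} "
        f"SELECT {columns} FROM {source_table}"
    )


if __name__ == "__main__":
    # Assumed names: '/staging/events' and the tables below are placeholders.
    print(load_statement("/staging/events", "events"))
    print(insert_statement("events_raw", "events", "user_id, lower(url)"))
```

The first statement involves no query at all, which is why it should cost
about the same as putting the files into HDFS yourself; the second runs a
query to rewrite the data, so it is the one to use only when a transformation
is actually required.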

Ashish