Hi,

I was wondering how I can query data stored in HBase and remembered Hive's
HBase integration:
http://wiki.apache.org/hadoop/Hive/HBaseIntegration

After watching John Sichi's video
(http://developer.yahoo.com/blogs/hadoop/posts/2010/04/hundreds_of_hadoop_fans_at_the/),
I have a better idea of what functionality this integration provides, but
I still have some questions.

Would it be correct to say that Hive-HBase integration makes the following data
flows possible:

0) Hive or files ==> Custom HQL statement that aggregates data ==> HBase
1) HBase ==> Custom HQL statement that aggregates data ==> HBase
2) HBase ==> Custom HQL statement that aggregates data ==> output (console?)

Of the above, 1) is what I'm wondering the most about right now.

In other words, it seems to me that Hive may be able to look at *just* data
stored in HBase, *without* the typical data/files in HDFS that Hive normally
runs its MR jobs against.

Is this correct?

Thanks,
Otis
----
Sematext :: http://sematext.com/ :: Solr - Lucene - Nutch
Hadoop ecosystem search :: http://search-hadoop.com/
