Question regarding region scans in HBase integration

Daniel Einspanjer Sat, 11 Sep 2010 19:01:27 -0700

I was trying to spend a little time this weekend catching up with the current state of HBase integration for Hive. One thing that I haven't seen mentioned is how exactly Hive scans an HBase table during a SELECT.

Does Hive have logic that allows it to intelligently scan only the participating regions during a SELECT query that uses the rowkey? If not, I recently wrote some code that allows a MapReduce job to effectively select the regions based on a list of start/end rowkey ranges. If this might be useful to the Hive integration, I could create a Jira and take a look at trying to set up a patch.
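
To make the idea concrete, here is a minimal sketch of region selection by rowkey range. It is not the code mentioned above and does not use the HBase or Hive APIs; the names (overlaps, select_regions) and the sample region boundaries are made up for illustration. It assumes each region covers [start key, end key), that an empty key means "unbounded" as in HBase region boundaries, and that keys compare as raw bytes.

```python
from typing import List, Tuple

# (start key inclusive, end key exclusive); b"" means unbounded on that side.
KeyRange = Tuple[bytes, bytes]


def overlaps(region: KeyRange, wanted: KeyRange) -> bool:
    """Return True if the region's key span intersects the wanted range."""
    region_start, region_end = region
    wanted_start, wanted_end = wanted
    # The region must begin before the wanted range ends...
    starts_before_wanted_ends = wanted_end == b"" or region_start < wanted_end
    # ...and the wanted range must begin before the region ends.
    ends_after_wanted_starts = region_end == b"" or wanted_start < region_end
    return starts_before_wanted_ends and ends_after_wanted_starts


def select_regions(regions: List[KeyRange],
                   wanted_ranges: List[KeyRange]) -> List[KeyRange]:
    """Keep only the regions that overlap at least one wanted rowkey range."""
    return [r for r in regions if any(overlaps(r, w) for w in wanted_ranges)]


# Hypothetical table split into four regions.
regions = [(b"", b"g"), (b"g", b"n"), (b"n", b"t"), (b"t", b"")]
# Two rowkey ranges a query might be restricted to.
wanted = [(b"a", b"c"), (b"p", b"q")]
selected = select_regions(regions, wanted)
```

With these sample boundaries, only the first and third regions would need a scan; a MapReduce job could then create input splits for just those regions instead of the whole table.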


Daniel Einspanjer
Metrics Architect
Mozilla Corporation
