Re: Implementing Index Table for Accumulo Hive Queries

Josh Elser Mon, 09 Jan 2017 11:39:12 -0800

Hi Mike,

As far as I understand it, the Hive storage handler APIs (which is howthe Accumulo integration is implemented) doesn't expose any ability todo use index tables to answer some query.

This means that the only thing you can do to make queries faster, wouldbe to create a number of tables, pivoted on the columns you care about,putting the important columns in the rowId. Then, you would have to knowwhich table to use at the application layer.

Admittedly, this is pretty lacking. I'd have to go look at the Hivecommunity to see if this is something that's been built there.


- Josh

Fagan, Michael wrote:

Hi,

I am looking to utilize an index table to avoid full table scans and speed up 
hive queries against an external accumulo table.

Has anyone done this yet? Can someone point me in the right direction?

Regards,
Mike Fagan

Re: Implementing Index Table for Accumulo Hive Queries

Reply via email to