[
https://issues.apache.org/jira/browse/HIVE-9452?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Alan Gates updated HIVE-9452:
-----------------------------
Affects Version/s: hbase-metastore-branch
> Use HBase to store Hive metadata
> --------------------------------
>
> Key: HIVE-9452
> URL: https://issues.apache.org/jira/browse/HIVE-9452
> Project: Hive
> Issue Type: Improvement
> Components: Metastore
> Affects Versions: hbase-metastore-branch
> Reporter: Alan Gates
> Assignee: Alan Gates
> Attachments: HBaseMetastoreApproach.pdf
>
>
> This is an umbrella JIRA for a project to explore using HBase to store the
> Hive data catalog (ie the metastore). This project has several goals:
> # The current metastore implementation is slow when tables have thousands or
> more partitions. With Tez and Spark engines we are pushing Hive to a point
> where queries only take a few seconds to run. But planning the query can
> take as long as running it. Much of this time is spent in metadata
> operations.
> # Due to scale limitations we have never allowed tasks to communicate
> directly with the metastore. However, with the development of LLAP this
> requirement will have to be relaxed. If we can relax this there are other
> use cases that could benefit from this.
> # Eating our own dogfood. Rather than using external systems to store our
> metadata there are benefits to using other components in the Hadoop system.
> The proposal is to create a new branch and work on the prototype there.
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)