[ 
https://issues.apache.org/jira/browse/TAJO-16?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

JaeHwa Jung updated TAJO-16:
----------------------------

    Attachment: TAJO-16.patch

I'm so glad to upload this patch.

>From now on, Tajo can connect HiveMetastore and can get lots of data stored on 
>HDFS through HiveMetaStore. Of course, there were some limitation as follows:

- current version support just text file format.
- current version support just uncompressed type.

But, I'll implement unsupported functions time soon.

If you want to use HiveMetaStore as CatalogServer, you have to start up 
HiveMetaStore server as follows:
{code:xml}
$HIVE_HOME/bin/hive --service metastore
{code}
 
Or you can start up HiveMetaStore server through HCatalog as follows:
{code:xml}
$HCATALOG_HOME/sbin/hcat_server.sh start
{code}
If you start up HCatalog server, HCatalogServer start up HiveMetaStore server 
actually.

And then, you have to update catalog-site.xml as follows:
{code:xml}
  <property>
    <name>tajo.catalog.store.class</name>
    <value>org.apache.tajo.catalog.store.HCatalogStore</value>
  </property>
  <property>
    <name>tajo.catalog.uri</name>
    <value>thrift://localhost:10001</value>
  </property>
{code}
'tajo.catalog.url' is HiveMetaStore sever uri. You must specify your address on 
this property. 

And you can build Tajo in accordance with your hive version. If you want to 
build Tajo in accordance with apache hive 0.11.0, you can build as follows:
{code:xml}
mvn clean install -Phive-0.11.0
{code}
Or if you want to build Tajo in accordance with hive-0.10.0 for cdh4.3.0, you 
can build as follows:
{code:xml}
mvm clean install -Phive-0.10.0-cdh4.3.0
{code}
In other version, you must modify pom.xml file which located at 
'$TAJO_HOME/tajo-catalog/tajo-catalog-server'.


> Enable Tajo catalog to access Hive metastore.
> ---------------------------------------------
>
>                 Key: TAJO-16
>                 URL: https://issues.apache.org/jira/browse/TAJO-16
>             Project: Tajo
>          Issue Type: Improvement
>            Reporter: JaeHwa Jung
>            Assignee: JaeHwa Jung
>             Fix For: 0.8-incubating
>
>         Attachments: TAJO-16.patch, TAJO-16.pdf
>
>
> Hive has been widely used in this area. Many users have maintained lots of 
> big tables through Hive metastore. It would be great to enable Tajo catalog 
> to access Hive metastore. Hive metastore already provides thrift API. We 
> could employ it.



--
This message was sent by Atlassian JIRA
(v6.1#6144)

Reply via email to