[ 
https://issues.apache.org/jira/browse/TAJO-289?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

JaeHwa Jung updated TAJO-289:
-----------------------------

    Attachment: TAJO-289.patch

I'm glad to upload this patch.
From now on, Tajo can connect to the HiveMetaStore and read data stored on 
HDFS through it. Of course, there are some limitations, as follows:

- The current version supports only the text file format.
- The current version does not support compression.

I'll implement the unsupported features soon.

If you want to use HiveMetaStore as the CatalogServer, you have to start the 
HiveMetaStore server as follows:
{code:xml}
$HIVE_HOME/bin/hive --service metastore
{code}
 
Alternatively, you can start the HiveMetaStore server through HCatalog as follows:
{code:xml}
$HCATALOG_HOME/sbin/hcat_server.sh start
{code}
When you start the HCatalog server, it actually starts the HiveMetaStore 
server.

Once the HiveMetaStore server is running, you have to update catalog-site.xml 
as follows:
{code:xml}
  <property>
    <name>tajo.catalog.store.class</name>
    <value>org.apache.tajo.catalog.store.HCatalogStore</value>
  </property>
  <property>
    <name>tajo.catalog.uri</name>
    <value>thrift://localhost:10001</value>
  </property>
{code}
'tajo.catalog.uri' is the HiveMetaStore server URI. You must set this property 
to the address of your own server.
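
Before starting Tajo, it can help to check that catalog-site.xml really carries a usable metastore address. Below is a minimal sketch using only the JDK. CatalogSiteCheck and its methods are hypothetical names, not part of the patch, and the sketch assumes Hadoop-style <property> entries with one <name> and one <value> each.

```java
import java.io.ByteArrayInputStream;
import java.net.URI;
import java.nio.charset.StandardCharsets;
import java.util.HashMap;
import java.util.Map;
import javax.xml.parsers.DocumentBuilderFactory;
import org.w3c.dom.Document;
import org.w3c.dom.Element;
import org.w3c.dom.NodeList;

// Hypothetical helper, not part of the patch: reads Hadoop-style <property>
// entries and sanity-checks the metastore URI.
final class CatalogSiteCheck {

  // Collects every <property><name>..</name><value>..</value></property> pair.
  static Map<String, String> readProperties(String xml) {
    Document doc;
    try {
      doc = DocumentBuilderFactory.newInstance().newDocumentBuilder()
          .parse(new ByteArrayInputStream(xml.getBytes(StandardCharsets.UTF_8)));
    } catch (Exception e) {
      throw new IllegalStateException("cannot parse configuration XML", e);
    }
    Map<String, String> props = new HashMap<>();
    NodeList nodes = doc.getElementsByTagName("property");
    for (int i = 0; i < nodes.getLength(); i++) {
      Element p = (Element) nodes.item(i);
      String name = p.getElementsByTagName("name").item(0).getTextContent().trim();
      String value = p.getElementsByTagName("value").item(0).getTextContent().trim();
      props.put(name, value);
    }
    return props;
  }

  // Returns the parsed URI, or throws if it is missing or not thrift://host:port.
  static URI metastoreUri(Map<String, String> props) {
    String raw = props.get("tajo.catalog.uri");
    if (raw == null) {
      throw new IllegalArgumentException("tajo.catalog.uri is not set");
    }
    URI uri = URI.create(raw);
    if (!"thrift".equals(uri.getScheme()) || uri.getHost() == null || uri.getPort() < 0) {
      throw new IllegalArgumentException("expected thrift://host:port but got " + raw);
    }
    return uri;
  }
}
```

Calling CatalogSiteCheck.metastoreUri(CatalogSiteCheck.readProperties(xml)) on the file contents either returns the host and port to connect to or fails early with a message naming the problem.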

Also, I borrowed JobConf from CDH and removed its static resource file 
declarations. If JobConf loads the static resource files (mapred-default.xml, 
mapred-site.xml), TajoWorker fails with the following error message:
{code:xml}
Service:org.apache.tajo.worker.TajoWorkerManagerService is started.
2013-11-08 23:44:03,411 INFO  worker.TaskRunnerManager (TaskRunnerManager.java:run(139)) - FinishedQueryMasterTaskCleanThread started: expire interval minutes = 720
2013-11-08 23:44:03,419 ERROR service.CompositeService (CompositeService.java:start(72)) - Error starting services org.apache.tajo.worker.TajoWorker
org.jboss.netty.channel.ChannelException: Failed to bind to: 0.0.0.0/0.0.0.0:8082
        at org.jboss.netty.bootstrap.ServerBootstrap.bind(ServerBootstrap.java:272)
        at org.apache.tajo.pullserver.TajoPullServerService.start(TajoPullServerService.java:237)
        at org.apache.hadoop.yarn.service.CompositeService.start(CompositeService.java:68)
        at org.apache.tajo.worker.TajoWorker.start(TajoWorker.java:257)
        at org.apache.tajo.worker.TajoWorker.startWorker(TajoWorker.java:125)
        at org.apache.tajo.worker.TajoWorker.main(TajoWorker.java:659)
Caused by: java.net.BindException: Address already in use
        at sun.nio.ch.Net.bind(Native Method)
        at sun.nio.ch.ServerSocketChannelImpl.bind(ServerSocketChannelImpl.java:124)
        at sun.nio.ch.ServerSocketAdaptor.bind(ServerSocketAdaptor.java:59)
        at org.jboss.netty.channel.socket.nio.NioServerBoss$RegisterTask.run(NioServerBoss.java:193)
        at org.jboss.netty.channel.socket.nio.AbstractNioSelector.processTaskQueue(AbstractNioSelector.java:366)
        at org.jboss.netty.channel.socket.nio.AbstractNioSelector.run(AbstractNioSelector.java:290)
        at org.jboss.netty.channel.socket.nio.NioServerBoss.run(NioServerBoss.java:42)
        at java.util.concurrent.ThreadPoolExecutor$Worker.runTask(ThreadPoolExecutor.java:895)
        at java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:918)
        at java.lang.Thread.run(Thread.java:680)
{code}
After I modified JobConf, TajoWorker always starts up successfully. 
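
The root cause in the trace is java.net.BindException: the pull server port (8082 in this log) was already taken when TajoWorker tried to bind it. If you hit a similar error, a small probe like the one below can tell you whether a port is free before you start the worker. This is only a sketch that is independent of Tajo; PortProbe and isFree are hypothetical names.

```java
import java.io.IOException;
import java.net.InetSocketAddress;
import java.net.ServerSocket;

// Hypothetical diagnostic helper, not part of Tajo or the patch.
final class PortProbe {

  // Tries to bind the port on all interfaces. Returns false if the bind fails,
  // for example because another process is already listening on that port.
  static boolean isFree(int port) {
    try (ServerSocket socket = new ServerSocket()) {
      socket.bind(new InetSocketAddress(port));
      return true;
    } catch (IOException e) {
      return false;
    }
  }
}
```

The probe releases the port again as soon as it returns, so another process could still take the port between the check and the real startup. It is meant for diagnosis, not as a guarantee.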

Finally, my development environment is as follows:
- hadoop 2.0.x-alpha
- hive-0.11.0  
- hcatalog-0.5.0

> HCatalogStore supports SELECT statement
> ---------------------------------------
>
>                 Key: TAJO-289
>                 URL: https://issues.apache.org/jira/browse/TAJO-289
>             Project: Tajo
>          Issue Type: Sub-task
>          Components: catalog
>    Affects Versions: 0.8-incubating
>            Reporter: JaeHwa Jung
>            Assignee: JaeHwa Jung
>             Fix For: 0.8-incubating
>
>         Attachments: TAJO-289.patch
>
>
> I designed Tajo to use Hive metastore with HCatalog.
> In this case, Tajo has to have an interface to connect to HCatalog.
> And I think that this interface is also useful for connecting to other 
> catalogs, such as different Hive catalogs or HBase catalogs.
> So, I named this interface CatalogDriver. It will have some properties as 
> follows:
> - catalog namespace name
> - catalog URI
> - catalog driver class
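
To make the three properties listed in the description concrete, here is one possible shape for such an interface. This is only an illustrative sketch: the method names and the SimpleCatalogDriver holder are assumptions, not the interface in the attached patch.

```java
import java.net.URI;

// Hypothetical shape for the CatalogDriver idea. The names below are
// assumptions for illustration, not the interface in the patch.
interface CatalogDriver {
  String namespace();    // catalog namespace name
  URI uri();             // catalog URI
  String driverClass();  // fully qualified class name of the catalog driver
}

// Simple immutable holder for the three properties.
final class SimpleCatalogDriver implements CatalogDriver {
  private final String namespace;
  private final URI uri;
  private final String driverClass;

  SimpleCatalogDriver(String namespace, URI uri, String driverClass) {
    this.namespace = namespace;
    this.uri = uri;
    this.driverClass = driverClass;
  }

  @Override public String namespace() { return namespace; }
  @Override public URI uri() { return uri; }
  @Override public String driverClass() { return driverClass; }
}
```

With a shape like this, each external catalog (a Hive catalog, an HBase catalog) would be described by one such object, and Tajo could pick the implementation from the driver class name.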



--
This message was sent by Atlassian JIRA
(v6.1#6144)
