[
https://issues.apache.org/jira/browse/TAJO-1388?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15393630#comment-15393630
]
Byunghwa Yun commented on TAJO-1388:
------------------------------------
Hi, [~jhkim].
Before I will create the PR of github, I hope to conform below DDL.
I remove ‘kafka.zk’ table option and use the storage plugin of tajo.
Over Kafka 0.9.0 version uses the URIs of brokers , that is bootstrap.servers
property, instead of zookeeper quorum.
For example.
storage-site.json
{
"spaces": {
"kafka_cluster1": {
"uri": "kafka://host1:9092,host2:9092,host3:9092"
}
}
}
DDL
CREATE TABLE [IF NOT EXISTS] <table_name> [(column_list)] TABLESPACE
kafka_cluster1
using kafka with (‘kafka.topic’=’<kafka_topic_name>’,[other options])
CREATE EXTERNAL TABLE [IF NOT EXISTS] <table_name> (column_list)
using kafka with (‘kafka.topic’=’<kafka_topic_name>’,[other options]) LOCATION
'kafka://host1:9092,host2:9092,host3:9092'
> [Umbrella] Kafka Storage Integration.
> -------------------------------------
>
> Key: TAJO-1388
> URL: https://issues.apache.org/jira/browse/TAJO-1388
> Project: Tajo
> Issue Type: New Feature
> Components: Storage
> Reporter: YeonSu Han
> Assignee: Byunghwa Yun
> Labels: kafka_storage
> Attachments: Kafka _Storage_Ingegration_draft.pdf
>
>
> Apache Kafka is one of the widely used message queueing system. If we can use
> the Kafka as Tajo storage, analysis area of Tajo user is be broaden. For
> example, as realtime analysis.
> For this, I propose 'Kafka storage'. Please review my proposal and give your
> opinion.
> * Table Creation
> {code:sql}
> CREATE [EXTERNAL] TABLE [IF NOT EXISTS] <table_name> [(<column_name>
> <data_type>, ... )]
> using kafka with
> (‘kafka.topic’=’<kafka_topic_name>’,‘kafka.zk’=’<kafka_zookeeper_info>’,[other
> options])
> {code}
> ** Use “kafka” keyword in “using” clause for creating kafka table in Tajo.
> ** kafka table name is mapped to a Tajo table name with , 'kafka.topic'
> property.
> * Column mapping of kafka message
> ** Delimited line mapping (default)
> ** json mapping
> ** ...
> * Concept
> ** The topic of kafka correspond to table.
> ** The partition of kafka correspond to file.
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)