[
https://issues.apache.org/jira/browse/FLINK-2168?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15814733#comment-15814733
]
Fabian Hueske commented on FLINK-2168:
--------------------------------------
I think we should limit this issue to implementing a projectable
{{BatchTableSource}} for HBase.
As such, all we need to do is to return a {{DataSet}} of a specific type. There
is no need to do any query parsing or query push down at the moment.
A user needs to configure the table source with the following information:
* name of the HBase table to scan
* a list of columns to scan, each with its type and a deserializer that
converts the {{byte[]}} value into that type.
Given this information, the table source needs to connect to HBase, fetch the
columns, deserialize the fields, and create the output records.
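A minimal sketch of the configuration and deserialization steps described above, in plain Java. It does not use the Flink or HBase APIs: the class {{HBaseSourceConfigSketch}}, its {{ColumnSpec}} type, the {{family:qualifier}} key format, and the map that stands in for a fetched HBase row are all hypothetical names chosen for illustration. A real table source would fetch the bytes from HBase instead.

```java
import java.nio.ByteBuffer;
import java.nio.charset.StandardCharsets;
import java.util.ArrayList;
import java.util.Arrays;
import java.util.HashMap;
import java.util.List;
import java.util.Map;
import java.util.function.Function;

// Hypothetical sketch: none of these names are Flink or HBase APIs.
class HBaseSourceConfigSketch {

    /** One configured column: family, qualifier, and a byte[] deserializer. */
    static final class ColumnSpec {
        final String family;
        final String qualifier;
        final Function<byte[], Object> deserializer;

        ColumnSpec(String family, String qualifier, Function<byte[], Object> deserializer) {
            this.family = family;
            this.qualifier = qualifier;
            this.deserializer = deserializer;
        }

        /** Assumed lookup key for the simulated row map. */
        String key() {
            return family + ":" + qualifier;
        }
    }

    final String tableName;
    final List<ColumnSpec> columns = new ArrayList<>();

    HBaseSourceConfigSketch(String tableName) {
        this.tableName = tableName;
    }

    /** Registers a column to scan together with its deserializer. */
    HBaseSourceConfigSketch addColumn(String family, String qualifier,
                                      Function<byte[], Object> deserializer) {
        columns.add(new ColumnSpec(family, qualifier, deserializer));
        return this;
    }

    /** Converts one fetched row (column key -> raw bytes) into an output record. */
    Object[] toRecord(Map<String, byte[]> rawRow) {
        Object[] record = new Object[columns.size()];
        for (int i = 0; i < columns.size(); i++) {
            ColumnSpec column = columns.get(i);
            byte[] raw = rawRow.get(column.key());
            // A cell that is absent from the row becomes a null field.
            record[i] = raw == null ? null : column.deserializer.apply(raw);
        }
        return record;
    }

    public static void main(String[] args) {
        HBaseSourceConfigSketch config = new HBaseSourceConfigSketch("people")
            .addColumn("info", "name", b -> new String(b, StandardCharsets.UTF_8))
            .addColumn("info", "age", b -> ByteBuffer.wrap(b).getInt());

        // Simulated row; a real source would read these bytes from HBase.
        Map<String, byte[]> row = new HashMap<>();
        row.put("info:name", "alice".getBytes(StandardCharsets.UTF_8));
        row.put("info:age", ByteBuffer.allocate(4).putInt(30).array());

        System.out.println(Arrays.toString(config.toRecord(row)));
    }
}
```

Keeping the deserializer next to each column means the record-building loop needs no type-specific logic. The integer encoding here is the 4-byte big-endian layout that {{ByteBuffer}} uses by default, which is an assumption about how the values were written.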
A projectable table source also needs to be able to restrict the scan to a
subset of the columns it is configured with. Which columns are needed is
figured out automatically by Calcite.
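The projection step could look roughly like the following sketch. It assumes the planner hands over the indices of the required columns as an {{int[]}}, and that the source answers with a copy of itself narrowed to those columns. The class {{ProjectableColumnsSketch}} and its methods are hypothetical and only stand in for the real table source; columns are represented here as plain {{family:qualifier}} strings.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.Collections;
import java.util.List;

// Hypothetical sketch: stands in for a projectable table source.
class ProjectableColumnsSketch {

    private final List<String> columns;

    ProjectableColumnsSketch(List<String> columns) {
        // Defensive copy so a projected source never mutates the original.
        this.columns = Collections.unmodifiableList(new ArrayList<>(columns));
    }

    /**
     * Returns a new source restricted to the columns at the given indices,
     * in the order requested. The original source is left unchanged.
     */
    ProjectableColumnsSketch projectFields(int[] fields) {
        List<String> projected = new ArrayList<>();
        for (int index : fields) {
            projected.add(columns.get(index));
        }
        return new ProjectableColumnsSketch(projected);
    }

    List<String> columns() {
        return columns;
    }

    public static void main(String[] args) {
        ProjectableColumnsSketch full = new ProjectableColumnsSketch(
            Arrays.asList("info:name", "info:age", "stats:visits"));

        // Indices as an optimizer might supply them for a query that
        // only touches the third and first columns.
        ProjectableColumnsSketch projected = full.projectFields(new int[] {2, 0});

        System.out.println(projected.columns());
    }
}
```

Returning a narrowed copy rather than mutating the source keeps the fully configured source reusable across queries that need different columns.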
> Add HBaseTableSource
> --------------------
>
> Key: FLINK-2168
> URL: https://issues.apache.org/jira/browse/FLINK-2168
> Project: Flink
> Issue Type: New Feature
> Components: Table API & SQL
> Affects Versions: 0.9
> Reporter: Fabian Hueske
> Assignee: ramkrishna.s.vasudevan
> Priority: Minor
> Labels: starter
>
> Add a {{HBaseTableSource}} to read data from an HBase table. The
> {{HBaseTableSource}} should implement the {{ProjectableTableSource}}
> (FLINK-3848) and {{FilterableTableSource}} (FLINK-3849) interfaces.
> The implementation can be based on Flink's {{TableInputFormat}}.
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)