[
https://issues.apache.org/jira/browse/FLINK-10245?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16607943#comment-16607943
]
Hequn Cheng edited comment on FLINK-10245 at 9/8/18 8:24 AM:
-------------------------------------------------------------
[~dangdangdang] Hi, thanks for posting the doc. Could you give comment access
to us?
Comments mainly include:
- Rename HBaseTableBuilder to HbaseBuilder. This task is dedicated to
Datastream Sink and TableBuilder may be used for TableSink.
- Buffer
** I think it's better to use rowkey to deduplicate data. We only have to keep
the last row for the same key.
** Besides count and checkpoint, take time into consideration, i.e., 10
seconds to flush data.
** We can use Hbase batch api to write data in bulk mode.
What do you think?
was (Author: hequn8128):
[~dangdangdang] Hi, thanks for posting the doc. Could you give comment access
to us?
> Add DataStream HBase Sink
> -------------------------
>
> Key: FLINK-10245
> URL: https://issues.apache.org/jira/browse/FLINK-10245
> Project: Flink
> Issue Type: Sub-task
> Components: Streaming Connectors
> Reporter: Shimin Yang
> Assignee: Shimin Yang
> Priority: Major
> Labels: pull-request-available
>
> Design documentation:
> [https://docs.google.com/document/d/1of0cYd73CtKGPt-UL3WVFTTBsVEre-TNRzoAt5u2PdQ/edit?usp=sharing]
--
This message was sent by Atlassian JIRA
(v7.6.3#76005)