[ https://issues.apache.org/jira/browse/BAHIR-99?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16097656#comment-16097656 ]
ASF GitHub Bot commented on BAHIR-99:
-------------------------------------

Github user rmetzger commented on a diff in the pull request:

    https://github.com/apache/bahir-flink/pull/17#discussion_r128921534

    --- Diff: flink-connector-kudu/README.md ---
    @@ -0,0 +1,75 @@
    +# Flink Kudu Connector
    +This connector provides a source and sink to [Apache Kudu](http://kudu.apache.org/)™
    +To use this connector, add the following dependency to your project:
    +
    +```
    +<dependency>
    +  <groupId>es.accenture</groupId>
    +  <artifactId>flink-kudu-connector</artifactId>
    +  <version>1.0</version>
    +</dependency>
    +```
    +
    +
    +Data flows patterns:
    +* Batch
    +    * Kudu -> DataSet\<RowSerializable\> -> Kudu
    +    * Kudu -> DataSet\<RowSerializable\> -> other source
    +    * Other source -> DataSet\<RowSerializable\> -> other source
    +* Stream
    +    * Other source -> DataStream \<RowSerializable\> -> Kudu
    +
    +
    +```java
    +
    +/* Batch mode - DataSet API -*/
    +
    +DataSet<RowSerializable> input = KuduInputBuilder.build(TABLE_SOURCE, KUDU_MASTER)
    +
    +// DataSet operations --> .map(), .filter(), reduce(), etc.
    +//result = input.map(...)
    +
    +result.output(new KuduOutputFormat(KUDU_MASTER, TABLE_SINK, columnNames, KuduOutputFormat.CREATE));
    +
    +KuduInputBuilder.env.execute();
    --- End diff --

    Why do you recommend accessing the execution environment from the `KuduInputBuilder`?


> Kudu connector to read/write from/to Kudu
> -----------------------------------------
>
>                 Key: BAHIR-99
>                 URL: https://issues.apache.org/jira/browse/BAHIR-99
>             Project: Bahir
>          Issue Type: New Feature
>          Components: Flink Streaming Connectors
>    Affects Versions: Flink-1.0
>            Reporter: Rubén Casado
>            Assignee: Rubén Casado
>             Fix For: Flink-Next
>
>
> Java library to integrate Apache Kudu and Apache Flink. Main goal is to be
> able to read/write data from/to Kudu using the DataSet and DataStream Flink's
> APIs.
> Data flows patterns:
> Batch
>   - Kudu -> DataSet<RowSerializable> -> Kudu
>   - Kudu -> DataSet<RowSerializable> -> other source
>   - Other source -> DataSet<RowSerializable> -> other source
> Stream
>   - Other source -> DataStream <RowSerializable> -> Kudu
> Code is available in https://github.com/rubencasado/Flink-Kudu



--
This message was sent by Atlassian JIRA
(v6.4.14#64029)