[ https://issues.apache.org/jira/browse/SPARK-17935?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15621541#comment-15621541 ]
zhangxinyu commented on SPARK-17935: ------------------------------------ Thanks Cody Koeninger! Here is my opinion: * Kafka producer isn't idempotent. Yes, we only can make sure data is "at least once" in KafkaSink. Howerver, this problem shouldn't be solved here. * KafkaSinkRDD isn't necessary. Yes, it is indeed unnecessary, and let me modify the kafkaSink design doc. This suggestion is very useful! * CachedKafkaProducer is necessary, in case that users require to create two or more prodcuers with different producer paragrams. > Add KafkaForeachWriter in external kafka-0.8.0 for structured streaming module > ------------------------------------------------------------------------------ > > Key: SPARK-17935 > URL: https://issues.apache.org/jira/browse/SPARK-17935 > Project: Spark > Issue Type: Improvement > Components: SQL, Streaming > Affects Versions: 2.0.0 > Reporter: zhangxinyu > > Now spark already supports kafkaInputStream. It would be useful that we add > `KafkaForeachWriter` to output results to kafka in structured streaming > module. > `KafkaForeachWriter.scala` is put in external kafka-0.8.0. -- This message was sent by Atlassian JIRA (v6.3.4#6332) --------------------------------------------------------------------- To unsubscribe, e-mail: issues-unsubscr...@spark.apache.org For additional commands, e-mail: issues-h...@spark.apache.org