Github user ScrapCodes commented on a diff in the pull request:

    https://github.com/apache/spark/pull/17308#discussion_r116158469
  
    --- Diff: 
external/kafka-0-10-sql/src/main/scala/org/apache/spark/sql/kafka010/KafkaSink.scala
 ---
    @@ -30,14 +30,19 @@ private[kafka010] class KafkaSink(
       @volatile private var latestBatchId = -1L
     
       override def toString(): String = "KafkaSink"
    +  private val kafkaParams = new ju.HashMap[String, 
Object](executorKafkaParams)
     
       override def addBatch(batchId: Long, data: DataFrame): Unit = {
         if (batchId <= latestBatchId) {
           logInfo(s"Skipping already committed batch $batchId")
         } else {
           KafkaWriter.write(sqlContext.sparkSession,
    -        data.queryExecution, executorKafkaParams, topic)
    +        data.queryExecution, kafkaParams, topic)
           latestBatchId = batchId
         }
       }
    +
    +  override def stop(): Unit = {
    +    CachedKafkaProducer.close(kafkaParams)
    --- End diff --
    
    That's correct, I have understood, close requires a bit of rethinking, I am 
unable to see a straight forward way of doing it. If you agree, close related 
implementation can be taken out from this PR and be taken up in a separate JIRA 
and PR ?


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at [email protected] or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

Reply via email to