gaborgsomogyi opened a new pull request #24932: [MINOR][SQL][DOCS] 
failOnDataLoss has effect on batch queries so fix the doc
URL: https://github.com/apache/spark/pull/24932
 
 
   ## What changes were proposed in this pull request?
   
   According to the [Kafka integration 
document](https://spark.apache.org/docs/2.4.0/structured-streaming-kafka-integration.html)
 `failOnDataLoss` has effect only on streaming queries. While I was 
implementing the DSv2 Kafka batch sources I've realized it's not true. This 
feature is covered in 
[KafkaDontFailOnDataLossSuite](https://github.com/apache/spark/blob/54da3bbfb2c936827897c52ed6e5f0f428b98e9f/external/kafka-0-10-sql/src/test/scala/org/apache/spark/sql/kafka010/KafkaDontFailOnDataLossSuite.scala#L180).
   
   In this PR I've updated the doc to reflect this behavior.
   
   ## How was this patch tested?
   
   ```
   cd docs/
   SKIP_API=1 jekyll build
   ```
   Manual webpage check.
   

----------------------------------------------------------------
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


With regards,
Apache Git Services

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org

Reply via email to