GitHub user gaborgsomogyi opened a pull request:
https://github.com/apache/spark/pull/20703
[SPARK-19185][SS] Make Kafka consumer cache configurable
## What changes were proposed in this pull request?
Use the property `spark.streaming.kafka.consumer.cache.enabled`, introduced
[here](https://github.com/apache/spark/pull/18234), in structured streaming so
that users can enable or disable the cache for Kafka consumers. This property
is especially handy when issues like
[SPARK-19185](https://issues.apache.org/jira/browse/SPARK-19185) are hit, for
which no fix has been committed yet. The cache remains enabled by default, so
this change does not alter any out-of-the-box behavior. The structured
streaming problem is reported in
[SPARK-23526](https://issues.apache.org/jira/browse/SPARK-23526).
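As a rough sketch of how a user might turn the cache off, assuming the property is read from the application's Spark configuration as proposed here, it could be passed at submit time. The application class and jar names below are hypothetical placeholders, not part of this change:

```shell
# Sketch only: disable the Kafka consumer cache for one application.
# com.example.MyStreamingApp and my-streaming-app.jar are placeholder names.
spark-submit \
  --conf spark.streaming.kafka.consumer.cache.enabled=false \
  --class com.example.MyStreamingApp \
  my-streaming-app.jar
```

Leaving the `--conf` line out keeps the default behavior, where the cache stays enabled.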
## How was this patch tested?
Automated: Passed Jenkins.
Manually:
```
cd docs
jekyll build
SKIP_API=1 jekyll build
open _site/structured-streaming-kafka-integration.html
```
You can merge this pull request into a Git repository by running:
$ git pull https://github.com/gaborgsomogyi/spark SPARK-19185
Alternatively you can review and apply these changes as the patch at:
https://github.com/apache/spark/pull/20703.patch
To close this pull request, make a commit to your master/trunk branch
with (at least) the following in the commit message:
This closes #20703
----
commit aa8eed44ad1b5e4bdee9c8c32a44fb07b1fa9f2e
Author: Gabor Somogyi <gabor.g.somogyi@...>
Date: 2018-03-01T13:47:49Z
[SPARK-19185][SS] Make Kafka consumer cache configurable
----