GitHub user uncleGen opened a pull request: https://github.com/apache/spark/pull/16629
[SPARK-19185][DStream] Add more clear hint for 'ConcurrentModificationExceptions' ## What changes were proposed in this pull request? When same kafka partition is consumed from multiple threads, task will fail with `ConcurrentModificationExceptions`. KafkaConsumer is not safe for multi-threaded access. So, we may give a more clear hint for users when encounter problems. Besides, a new config `spark.streaming.kafka.consumer.cache.enabled` is added for users to use consumer cache or not ## How was this patch tested? existing ut You can merge this pull request into a Git repository by running: $ git pull https://github.com/uncleGen/spark SPARK-19185 Alternatively you can review and apply these changes as the patch at: https://github.com/apache/spark/pull/16629.patch To close this pull request, make a commit to your master/trunk branch with (at least) the following in the commit message: This closes #16629 ---- commit 384710db67fa2440f50ece4df64a8c1b996f7167 Author: uncleGen <husty...@gmail.com> Date: 2017-01-18T03:08:49Z Add more clear hint for 'ConcurrentModificationExceptions' commit b8b44ef0c62267425fb2b8ed3ab16f10d303f7ca Author: uncleGen <husty...@gmail.com> Date: 2017-01-18T03:18:34Z update ---- --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes so, or if the feature is enabled but not working, please contact infrastructure at infrastruct...@apache.org or file a JIRA ticket with INFRA. --- --------------------------------------------------------------------- To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org For additional commands, e-mail: reviews-h...@spark.apache.org