HeartSaVioR commented on issue #24613: [SPARK-27549][SS] Add support for committing kafka offsets per batch for supporting external tooling
URL: https://github.com/apache/spark/pull/24613#issuecomment-494024240

> Is there a way for the user to optionally provide a full group.id per Spark Query?

You can provide a full `group.id` starting with Spark 3.0.0, but the option should be used with full awareness of its impact. Spark's fault-tolerance guarantees depend on a reliable source, and sharing a consumer group with another application will simply break them. That is why Spark creates a unique consumer group id.

https://github.com/apache/spark/blob/master/docs/structured-streaming-kafka-integration.md
(search for `kafka.group.id` in the link above)

The `query ID` might be considered as a unique group id, since it is both unique and continuous, but that approach would have to handle the case where multiple Kafka sources are used in the same query.
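
To illustrate the multiple-sources concern, here is a minimal sketch (not something Spark does itself) of deriving one group id per Kafka source from a stable query identifier. The helper names `kafka_group_id` and `kafka_source_options`, the `spark-kafka-source` prefix, and the idea of appending a source index are all assumptions made for this example. `query_id` stands in for any stable identifier the user chooses for the query. Only the `kafka.group.id` and `subscribe` option names come from the Spark Kafka integration guide.

```python
from typing import Dict, List


def kafka_group_id(query_id: str, source_index: int,
                   prefix: str = "spark-kafka-source") -> str:
    """Build a group id that is unique per (query, source) pair.

    Hypothetical helper: the naming scheme is an assumption, not Spark's own.
    """
    if source_index < 0:
        raise ValueError("source_index must be non-negative")
    return f"{prefix}-{query_id}-{source_index}"


def kafka_source_options(query_id: str, topics: List[str]) -> List[Dict[str, str]]:
    """Return one options dict per Kafka source in the same query.

    Each source gets its own `kafka.group.id`, so two sources in one query
    never share a consumer group.
    """
    return [
        {
            "subscribe": topic,
            "kafka.group.id": kafka_group_id(query_id, index),
        }
        for index, topic in enumerate(topics)
    ]


if __name__ == "__main__":
    # Two Kafka sources in the same (hypothetical) query.
    for options in kafka_source_options("my-query", ["orders", "payments"]):
        print(options)
```

Each resulting dict could then be passed to a Kafka reader through its `.option(...)` calls, together with the other settings the source requires, such as `kafka.bootstrap.servers`. The caveat from the comment still applies: a fixed group id is only safe if no other application uses the same consumer group.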
