HeartSaVioR commented on issue #24613: [SPARK-27549][SS] Add support for 
committing kafka offsets per batch for supporting external tooling
URL: https://github.com/apache/spark/pull/24613#issuecomment-494024240
 
 
   > Is there a way for the user to optionally provide a full group.id per 
Spark Query?
   
   You can provide a full `group.id` starting at Spark 3.0.0, but the option 
should be used with full aware of the impacts. Spark fault-tolerance guarantees 
come from reliable source and sharing consumer group with other app will simply 
break it - that's why Spark creates unique consumer group id.
   
   
https://github.com/apache/spark/blob/master/docs/structured-streaming-kafka-integration.md
   (search `kafka.group.id` in above link)
   
   `query ID` might be considered as unique group id since it can provide both 
unique and continuous, but it should consider the case where multiple Kafka 
sources are being used in same query.

----------------------------------------------------------------
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
[email protected]


With regards,
Apache Git Services

---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

Reply via email to