[ https://issues.apache.org/jira/browse/KAFKA-2017?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14957204#comment-14957204 ]
Joel Koshy commented on KAFKA-2017: ----------------------------------- I agree with those benefits. If we go that route then I would prefer compaction over other retention policies. You do lose some history, but _typically_ you would have the last couple of state entries available if you overpartition and time-based or size-based retention would anyway keep only a certain amount of history. I think you can merge this in with the offsets topic (and rename it to something like __consumer_state or something like that). If we merge then we will end up having a heterogeneous topic with different keys (group-topic-partition for offsets and group for state) but that should be fine. WRT implementation complexity that was referenced above: I agree it is more complicated to implement than ZK storage if we implement it from scratch but I don't think we need to right? i.e., all of the fault-tolerance and caching logic is already there in offset manager. So on coordinator failover the new coordinator just reads from the state partitions that it now leads and loads into memory (as we already do for offsets). Is there any other implementation complexity that I'm missing? > Persist Coordinator State for Coordinator Failover > -------------------------------------------------- > > Key: KAFKA-2017 > URL: https://issues.apache.org/jira/browse/KAFKA-2017 > Project: Kafka > Issue Type: Sub-task > Components: consumer > Affects Versions: 0.9.0.0 > Reporter: Onur Karaman > Assignee: Guozhang Wang > Fix For: 0.9.0.0 > > Attachments: KAFKA-2017.patch, KAFKA-2017_2015-05-20_09:13:39.patch, > KAFKA-2017_2015-05-21_19:02:47.patch > > > When a coordinator fails, the group membership protocol tries to failover to > a new coordinator without forcing all the consumers rejoin their groups. This > is possible if the coordinator persists its state so that the state can be > transferred during coordinator failover. This state consists of most of the > information in GroupRegistry and ConsumerRegistry. -- This message was sent by Atlassian JIRA (v6.3.4#6332)