[ 
https://issues.apache.org/jira/browse/KAFKA-987?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13719723#comment-13719723
 ] 

Jun Rao commented on KAFKA-987:
-------------------------------

1. The issue on startup is the following. If a consumer starts up from the end 
of the log and there is no new message coming in, no offset will be 
checkpointed to ZK. This will affect tools like ConsumerOffsetChecker.

2. During rebalance, a consumer may pick up offsets committed by other consumer 
instances. If we don't update the offset cache in addPartitionTopicInfo(), we 
will do an extra unnecessary offset update to ZK.

It seems to me that the impact for #1 is bigger than the slight performance 
impact in #2. Another way to do that is to always force the very first offset 
(per partition) write to ZK. However, I am not sure if it's worth the 
complexity.
                
> Avoid checkpointing offsets in Kafka consumer that have not changed since the 
> last commit
> -----------------------------------------------------------------------------------------
>
>                 Key: KAFKA-987
>                 URL: https://issues.apache.org/jira/browse/KAFKA-987
>             Project: Kafka
>          Issue Type: Bug
>    Affects Versions: 0.8
>            Reporter: Swapnil Ghike
>            Assignee: Swapnil Ghike
>              Labels: improvement
>             Fix For: 0.8
>
>         Attachments: kafka-987.patch, kafka-987-v2.patch
>
>
> We need to fix the Kafka zookeeper consumer to avoid checkpointing offsets 
> that have not changed since the last offset commit. This will help reduce the 
> write load on zookeeper.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira

Reply via email to