[
https://issues.apache.org/jira/browse/STORM-1017?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14734794#comment-14734794
]
Sriharsha Chintalapani commented on STORM-1017:
-----------------------------------------------
[~RenkaiGe] KafkaSpout uses the following config to store offsets
SpoutConfig(BrokerHosts hosts, String topic, String zkRoot, String id);
zkRoot/id will be the node on zookeeper. You can use zkCli.sh to login into the
zookeeper you are using for this spout and do rmr zkRoot/id make sure this is
the same one you give in Spoutconfig.
> If ignoreZkOffsets set true,KafkaSpout will reset zk offset when recover from
> failure.
> --------------------------------------------------------------------------------------
>
> Key: STORM-1017
> URL: https://issues.apache.org/jira/browse/STORM-1017
> Project: Apache Storm
> Issue Type: Bug
> Components: storm-kafka
> Reporter: Renkai Ge
> Assignee: Sriharsha Chintalapani
>
> when ignoreZkOffsets set true and startOffsetTime =
> kafka.api.OffsetRequest.EarliestTime().
> workers running -> topology shutdown by user and restart -> workers will read
> from earliest time again
> workers running -> one of workers shutdown by accident and supervisor restart
> the worker -> what offset will the restarted worker read from?
> More details on
> https://github.com/apache/storm/pull/493#issuecomment-135783234
> It will cause a lot of unwanted duplicated messages in some scenes.
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)