[jira] [Commented] (STORM-495) Add delayed retries to KafkaSpout

ASF GitHub Bot (JIRA) Wed, 01 Oct 2014 17:48:01 -0700

    [ https://issues.apache.org/jira/browse/STORM-495?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14155895#comment-14155895 ]


ASF GitHub Bot commented on STORM-495:
--------------------------------------

Github user rick-kilgore commented on a diff in the pull request:

    https://github.com/apache/storm/pull/254#discussion_r18317774
  
    --- Diff: external/storm-kafka/src/jvm/storm/kafka/SpoutConfig.java ---
    @@ -26,8 +26,19 @@
         public Integer zkPort = null;
         public String zkRoot = null;
         public String id = null;
    +
    +    // setting for how often to save the current kafka offset to ZooKeeper
         public long stateUpdateIntervalMs = 2000;
     
    +    // Exponential back-off retry settings.  These are used when retrying messages after a bolt
    +    // calls OutputCollector.fail().
    +    //
    +    // Note: be sure to set backtype.storm.Config.MESSAGE_TIMEOUT_SECS appropriately to prevent
    +    // resubmitting the message while still retrying.
    --- End diff --
    
    Actually, the spout can't know how many times a topology is going to 
retry its messages, so at startup I can't tell with certainty whether the 
retries will exceed TOPOLOGY_MESSAGE_TIMEOUT_SECS.  On top of that, I don't 
know how long any given try will take to complete or fail.
    
    I __can__ detect the problem inside fail().  Maybe I could add some kind of 
response there, like logging an error and just not retrying (since I know that 
the timeout-triggered resubmission will happen before the spout's own retry 
would)?
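
    A minimal sketch of the check described above, not code from the pull 
request.  The class RetryBudget, its method names, and all parameters are 
hypothetical.  It assumes the spout records how long ago a message was first 
emitted and that the backoff is initialDelayMs * multiplier^attempt, capped at 
maxDelayMs.

```java
// Hypothetical helper for illustration only; not part of storm-kafka.
final class RetryBudget {
    private RetryBudget() {}

    // Delay before retry number `attempt` (0-based):
    // initialDelayMs * multiplier^attempt, capped at maxDelayMs.
    static long delayMs(int attempt, long initialDelayMs, double multiplier, long maxDelayMs) {
        double delay = initialDelayMs * Math.pow(multiplier, attempt);
        return delay >= maxDelayMs ? maxDelayMs : (long) delay;
    }

    // True if a retry scheduled now would fire before the message timeout
    // elapses.  elapsedMs is the time since the message was first emitted
    // (assumed to be tracked by the spout).
    static boolean retryFitsBeforeTimeout(long elapsedMs, long nextDelayMs, long messageTimeoutMs) {
        return elapsedMs + nextDelayMs < messageTimeoutMs;
    }
}
```

    Inside fail(), the spout could compute the next delay with delayMs() and, 
when retryFitsBeforeTimeout() returns false, log an error and skip its own 
retry.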


> Add delayed retries to KafkaSpout
> ---------------------------------
>
>                 Key: STORM-495
>                 URL: https://issues.apache.org/jira/browse/STORM-495
>             Project: Apache Storm
>          Issue Type: Improvement
>    Affects Versions: 0.9.3
>         Environment: all environments
>            Reporter: Rick Kilgore
>            Priority: Minor
>              Labels: kafka, retry
>
> If a tuple in the topology originates from the KafkaSpout from the 
> external/storm-kafka sources, and if a bolt in the topology indicates a 
> failure by calling fail() on its OutputCollector, the KafkaSpout will 
> immediately retry the message.
> We wish to use this failure and retry behavior in our ingestion system 
> whenever we experience a recoverable error from a downstream system, such as 
> a 500 or 503 error from a service we depend on.  But with the current 
> KafkaSpout behavior, doing so results in a tight loop where we retry several 
> times over a few seconds and then give up.  I want to be able to delay retry 
> to give the downstream service some time to recover.  Ideally, I would like 
> to have configurable, exponential backoff retry.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)
