Github user ptgoetz commented on a diff in the pull request:

    https://github.com/apache/storm/pull/493#discussion_r27512587
  
    --- Diff: external/storm-kafka/README.md ---
    @@ -120,6 +120,23 @@ spoutConf.scheme = new SchemeAsMultiScheme(new 
StringScheme());
     OpaqueTridentKafkaSpout spout = new OpaqueTridentKafkaSpout(spoutConf);
     ```
     
    +### How KafkaSpout stores offsets of a kafka topic and recovers incase of 
failures
    +
    +As shown in the above KafkaConfig properties , user can control where in 
the topic they can start reading by setting **KafkaConfig.startOffsetTime.**
    +
    +There are two options **kafka.api.OffsetRequest.EarliestTime()** which 
makes the KafkaSpout to read from the begining of the topic and 
    --- End diff --
    
    I would also document the actual values of `EarliestTime()` (`-2`) and 
`LatestTime()` (`-1`), and that it can also be set to a point in time (a la 
`System.currentTimeMillis()`).
    
    My reasoning behind documenting the values (as opposed to the kafka API 
constants) is that the start offset time is likely to be specified via 
configuration (i.e. outside java code). Either that, or add spout constants 
that would get evaluated to `EarliestTime()`/`LatestTime()` if for some reason 
those values were ever changed in the Kafka API -- that seems like a less 
"leaky" solution.
    



---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at [email protected] or file a JIRA ticket
with INFRA.
---

Reply via email to