Martin Kleppmann created SAMZA-273:
--------------------------------------

             Summary: Make offset-related configuration more intuitive
                 Key: SAMZA-273
                 URL: https://issues.apache.org/jira/browse/SAMZA-273
             Project: Samza
          Issue Type: Improvement
    Affects Versions: 0.7.0
            Reporter: Martin Kleppmann


There are a few configuration parameters which control the offset at which a 
consumer starts:

* systems.\*.samza.reset.offset (whether to ignore checkpoints on container 
startup)
* systems.\*.samza.offset.default (what to do if there is no checkpoint)
* systems.\*.consumer.auto.offset.reset (what to do if the requested offset is 
out of range of the broker's stream history)
* CheckpointTool isn't a config per se, but is also related to consumer offsets

Although they are all valid, they are not really great. The parameter names are 
a bit obscure (I still don't remember them, even though I've been staring at 
them for some time), there are subtle interactions between them, and generally 
I feel they are set up from the framework's internals' point of view, rather 
than the "what is the job trying to accomplish" point of view. (Put another 
way, you need to understand how Samza works internally in order to make sense 
of them.)

I don't have an answer of what a better design would look like. This ticket is 
just a place to discuss how we could make offset-related configuration easier 
for job authors to understand and use.



--
This message was sent by Atlassian JIRA
(v6.2#6252)

Reply via email to