To answer my own question (partially), I have learned that max.partition.fetch.bytes, which defaults to 1048576 bytes (1 MB), will affect the number of records returned by each call to poll().

I also learned that seekToBeginning is a partition-level thing, but props.put("auto.offset.reset","earliest") has the desired effect.

On Fri, Apr 22, 2016 at 11:08 AM, Richard Rodseth <rrods...@gmail.com> wrote:

> Do I understand correctly that poll() will return a subset of the messages
> in a topic each time it is called? So if I want to replay all messages, I
> would seek to the beginning and call poll in a loop? Not easily knowing
> when I was done, without a high watermark
>
> https://issues.apache.org/jira/browse/KAFKA-2076
>
> This is a pretty basic question, but I don't think it is explained in the
> JavaDoc
>
>
> http://kafka.apache.org/090/javadoc/index.html?org/apache/kafka/clients/consumer/KafkaConsumer.html
>
> Thanks
>
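
For anyone finding this thread later, here is a rough sketch of the consumer settings discussed above, written against java.util.Properties only so it does not need the Kafka jars to compile. The class name ReplayConfig, the method replayProps, the broker address, and the group id are all made up for illustration. Setting enable.auto.commit to false and using the String deserializers are my own assumptions, not something Kafka requires for a replay. As far as I can tell, auto.offset.reset only applies when the group has no committed offset for a partition, so a fresh group.id is assumed here.

```java
import java.util.Properties;

// Hypothetical helper (not part of Kafka): collects the consumer settings
// mentioned in this thread for a "replay from the beginning" consumer.
final class ReplayConfig {

    static Properties replayProps(String bootstrapServers, String groupId,
                                  int maxPartitionFetchBytes) {
        Properties props = new Properties();
        props.setProperty("bootstrap.servers", bootstrapServers);
        // Assumption: a group id with no committed offsets, so that
        // auto.offset.reset actually takes effect.
        props.setProperty("group.id", groupId);
        // Start from the earliest available offset when none is committed.
        props.setProperty("auto.offset.reset", "earliest");
        // Assumption: do not commit offsets, so the next run replays again.
        props.setProperty("enable.auto.commit", "false");
        // Upper bound on data fetched per partition, which in turn limits
        // how many records a single poll() can return.
        props.setProperty("max.partition.fetch.bytes",
                Integer.toString(maxPartitionFetchBytes));
        props.setProperty("key.deserializer",
                "org.apache.kafka.common.serialization.StringDeserializer");
        props.setProperty("value.deserializer",
                "org.apache.kafka.common.serialization.StringDeserializer");
        return props;
    }

    public static void main(String[] args) {
        Properties props = replayProps("localhost:9092",
                "replay-" + System.currentTimeMillis(), 1048576);
        System.out.println(props.getProperty("auto.offset.reset"));
    }
}
```

The resulting Properties would then be passed to the KafkaConsumer constructor, followed by subscribe() and a loop around poll(). Knowing when the loop has caught up is still the open part of my original question.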