Stig, Thanks for your clear explanation. It is quite helpful.
Regards, Ramin From: [email protected] At: 04/26/18 06:12:31To: [email protected] Subject: Re: ConsumerCoordinator Offset commit failed on partition xxx at offset nnn This is handled by the KafkaConsumer. As far as I can tell, the consumer will just retry. See https://github.com/apache/kafka/blob/trunk/clients/src/main/java/org/apache/kafka/clients/consumer/internals/ConsumerCoordinator.java#L748 and https://github.com/apache/kafka/blob/trunk/clients/src/main/java/org/apache/kafka/clients/consumer/internals/ConsumerCoordinator.java#L610. The request timeout error is retriable, so the consumer will try again. The spout calls commit synchronously, so until the commit succeeds (or fails in some non-retriable way), the spout will be blocked in the commitSync call. The consequence of the timeout should just be a delay while the consumer is retrying. If the commit ends up succeeding, there will be no other consequence. If the commit fails entirely (i.e. the commitSync method on the KafkaConsumer throws a CommitFailedException), the worker running the spout will crash, and the tuples will be reprocessed starting at the last committed offset. If this happens you will be able to see it in the log, since the worker will die. 2018-04-23 23:24 GMT+02:00 Ramin Farajollah (BLOOMBERG/ 731 LEX) <[email protected]>: Hi, We use a custom Kafka spout in our Apache Storm topology. When a machine in the Kafka cluster is bounced, we see the following error message in the logs: ERROR ConsumerCoordinator [thread-iii] - [Consumer clientId=consumer-1, groupId=xxxxx] Offset commit failed on partition yyyy at offset nnn: The request timed out. I understand Kafka's enable.auto.commit is not allowed in Storm 1.1.1. Please help me understand what is the consequence of this timeout: - Is a tuple lost or reprocessed? - Is the tuple processed out of order (at a later time)? Thank you Ramin
