Nifi/Kafka Rebalance Error in Nifi 2.5

Reinhard Sell Wed, 14 Jan 2026 05:43:40 -0800

Hi!

We have some issues/questions regarding the new Kafka3ConnectionServiceand the related consumer client processor.


Any advice would be would be much appreciated!


## In short:

We often see the following exception, whenever a consumer clientprocessor is started or stopped that is part of a larger client group:

```

RebalanceInProgressException: Offset commit cannot be completed sincethe consumer is undergoing a rebalance for auto partition assignment.You can try completing the rebalance by calling poll() and then retrythe operation.

```

We never saw anything similar with the "old" Kafka Consumer clientprocessor from Nifi 1.28: Rebalancing and committing offsets worked asexpected.

This issue causes some messages to be processed twice. So it's a realproblem.



## More details:

We operate a Nifi Cluster with three nodes and more than 1000 Nifiprocessors. It communicates with a Kafka cluster that also has three nodes.

Recently we upgraded from Nifi 1.28 to 2.5. The upgrade waswell-prepared and went smoothly. To avoid potential issues, we initiallykept the "old" Kafka Consumer and Producer client processors.Now we want to complete the upgrade and replace them with the newKafka3ConnectionService and its related processors.

However, we see the following Kafka consumer error in our Nifi clusterwhen using the new Kafka Connection Service:

```

INFO [Timer-Driven Process Thread-6]o.a.k.c.c.internals.ConsumerCoordinator [ConsumerclientId=consumer-rebalance-test-39, groupId=rebalance-test] FailingOffsetCommit request since the consumer is not part of an active groupERROR [Timer-Driven Process Thread-6]o.a.nifi.kafka.processors.ConsumeKafkaConsumeKafka[id=a31808c3-019b-1000-0000-0000627885f5] Failed to commitoffsets for Kafka Consumer Service; will attempt to rollback to latestcommitted offsetsorg.apache.kafka.common.errors.RebalanceInProgressException: Offsetcommit cannot be completed since the consumer is undergoing a rebalancefor auto partition assignment. You can try completing the rebalance bycalling poll() and then retry the operation.

```

This error occurs reproducibly when a consumer client processor that ispart of a larger client group is started or stopped.

The rebalance as such is completely expected: The Kafka broker mustinitiate a rebalance, as the number of consumer clients changes.

But to our understanding, the Kafka client has some time to commitoutstanding offsets, even when the rebalancing has started.

The above error occurs almost immediately. Well before the`session.timeout.ms` is reached.

So it seems that the Kafka client gives up committing offsets once therebalancing has started, even though there would be plenty of time left.

It might well be that we use wrong timing parameters or that we aremissing other important configuration settings. Currently, we use thedefault values, where possible. Of course, we also tried to tune someparameters, however, so far without success.

We can reproduce this issue in a simple test setup with a Kafka topicwith 6 partitions and approximately 100 messages per partition and second.

With the same Nifi and Kafka clusters and the same topic and messagerate, this does not occur when using the "old" ConsumeKafka_2_6processor: Rebalancing and committing offsets are both executed withoutproblems.



Thanks a lot for any additional information!
Reinhard Sell

Nifi/Kafka Rebalance Error in Nifi 2.5

Reply via email to