[
https://issues.apache.org/jira/browse/KAFKA-5016?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15966278#comment-15966278
]
Vahid Hashemian commented on KAFKA-5016:
----------------------------------------
[~domenico74] I have not been able to reproduce this issue with the 0.10.2.0
code base.
I tried this with the command-line producer and consumer, and also coded a simple
Java-based producer and consumer to run your use case.
Here's what I did:
* Ran a producer that writes 10 messages to a non-existing topic {{test}} (this
causes the topic {{test}} to be auto-created with a single partition)
* Ran a consumer that belongs to the consumer group {{cgroup}} and consumes
from the topic {{test}}
* Ran a second consumer similar to the first one.
After this, when I run the consumer group command, I see this:
{code}
$ bin/kafka-consumer-groups.sh --bootstrap-server localhost:9092 --describe --group cgroup
TOPIC  PARTITION  CURRENT-OFFSET  LOG-END-OFFSET  LAG  CONSUMER-ID                                      HOST        CLIENT-ID
test   0          10              10              0    consumer-1-1c06c568-7a83-42ea-a2d7-8a53132397bd  /127.0.0.1  consumer-1
-      -          -               -               -    consumer-1-73aaff69-b43f-4be3-8b48-99bf337abc9a  /127.0.0.1  consumer-1
{code}
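The output above can also be checked programmatically, for example from a test harness. The following is only a minimal sketch: the class {{DescribeOutputCheck}} and its methods are hypothetical helpers (not part of Kafka), and it assumes the eight-column layout printed by 0.10.2.0 with each row on a single line. A member whose TOPIC column is {{-}} holds no partition, which is expected for the second consumer on a single-partition topic.

```java
import java.util.ArrayList;
import java.util.List;

// Hypothetical helper, not part of Kafka: inspects text captured from
// "kafka-consumer-groups.sh --describe" (column layout as printed by 0.10.2.0).
public class DescribeOutputCheck {

    // Assumption: each data row is one line with exactly 8 whitespace-separated
    // columns: TOPIC PARTITION CURRENT-OFFSET LOG-END-OFFSET LAG CONSUMER-ID HOST CLIENT-ID
    private static List<String[]> dataRows(String output) {
        List<String[]> rows = new ArrayList<>();
        for (String line : output.split("\\R")) {
            String[] fields = line.trim().split("\\s+");
            // Skip the header row and anything that is not a complete row.
            if (fields.length == 8 && !fields[0].equals("TOPIC")) {
                rows.add(fields);
            }
        }
        return rows;
    }

    // Consumer IDs of group members that currently hold no partition ("-" in TOPIC).
    static List<String> idleMembers(String output) {
        List<String> idle = new ArrayList<>();
        for (String[] fields : dataRows(output)) {
            if (fields[0].equals("-")) {
                idle.add(fields[5]);
            }
        }
        return idle;
    }

    // Sum of the LAG column over rows that have a partition assigned.
    static long totalLag(String output) {
        long lag = 0;
        for (String[] fields : dataRows(output)) {
            if (!fields[4].equals("-")) {
                lag += Long.parseLong(fields[4]);
            }
        }
        return lag;
    }

    public static void main(String[] args) {
        String sample = String.join("\n",
            "TOPIC PARTITION CURRENT-OFFSET LOG-END-OFFSET LAG CONSUMER-ID HOST CLIENT-ID",
            "test 0 10 10 0 consumer-1-1c06c568-7a83-42ea-a2d7-8a53132397bd /127.0.0.1 consumer-1",
            "- - - - - consumer-1-73aaff69-b43f-4be3-8b48-99bf337abc9a /127.0.0.1 consumer-1");
        System.out.println("Idle members: " + idleMembers(sample));
        System.out.println("Total lag: " + totalLag(sample));
    }
}
```

With the sample rows, one member is reported as idle and the total lag is zero, which matches a group that has rebalanced normally rather than one that is stuck.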
I also don't see any issues in my server log:
{code}
...
[2017-04-12 10:35:57,976] INFO [Group Metadata Manager on Broker 0]: Finished loading offsets from __consumer_offsets-42 in 1 milliseconds. (kafka.coordinator.GroupMetadataManager)
[2017-04-12 10:35:57,976] INFO [Group Metadata Manager on Broker 0]: Loading offsets and group metadata from __consumer_offsets-45 (kafka.coordinator.GroupMetadataManager)
[2017-04-12 10:35:57,977] INFO [Group Metadata Manager on Broker 0]: Finished loading offsets from __consumer_offsets-45 in 1 milliseconds. (kafka.coordinator.GroupMetadataManager)
[2017-04-12 10:35:57,977] INFO [Group Metadata Manager on Broker 0]: Loading offsets and group metadata from __consumer_offsets-48 (kafka.coordinator.GroupMetadataManager)
[2017-04-12 10:35:57,978] INFO [Group Metadata Manager on Broker 0]: Finished loading offsets from __consumer_offsets-48 in 1 milliseconds. (kafka.coordinator.GroupMetadataManager)
[2017-04-12 10:35:57,984] INFO [GroupCoordinator 0]: Preparing to restabilize group cgroup1 with old generation 0 (kafka.coordinator.GroupCoordinator)
[2017-04-12 10:35:57,987] INFO [GroupCoordinator 0]: Stabilized group cgroup1 generation 1 (kafka.coordinator.GroupCoordinator)
[2017-04-12 10:35:57,995] INFO [GroupCoordinator 0]: Assignment received from leader for group cgroup1 for generation 1 (kafka.coordinator.GroupCoordinator)
[2017-04-12 10:36:10,624] INFO [GroupCoordinator 0]: Preparing to restabilize group cgroup1 with old generation 1 (kafka.coordinator.GroupCoordinator)
[2017-04-12 10:36:13,116] INFO [GroupCoordinator 0]: Stabilized group cgroup1 generation 2 (kafka.coordinator.GroupCoordinator)
[2017-04-12 10:36:13,118] INFO [GroupCoordinator 0]: Assignment received from leader for group cgroup1 for generation 2 (kafka.coordinator.GroupCoordinator)
{code}
Would you be able to share the code you are using that leads to the issue?
Thanks.
> Consumer hang in poll method while rebalancing is in progress
> -------------------------------------------------------------
>
> Key: KAFKA-5016
> URL: https://issues.apache.org/jira/browse/KAFKA-5016
> Project: Kafka
> Issue Type: Bug
> Components: core
> Affects Versions: 0.10.1.0, 0.10.2.0
> Reporter: Domenico Di Giulio
> Assignee: Vahid Hashemian
> Attachments: Kafka 0.10.2.0 Issue (TRACE) - Server + Client.txt,
> Kafka 0.10.2.0 Issue (TRACE).txt
>
>
> After moving to Kafka 0.10.2.0, it looks like I'm experiencing a hang in the
> rebalancing code.
> This is a test case, not (yet) production code. It does the following with
> a single-partition topic and two consumers in the same group:
> 1) a topic with one partition is forced to be created (auto-created)
> 2) a producer is used to write 10 messages
> 3) the first consumer reads all the messages and commits
> 4) the second consumer attempts a poll() and hangs indefinitely
> The same issue can't be found with 0.10.0.0.
> See the attached logs at TRACE level. Look for "SERVER HANGS" to see where
> the hang is found: when this happens, the client keeps failing any heartbeat
> attempt, as the rebalancing is in progress, and the poll method hangs
> indefinitely.