Apurva Mehta created KAFKA-5339:
-----------------------------------
Summary: Transactions system test with hard broker bounces fails
sporadically
Key: KAFKA-5339
URL: https://issues.apache.org/jira/browse/KAFKA-5339
Project: Kafka
Issue Type: Sub-task
Reporter: Apurva Mehta
Assignee: Apurva Mehta
Priority: Blocker
The transactions hard bounce test occasionally fails because the transactional
message copy just seems to hang. In one of the client logs, I noticed:
{noformat}
[2017-05-27 20:36:12,596] WARN Got error produce response with correlation id
124 on topic-partition output-topic-0, retrying (2147483646 attempts left).
Error: NOT_LEADER_FOR_PARTITION
(org.apache.kafka.clients.producer.internals.Sender)
[2017-05-27 20:36:15,386] ERROR Uncaught error in kafka producer I/O thread:
(org.apache.kafka.clients.producer.internals.Sender)
java.lang.NullPointerException
at
org.apache.kafka.clients.producer.internals.TransactionManager$1.compare(TransactionManager.java:146)
at
org.apache.kafka.clients.producer.internals.TransactionManager$1.compare(TransactionManager.java:143)
at
java.util.PriorityQueue.siftDownUsingComparator(PriorityQueue.java:721)
at java.util.PriorityQueue.siftDown(PriorityQueue.java:687)
at java.util.PriorityQueue.poll(PriorityQueue.java:595)
at
org.apache.kafka.clients.producer.internals.TransactionManager.nextRequestHandler(TransactionManager.java:351)
at
org.apache.kafka.clients.producer.internals.Sender.maybeSendTransactionalRequest(Sender.java:303)
at
org.apache.kafka.clients.producer.internals.Sender.run(Sender.java:193)
at
org.apache.kafka.clients.producer.internals.Sender.run(Sender.java:154)
at java.lang.Thread.run(Thread.java:748)
[2017-05-27 20:36:52,007] INFO Closing the Kafka producer with timeoutMillis =
9223372036854775807 ms. (org.apache.kafka.clients.producer.KafkaProducer)
[2017-05-27 20:36:52,036] INFO Marking the coordinator knode02:9092 (id:
2147483645 rack: null) dead for group transactions-test-consumer-group
(org.apache.kafka.clients.consumer.internals.AbstractCoordinator)
root@7dcd60017519:/opt/kafka-dev/results/latest/TransactionsTest/test_transactions/failure_mode=hard_bounce.bounce_target=brokers/1#
{noformat}
This suggests that the client has gotten to a bad state which is why it stops
processing messages, causing the tests to fail.
--
This message was sent by Atlassian JIRA
(v6.3.15#6346)