[ 
https://issues.apache.org/jira/browse/FLINK-18634?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17284445#comment-17284445
 ] 

Dong Lin commented on FLINK-18634:
----------------------------------


- According to the log [1] from Till, testScaleUpAfterScalingDown failed due to 
error "Timeout expired after 60000milliseconds while awaiting InitProducerId"
- According to flink-connectors/flink-connector-kafka/pom.xml, the test is 
using Kafka 2.4.1.
- According to this comment in KAFKA-8803 [2], there are bugs in Kafka 2.4.1 
that could trigger the indefinite blocking on the client side with the same 
error message.
- According to KAFKA-8803, the bug should be fixed in Kafka 2.4.2. and 2.5.1.

I am able to reproduce the issue on my Macbook after running this test 42 
times. Let me bump up the Kafka dependency locally and see if the bug can still 
be reproduced after running this test 100 times.

[1] 
https://dev.azure.com/apache-flink/98463496-1af2-4620-8eab-a2ecc1a2e6fe/_apis/build/builds/12885/logs/124
[2] 
https://issues.apache.org/jira/browse/KAFKA-8803?focusedCommentId=17105691&page=com.atlassian.jira.plugin.system.issuetabpanels%3Acomment-tabpanel#comment-17105691

> FlinkKafkaProducerITCase.testRecoverCommittedTransaction failed with "Timeout 
> expired after 60000milliseconds while awaiting InitProducerId"
> --------------------------------------------------------------------------------------------------------------------------------------------
>
>                 Key: FLINK-18634
>                 URL: https://issues.apache.org/jira/browse/FLINK-18634
>             Project: Flink
>          Issue Type: Bug
>          Components: Connectors / Kafka, Tests
>    Affects Versions: 1.11.0, 1.12.0, 1.13.0
>            Reporter: Dian Fu
>            Assignee: Jiangjie Qin
>            Priority: Major
>              Labels: test-stability
>
> https://dev.azure.com/apache-flink/apache-flink/_build/results?buildId=4590&view=logs&j=c5f0071e-1851-543e-9a45-9ac140befc32&t=684b1416-4c17-504e-d5ab-97ee44e08a20
> {code}
> 2020-07-17T11:43:47.9693015Z [ERROR] Tests run: 12, Failures: 0, Errors: 1, 
> Skipped: 0, Time elapsed: 269.399 s <<< FAILURE! - in 
> org.apache.flink.streaming.connectors.kafka.FlinkKafkaProducerITCase
> 2020-07-17T11:43:47.9693862Z [ERROR] 
> testRecoverCommittedTransaction(org.apache.flink.streaming.connectors.kafka.FlinkKafkaProducerITCase)
>   Time elapsed: 60.679 s  <<< ERROR!
> 2020-07-17T11:43:47.9694737Z org.apache.kafka.common.errors.TimeoutException: 
> org.apache.kafka.common.errors.TimeoutException: Timeout expired after 
> 60000milliseconds while awaiting InitProducerId
> 2020-07-17T11:43:47.9695376Z Caused by: 
> org.apache.kafka.common.errors.TimeoutException: Timeout expired after 
> 60000milliseconds while awaiting InitProducerId
> {code}



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

Reply via email to