[
https://issues.apache.org/jira/browse/KAFKA-4157?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15487341#comment-15487341
]
ASF GitHub Bot commented on KAFKA-4157:
---------------------------------------
GitHub user granthenke opened a pull request:
https://github.com/apache/kafka/pull/1849
KAFKA-4157: Transient system test failure in replica_verification_tes…
…t.test_replica_lags
You can merge this pull request into a Git repository by running:
$ git pull https://github.com/granthenke/kafka replica-verification-fix
Alternatively you can review and apply these changes as the patch at:
https://github.com/apache/kafka/pull/1849.patch
To close this pull request, make a commit to your master/trunk branch
with (at least) the following in the commit message:
This closes #1849
----
commit 4fdee9a5f7cb0c62fd001c8e64eab4f9ec6785ad
Author: Grant Henke <[email protected]>
Date: 2016-09-13T14:27:26Z
KAFKA-4157: Transient system test failure in
replica_verification_test.test_replica_lags
----
> Transient system test failure in replica_verification_test.test_replica_lags
> ----------------------------------------------------------------------------
>
> Key: KAFKA-4157
> URL: https://issues.apache.org/jira/browse/KAFKA-4157
> Project: Kafka
> Issue Type: Bug
> Components: system tests
> Affects Versions: 0.10.0.0
> Reporter: Grant Henke
> Assignee: Grant Henke
>
> The replica_verification_test.test_replica_lags test runs a background thread
> via replica_verification_tool that populates a dict with max lag for each
> "topic,partition" key. Because populating that map is in a separate thread,
> there is a race condition on populating the key and querying it via
> replica_verification_tool.get_lag_for_partition. This results in a key error
> like below:
> {noformat}
> Traceback (most recent call last):
> File "/usr/lib/python2.7/site-packages/ducktape/tests/runner.py", line 106,
> in run_all_tests
> data = self.run_single_test()
> File "/usr/lib/python2.7/site-packages/ducktape/tests/runner.py", line 162,
> in run_single_test
> return self.current_test_context.function(self.current_test)
> File
> "/root/kafka/tests/kafkatest/tests/tools/replica_verification_test.py", line
> 82, in test_replica_lags
> err_msg="Timed out waiting to reach zero replica lags.")
> File "/usr/lib/python2.7/site-packages/ducktape/utils/util.py", line 31, in
> wait_until
> if condition():
> File
> "/root/kafka/tests/kafkatest/tests/tools/replica_verification_test.py", line
> 81, in <lambda>
> wait_until(lambda: self.replica_verifier.get_lag_for_partition(TOPIC, 0)
> == 0, timeout_sec=10,
> File "/root/kafka/tests/kafkatest/services/replica_verification_tool.py",
> line 66, in get_lag_for_partition
> lag = self.partition_lag[topic_partition]
> KeyError: 'topic-replica-verification,0'
> {noformat}
> Instead of an error, None should be returned when no key is found.
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)