[jira] [Updated] (KAFKA-4157) Transient system test failure in replica_verification_test.test_replica_lags

2016-09-17 Thread Ismael Juma (JIRA)

 [ 
https://issues.apache.org/jira/browse/KAFKA-4157?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Ismael Juma updated KAFKA-4157:
---
   Resolution: Fixed
Fix Version/s: 0.10.0.2
   Status: Resolved  (was: Patch Available)

> Transient system test failure in replica_verification_test.test_replica_lags
> 
>
> Key: KAFKA-4157
> URL: https://issues.apache.org/jira/browse/KAFKA-4157
> Project: Kafka
>  Issue Type: Bug
>  Components: system tests
>Affects Versions: 0.10.0.0
>Reporter: Grant Henke
>Assignee: Grant Henke
> Fix For: 0.10.0.2
>
>
> The replica_verification_test.test_replica_lags test runs a background thread 
> via replica_verification_tool that populates a dict with max lag for each 
> "topic,partition" key. Because populating that map is in a separate thread, 
> there is a race condition on populating the key and querying it via 
> replica_verification_tool.get_lag_for_partition. This results in a key error 
> like below: 
> {noformat}
> Traceback (most recent call last):
>   File "/usr/lib/python2.7/site-packages/ducktape/tests/runner.py", line 106, 
> in run_all_tests
> data = self.run_single_test()
>   File "/usr/lib/python2.7/site-packages/ducktape/tests/runner.py", line 162, 
> in run_single_test
> return self.current_test_context.function(self.current_test)
>   File 
> "/root/kafka/tests/kafkatest/tests/tools/replica_verification_test.py", line 
> 82, in test_replica_lags
> err_msg="Timed out waiting to reach zero replica lags.")
>   File "/usr/lib/python2.7/site-packages/ducktape/utils/util.py", line 31, in 
> wait_until
> if condition():
>   File 
> "/root/kafka/tests/kafkatest/tests/tools/replica_verification_test.py", line 
> 81, in 
> wait_until(lambda: self.replica_verifier.get_lag_for_partition(TOPIC, 0) 
> == 0, timeout_sec=10,
>   File "/root/kafka/tests/kafkatest/services/replica_verification_tool.py", 
> line 66, in get_lag_for_partition
> lag = self.partition_lag[topic_partition]
> KeyError: 'topic-replica-verification,0'
> {noformat}
> Instead of an error, None should be returned when no key is found. 



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Updated] (KAFKA-4157) Transient system test failure in replica_verification_test.test_replica_lags

2016-09-13 Thread Grant Henke (JIRA)

 [ 
https://issues.apache.org/jira/browse/KAFKA-4157?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Grant Henke updated KAFKA-4157:
---
Status: Patch Available  (was: Open)

> Transient system test failure in replica_verification_test.test_replica_lags
> 
>
> Key: KAFKA-4157
> URL: https://issues.apache.org/jira/browse/KAFKA-4157
> Project: Kafka
>  Issue Type: Bug
>  Components: system tests
>Affects Versions: 0.10.0.0
>Reporter: Grant Henke
>Assignee: Grant Henke
>
> The replica_verification_test.test_replica_lags test runs a background thread 
> via replica_verification_tool that populates a dict with max lag for each 
> "topic,partition" key. Because populating that map is in a separate thread, 
> there is a race condition on populating the key and querying it via 
> replica_verification_tool.get_lag_for_partition. This results in a key error 
> like below: 
> {noformat}
> Traceback (most recent call last):
>   File "/usr/lib/python2.7/site-packages/ducktape/tests/runner.py", line 106, 
> in run_all_tests
> data = self.run_single_test()
>   File "/usr/lib/python2.7/site-packages/ducktape/tests/runner.py", line 162, 
> in run_single_test
> return self.current_test_context.function(self.current_test)
>   File 
> "/root/kafka/tests/kafkatest/tests/tools/replica_verification_test.py", line 
> 82, in test_replica_lags
> err_msg="Timed out waiting to reach zero replica lags.")
>   File "/usr/lib/python2.7/site-packages/ducktape/utils/util.py", line 31, in 
> wait_until
> if condition():
>   File 
> "/root/kafka/tests/kafkatest/tests/tools/replica_verification_test.py", line 
> 81, in 
> wait_until(lambda: self.replica_verifier.get_lag_for_partition(TOPIC, 0) 
> == 0, timeout_sec=10,
>   File "/root/kafka/tests/kafkatest/services/replica_verification_tool.py", 
> line 66, in get_lag_for_partition
> lag = self.partition_lag[topic_partition]
> KeyError: 'topic-replica-verification,0'
> {noformat}
> Instead of an error, None should be returned when no key is found. 



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)