[
https://issues.apache.org/jira/browse/KAFKA-1771?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Ewen Cheslack-Postava reassigned KAFKA-1771:
--------------------------------------------
Assignee: Ewen Cheslack-Postava
> replicate_testsuite data verification broken if num_partitions >
> replica_factor
> -------------------------------------------------------------------------------
>
> Key: KAFKA-1771
> URL: https://issues.apache.org/jira/browse/KAFKA-1771
> Project: Kafka
> Issue Type: Bug
> Components: system tests
> Affects Versions: 0.8.1.1
> Reporter: Ewen Cheslack-Postava
> Assignee: Ewen Cheslack-Postava
>
> As discussed in KAFKA-1763, testcase_0131, testcase_0132, and
> testcase_0133 currently fail with an exception:
> {quote}
> Traceback (most recent call last):
> File
> "/mnt/u001/kafka_replication_system_test/system_test/replication_testsuite/
> replica_basic_test.py", line 434, in runTest
> kafka_system_test_utils.validate_simple_consumer_data_matched_across_replic
> as(self.systemTestEnv, self.testcaseEnv)
> File
> "/mnt/u001/kafka_replication_system_test/system_test/utils/kafka_system_tes
> t_utils.py", line 2223, in
> validate_simple_consumer_data_matched_across_replicas
> replicaIdxMsgIdList[replicaIdx - 1][topicPartition] = consumerMsgIdList
> IndexError: list index out of range
> {quote}
> The root cause seems to be kafka_system_test_utils.start_simple_consumer. The
> current logic seems incorrect. It should be generating one consumer per
> partition per replica so it can verify the data from all sources, but it
> currently has a loop involving the list of brokers, where that loop variable
> isn't even used.
> But probably a bigger issue is that it's generating multiple processes in the
> background. It records pids to the single well-known entity pid path, which
> means only the last pid is saved and we could easily leave zombie processes
> if one of them hangs for some reason.
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)