rondagostino opened a new pull request #10991: URL: https://github.com/apache/kafka/pull/10991
This patch adds a sanity-check bounce system test for the case where we have 3 co-located KRaft controllers and fixes the system test code so that this case will pass. We were not testing this case previously, and it would not pass because the first Kafka node would never be considered started. We start the Kafka nodes serially, and we decide that a node has successfully started when it logs a particular message. This message is not logged until the broker has identified the controller (i.e. the leader of the KRaft quorum). There cannot be a leader in the KRaft case until a majority of the KRaft quorum has started, so with 3 co-located controllers the first node could never be considered "started" by the system test. This patch skips that log message check when a majority of the controllers have not yet started. It also sends SIGKILL to any running nodes when a majority of co-located controllers have been stopped -- otherwise they do not shutdown, and we spin for the 60 seco nd timeout. ### Committer Checklist (excluded from commit message) - [ ] Verify design and implementation - [ ] Verify test coverage and CI build status - [ ] Verify documentation (including upgrade notes) -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: [email protected] For queries about this service, please contact Infrastructure at: [email protected]
