albertogpz opened a new pull request #686: URL: https://github.com/apache/geode-native/pull/686
The following integration test cases under integration/test (new integration tests) ar flaky (do not fail normally when run locally but fail very often when run in CI). - PartitionRegionOpsTest.getPartitionedRegionWithRedundancyServerGoesDownSingleHop - PartitionRegionOpsTest.putPartitionedRegionWithRedundancyServerGoesDownSingleHop There were two reasons that can make them fail. One of them is that sometimes the connections to the server have expired before the server is restarted and therefore, when traffic is sent to the restarted server, no errors are found. To fix this, the pool configuration for the test client has been changed so that connections do not expire. The other reason is that sometimes the error in the connection is found by the ping thread that is invoking the ThinClientPoolDM::sendRequestToEP() method and in this method, when the IO error or TIMEOUT error are encountered, the endpoint is not removed from the metadata (by means of the removeBucketServerLocation method). The code has been updated to remove the metadata also in this case. With these two changes, the test cases are not flaky anymore. ---------------------------------------------------------------- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. For queries about this service, please contact Infrastructure at: [email protected]
