[ https://issues.apache.org/jira/browse/RATIS-2501?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Tsz-wo Sze updated RATIS-2501:
------------------------------
Component/s: test
(was: gRPC)
> Improve diagnostics for testInstallSnapshotDuringBootstrap timeout failures
> ---------------------------------------------------------------------------
>
> Key: RATIS-2501
> URL: https://issues.apache.org/jira/browse/RATIS-2501
> Project: Ratis
> Issue Type: Sub-task
> Components: test
> Reporter: Shilun Fan
> Assignee: Shilun Fan
> Priority: Major
> Time Spent: 0.5h
> Remaining Estimate: 0h
>
> The {{testInstallSnapshotDuringBootstrap}} test occasionally times out in CI
> with minimal diagnostic information:
> {code:java}
> java.util.concurrent.TimeoutException: testInstallSnapshotDuringBootstrap() timed out after 100 seconds
>   Suppressed: java.io.InterruptedIOException: retry policy=RetryForeverWithSleep(sleepTime = 100ms)
>     at org.apache.ratis.client.impl.AdminImpl.setConfiguration
>     at org.apache.ratis.server.impl.MiniRaftCluster.setConfiguration
> {code}
> The current implementation retries setConfiguration without limit, so a
> call that never succeeds blocks the test until the overall 100-second test
> timeout fires. When this happens, the logs contain too little information
> to diagnose the root cause.
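
One way to surface the underlying failure before the test timeout is to bound the number of retries and record each failed attempt. The following is a minimal sketch of that idea in plain Java, not the change made for this issue. The class BoundedRetry, its call method, and all parameter names are hypothetical and are not part of Ratis; the sketch deliberately avoids Ratis's own retry policy classes.

```java
import java.util.concurrent.Callable;

/** Hypothetical helper (not a Ratis class): retries an operation a bounded number of times. */
final class BoundedRetry {
  private BoundedRetry() {}

  /**
   * Runs op up to maxAttempts times, sleeping sleepMillis between attempts.
   * Each failure is logged; if every attempt fails, the thrown exception
   * reports the attempt count and elapsed time, with the last failure as cause.
   */
  static <T> T call(String name, int maxAttempts, long sleepMillis, Callable<T> op) {
    if (maxAttempts < 1) {
      throw new IllegalArgumentException("maxAttempts must be >= 1: " + maxAttempts);
    }
    final long start = System.nanoTime();
    Exception last = null;
    for (int attempt = 1; attempt <= maxAttempts; attempt++) {
      try {
        return op.call();
      } catch (InterruptedException e) {
        // Preserve the interrupt status and stop retrying.
        Thread.currentThread().interrupt();
        throw new IllegalStateException(name + " interrupted on attempt " + attempt, e);
      } catch (Exception e) {
        last = e;
        // Per-attempt diagnostics, so the log shows why each retry failed.
        System.err.println(name + ": attempt " + attempt + "/" + maxAttempts + " failed: " + e);
      }
      if (attempt < maxAttempts) {
        try {
          Thread.sleep(sleepMillis);
        } catch (InterruptedException e) {
          Thread.currentThread().interrupt();
          throw new IllegalStateException(name + " interrupted while sleeping", e);
        }
      }
    }
    final long elapsedMs = (System.nanoTime() - start) / 1_000_000;
    throw new IllegalStateException(
        name + " failed after " + maxAttempts + " attempts in " + elapsedMs + " ms", last);
  }
}
```

In a test, the setConfiguration call would be wrapped in something like BoundedRetry.call("setConfiguration", 50, 100L, ...), with the attempt count chosen so that the retries give up well before the 100-second test timeout. The values 50 and 100L here are illustrative only.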