[ https://issues.apache.org/jira/browse/CASSANDRASC-86?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17812409#comment-17812409 ]
ASF subversion and git services commented on CASSANDRASC-86:
------------------------------------------------------------
Commit 2eb3474d7037a2887bcd9dee1f64c2a36a7e8d26 in cassandra-sidecar's branch
refs/heads/trunk from Yuriy Semchyshyn
[ https://gitbox.apache.org/repos/asf?p=cassandra-sidecar.git;h=2eb3474 ]
CASSANDRASC-86 Add Random Delays Between Retry Attempts for Health Checks
Patch by Yuriy Semchyshyn; Reviewed by Yifan Cai and Francisco Guerrero for
CASSANDRASC-86
> Startup Validation Failures when Checking Sidecar Connectivity
> --------------------------------------------------------------
>
> Key: CASSANDRASC-86
> URL: https://issues.apache.org/jira/browse/CASSANDRASC-86
> Project: Sidecar for Apache Cassandra
> Issue Type: Improvement
> Components: Configuration
> Reporter: Yuriy Semchyshyn
> Assignee: Yuriy Semchyshyn
> Priority: Normal
> Labels: pull-request-available
> Time Spent: 0.5h
> Remaining Estimate: 0h
>
> We have experienced repeated startup validation failures caused by Sidecar
> health checks for some jobs with a large number of Spark executors.
> These failures are likely caused by the thundering herd problem, and have
> so far been worked around by disabling startup validations altogether.
> To prevent them going forward, a random delay needs to be added
> between retries of health checks in the Sidecar client.
> It is also worth increasing the overall timeout for Sidecar health checks
> from the current 30 seconds to 60 seconds.
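
For illustration only, the idea in the description (a random pause between
health-check retries, bounded by an overall timeout) might be sketched as
follows. This is not the Sidecar client's actual code: the function name
retry_with_jitter, its parameters, and the 2-second maximum delay are all
hypothetical. Only the 60-second overall timeout comes from the ticket.

```python
import random
import time


def retry_with_jitter(check, timeout_s=60.0, max_delay_s=2.0,
                      sleep=time.sleep, clock=time.monotonic,
                      rng=random.uniform):
    """Call check() until it returns True or timeout_s elapses.

    Between attempts, pause for a random time in [0, max_delay_s] so that
    many clients started together do not retry in lockstep.
    Returns the number of attempts on success; raises TimeoutError otherwise.
    All names and defaults here are illustrative assumptions.
    """
    deadline = clock() + timeout_s
    attempts = 0
    while True:
        attempts += 1
        if check():
            return attempts
        remaining = deadline - clock()
        if remaining <= 0:
            raise TimeoutError(
                f"health check failed after {attempts} attempts")
        # Random delay (jitter), capped so we never sleep past the deadline.
        sleep(min(rng(0.0, max_delay_s), remaining))
```

Because each executor draws its own delay, retries from a large job spread
out over time rather than arriving at the Sidecar instance simultaneously.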