[
https://issues.apache.org/jira/browse/CASSANDRA-15318?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17426393#comment-17426393
]
Jon Meredith commented on CASSANDRA-15318:
------------------------------------------
The original motivation for the patch was to mitigate the impact of an outage
for write queries with CL.EACH_QUORUM where the first node in the forwarding
list was considered UP but was in a state where it did not forward traffic to
the others, so by shuffling with equal probability means 2/3 traffic gets
through instead of failing. Sorting or weighting by proximity might help
performance but probably negate some of the resilience this was intended to
help with.
> sendMessagesToNonlocalDC() should shuffle targets
> -------------------------------------------------
>
> Key: CASSANDRA-15318
> URL: https://issues.apache.org/jira/browse/CASSANDRA-15318
> Project: Cassandra
> Issue Type: Improvement
> Components: Messaging/Internode
> Reporter: Jon Meredith
> Assignee: Jon Meredith
> Priority: Normal
> Fix For: 4.0-alpha3, 4.0
>
>
> To better spread load and reduce the impact of a node failure before
> detection (or other issues like issues host replacement), when forwarding
> messages to other data centers the forwarding non-local dc nodes should be
> selected at random rather than always selecting the first node in the list of
> endpoints for a token.
>
--
This message was sent by Atlassian Jira
(v8.3.4#803005)
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]