[ https://issues.apache.org/jira/browse/CASSANDRA-18766?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17872980#comment-17872980 ]

Aswin Karthik commented on CASSANDRA-18766:
-------------------------------------------

We are facing a similar issue.
We upgraded from 4.0.9 to 4.1.5, and speculative retries have increased
sharply. In some clusters, around 80% of requests are being speculatively
retried (it was <1% before the upgrade), where the percentage is
sum(SpeculativeRetries.Count) / sum(CoordinatorReadLatency.Count). The
symptoms are similar to those in the resolved ticket
https://issues.apache.org/jira/browse/CASSANDRA-17877.
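
For reference, the percentage above could be computed with a small shell helper like the one below. This is only a sketch: `retry_pct` is a hypothetical name, its two arguments are assumed to be the already-summed SpeculativeRetries.Count and CoordinatorReadLatency.Count values, and the numbers in the example call are illustrative, not measured.

```shell
# Hypothetical helper: percentage of coordinator reads that were
# speculatively retried.
#   $1 = sum of SpeculativeRetries.Count
#   $2 = sum of CoordinatorReadLatency.Count
retry_pct() {
  awk -v retries="$1" -v reads="$2" 'BEGIN {
    if (reads == 0) { print "0.00"; exit }  # no reads yet, avoid dividing by zero
    printf "%.2f\n", 100 * retries / reads
  }'
}

# Illustrative numbers only (not taken from any cluster):
retry_pct 800 1000
```

Both counters are cumulative since node start, so comparing deltas over the same time window on every node would probably give a fairer picture than the raw totals.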

I am able to reproduce the issue as well.

Create 2 clusters:

{noformat}
ccm create cluster1 -v 4.1.5 -n 3
ccm create cluster2 -v 4.0.9 -n 3
{noformat}

On each cluster, do the following:

{noformat}
ccm stress write "duration=2m" -rate "threads=8" -schema "replication(strategy=SimpleStrategy,replication_factor=2)"
ccm stress "read no-warmup duration=3m" -rate threads=8 -errors "skip-read-validation"
{noformat}

Check the speculative retries:

{noformat}
# On 4.1.5
$ ccm node1 nodetool -- sjk mx -b org.apache.cassandra.metrics:type=Table,keyspace=keyspace1,scope=standard1,name=SpeculativeRetries -mg -f Count
org.apache.cassandra.metrics:type=Table,keyspace=keyspace1,scope=standard1,name=SpeculativeRetries
174252

# On 4.0.9
$ ccm node1 nodetool -- sjk mx -b org.apache.cassandra.metrics:type=Table,keyspace=keyspace1,scope=standard1,name=SpeculativeRetries -mg -f Count
org.apache.cassandra.metrics:type=Table,keyspace=keyspace1,scope=standard1,name=SpeculativeRetries
5860
{noformat}

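The difference is massive: 174252 speculative retries on 4.1.5 versus 5860 on
4.0.9 for the same stress workload, roughly 30 times as many.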
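
To make the comparison repeatable, the two counts can be pulled out of the command output and divided. The sketch below assumes the output format shown above (the MBean name on one line, the Count on the next); `last_count` and `fold_increase` are hypothetical helper names, and the `printf` calls stand in for the real `ccm node1 nodetool -- sjk mx ...` invocations.

```shell
# Hypothetical helper: print the last non-empty line of stdin, which is
# the Count value in the two-line output format shown above.
last_count() {
  awk 'NF { value = $0 } END { print value }'
}

# Hypothetical helper: how many times larger $1 is than $2.
fold_increase() {
  awk -v newer="$1" -v older="$2" 'BEGIN {
    if (older == 0) { print "n/a"; exit }  # nothing to compare against
    printf "%.1f\n", newer / older
  }'
}

# Stand-ins for the real command output on each cluster.
mbean='org.apache.cassandra.metrics:type=Table,keyspace=keyspace1,scope=standard1,name=SpeculativeRetries'
count_415=$(printf '%s\n' "$mbean" '174252' | last_count)
count_409=$(printf '%s\n' "$mbean" '5860' | last_count)

fold_increase "$count_415" "$count_409"
```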

> high speculative retries on v4.1.3
> ----------------------------------
>
>                 Key: CASSANDRA-18766
>                 URL: https://issues.apache.org/jira/browse/CASSANDRA-18766
>             Project: Cassandra
>          Issue Type: Bug
>          Components: Consistency/Coordination
>            Reporter: Ivans Novikovs
>            Priority: Normal
>             Fix For: 4.1.x
>
>         Attachments: screenshot-1.png, screenshot-2.png, signature.asc, 
> signature.asc, v4.0.png, v4.1.png
>
>
> Speculative retries for reads are 10 or more times higher on 4.1.3 
> compared to 4.0.7 and 4.1.2 when using QUORUM and the default setting of 99p.
> On 4.1.3, after the upgrade, I see speculative retries for up to 35% of all 
> reads for a specific table. Read latency is stable at around 500 microseconds.
> Java 1.8.0_382 is used.



