[ 
https://issues.apache.org/jira/browse/CASSANDRA-13517?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17309065#comment-17309065
 ] 

Adam Holmberg edited comment on CASSANDRA-13517 at 3/26/21, 12:35 AM:
----------------------------------------------------------------------

The test fails this way intermittently, but consistently on a 
resource-constrained VM. The failures are characterized by driver heartbeat 
timeouts, each of which causes the worker to 
[exit|https://github.com/apache/cassandra-dtest/blob/49f46fce94c8f25f32e9b778ded8b14c30ad851e/paxos_test.py#L145-L149]
 without retrying. I think the server and cluster are just being overwhelmed. 
The test never fails on a well-provisioned machine.
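
To make the failure arithmetic concrete, here is a minimal stand-alone sketch (not the dtest code) of workers that exit on a non-retried error. The exception class, the function name, and the choice of which workers fail are all hypothetical; only the 300 * 1 workload and the 22 errors come from the report.

```python
import threading

THREADS = 300      # matches the 300 * 1 in the reported assertion
ITERATIONS = 1


class HeartbeatTimeout(Exception):
    """Hypothetical stand-in for a driver-side heartbeat timeout."""


def run_contention(failing_workers):
    """Run THREADS workers; ids in failing_workers hit a simulated timeout."""
    lock = threading.Lock()
    totals = {"value": 0, "errors": 0}

    def worker(worker_id):
        for _ in range(ITERATIONS):
            try:
                if worker_id in failing_workers:
                    raise HeartbeatTimeout()
                with lock:
                    totals["value"] += 1
            except HeartbeatTimeout:
                # The worker records the error and exits without retrying,
                # so its increment is never applied.
                with lock:
                    totals["errors"] += 1
                return

    threads = [threading.Thread(target=worker, args=(i,)) for i in range(THREADS)]
    for t in threads:
        t.start()
    for t in threads:
        t.join()
    return totals["value"], totals["errors"]


value, errors = run_contention(failing_workers=set(range(22)))
print(value, errors)  # 278 22
```

Every worker that exits early leaves the counter one short, which is why value + errors still adds up to 300 in the reported assertion.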

The proposed change creates a client connection with ample timeouts and 
heartbeats disabled. I'm also reducing the concurrency from one arbitrary 
number to another, slightly smaller arbitrary number that is a bit more 
appropriate for a single-host, three-node test cluster.
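
For readers unfamiliar with the driver settings involved, the following is a rough sketch of what "ample timeouts and heartbeats disabled" could look like with the DataStax Python driver. It is not taken from the patch: the helper names and every timeout value are assumptions chosen for illustration. The only driver features relied on are the `Cluster` keyword arguments `connect_timeout`, `control_connection_timeout`, and `idle_heartbeat_interval` (where 0 disables heartbeats), plus `Session.default_timeout`.

```python
def patient_cluster_kwargs(contact_points, request_timeout=60.0):
    """Build Cluster keyword arguments for a slow, overloaded test cluster.

    Hypothetical helper; the timeout values are illustrative assumptions.
    """
    cluster_kwargs = {
        "contact_points": list(contact_points),
        "connect_timeout": 30.0,             # seconds to establish a connection
        "control_connection_timeout": 30.0,  # seconds for control-connection queries
        "idle_heartbeat_interval": 0,        # 0 disables idle heartbeats
    }
    return cluster_kwargs, request_timeout


def connect(contact_points):
    """Open a session with the settings above (requires cassandra-driver)."""
    # Imported lazily so the settings helper can be used without the driver.
    from cassandra.cluster import Cluster

    cluster_kwargs, request_timeout = patient_cluster_kwargs(contact_points)
    cluster = Cluster(**cluster_kwargs)
    session = cluster.connect()
    session.default_timeout = request_timeout  # per-request timeout in seconds
    return cluster, session
```

With heartbeats off, an idle connection is no longer marked defunct just because an overloaded node is slow to answer a heartbeat; the generous request timeout then covers the actual queries.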

[test patch|https://github.com/aholmberg/cassandra-dtest/pull/5]
 
[ci|https://app.circleci.com/pipelines/github/aholmberg/cassandra?branch=CASSANDRA-13517]
 (started, not reviewed)


> dtest failure in paxos_tests.TestPaxos.contention_test_many_threads
> -------------------------------------------------------------------
>
>                 Key: CASSANDRA-13517
>                 URL: https://issues.apache.org/jira/browse/CASSANDRA-13517
>             Project: Cassandra
>          Issue Type: Bug
>          Components: Test/dtest/python
>            Reporter: Ariel Weisberg
>            Assignee: Adam Holmberg
>            Priority: Normal
>              Labels: dtest, test-failure, test-failure-fresh
>             Fix For: 4.0-rc
>
>         Attachments: test_failure.txt
>
>
> Error Message
> AssertionError: value=278, errors=22, retries=2888 assert (278 == (300 * 1))
> {noformat}
> > assert (value == threads * iterations) and (errors == 0), "value={}, errors={}, retries={}".format(value, errors, retries)
> E AssertionError: value=278, errors=22, retries=2888
> E assert (278 == (300 * 1))
> paxos_test.py:195: AssertionError
> {noformat}
>



--
This message was sent by Atlassian Jira
(v8.3.4#803005)
