Hi Jeff,
thanks for the pointers!
We upgraded to C* 3.11.0 now and the situation has improved a little
bit, the node does not die completely any more, but the
WriteTimeoutExceptions persists and still 'freeze' the node for a couple
of minutes.
> A single node with 20 cores and 256GB of RAM is
On 2017-07-25 12:49 (-0700), David Salz wrote:
> Hi,
>
> has anyone seen the following exception before?
>
> Context:
>
> * Cassandra 3.9,
>
> * single node (20 Cores / 256 GB RAM)
>
A single node with 20 cores and 256GB of RAM is probably not going to be
.e. become
completely unresponsive. Restarting the node fixes it... until the next
time :(
* Exceptions happen more often under heavy load
WARN [CounterMutationStage-10] 2017-07-25 18:39:31,867
AbstractLocalAwareExecutorService.java:169 - Uncaught exception on
thread Thread[CounterMutationStage