Peter Norton created CASSANDRA-11744:
----------------------------------------

             Summary: Trying to restart a 2.2.5 node, nodetool disablethrift 
fails
                 Key: CASSANDRA-11744
                 URL: https://issues.apache.org/jira/browse/CASSANDRA-11744
             Project: Cassandra
          Issue Type: Bug
            Reporter: Peter Norton
         Attachments: failure.jstack.out

We have a 2.2.5 cluster running in AWS VPC with EBS volumes.  Earlier today 3 
nodes seem to have gone into a bad state - clients were seeing high latencies 
when writing to these nodes, and the write to the commitlog on each of these 
nodes seemed high - more than the relatively low number of iops that AWS 
allocated to these volumes.  While trying to understand the situation we 
attempted to restart the 3 nodes.  We attempted to do a nodetool disablebinary; 
nodetool disablethrift; nodetool flush. and then stop the process.

When trying to disablethrift, the following stack trace appeared in the 
system.log:

```
INFO  [RMI TCP Connection(8)-172.26.32.248] 2016-05-10 15:26:58,599 
Server.java:218 - Stop listening for CQL clients
INFO  [RMI TCP Connection(10)-172.26.32.248] 2016-05-10 15:27:01,975 
ThriftServer.java:142 - Stop listening to thrift clients
ERROR [RPC-Thread:34] 2016-05-10 15:27:03,794 Message.java:324 - Unexpected 
throwable while invoking!
java.lang.NullPointerException: null
        at com.thinkaurelius.thrift.util.mem.Buffer.size(Buffer.java:83) 
~[thrift-server-0.3.7.jar:na]
        at 
com.thinkaurelius.thrift.util.mem.FastMemoryOutputTransport.expand(FastMemoryOutputTransport.java:84)
 ~[thrift-server-0.3.7.jar:na]
        at 
com.thinkaurelius.thrift.util.mem.FastMemoryOutputTransport.write(FastMemoryOutputTransport.java:167)
 ~[thrift-server-0.3.7.jar:na]
        at 
org.apache.thrift.transport.TFramedTransport.flush(TFramedTransport.java:156) 
~[libthrift-0.9.2.jar:0.9.2]
        at org.apache.thrift.ProcessFunction.process(ProcessFunction.java:55) 
~[libthrift-0.9.2.jar:0.9.2]
        at org.apache.thrift.TBaseProcessor.process(TBaseProcessor.java:39) 
~[libthrift-0.9.2.jar:0.9.2]
        at com.thinkaurelius.thrift.Message.invoke(Message.java:314) 
~[thrift-server-0.3.7.jar:na]
        at com.thinkaurelius.thrift.Message$Invocation.execute(Message.java:90) 
[thrift-server-0.3.7.jar:na]
        at 
com.thinkaurelius.thrift.TDisruptorServer$InvocationHandler.onEvent(TDisruptorServer.java:695)
 [thrift-server-0.3.7.jar:na]
        at 
com.thinkaurelius.thrift.TDisruptorServer$InvocationHandler.onEvent(TDisruptorServer.java:689)
 [thrift-server-0.3.7.jar:na]
        at com.lmax.disruptor.WorkProcessor.run(WorkProcessor.java:112) 
[disruptor-3.0.1.jar:na]
        at 
java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1142) 
[na:1.8.0_60]
        at 
java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:617) 
[na:1.8.0_60]
        at java.lang.Thread.run(Thread.java:745) [na:1.8.0_60]
```

The attached jstack was taken from a node after the above was noticed.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

Reply via email to