1. A hibernating node is participating in gossip but intentionally hasn't
yet joined the ring. The two cases where a node would set a hibernating
status are when the node was started with "-Dcassandra.join_ring=False" and
has tokens or when the node was started to replace another node (using
"-Dcassandra.replace_address" or "-Dcassandra.replace_address_first_boot").

2. A rolling restart is probably your best bet. You may have more luck with
an assassinate in the case that you connect to a node that is not
continuously removing/adding the state. I suspect that this node will have
an alive status for this endpoint state. As usual, you should wield
assassinate with lots of caution.

This issue sounds most similar to CASSANDRA-10371. If you provide debugging
information similar to that requested on the above ticket as well as what
operation you were performing on the node (was it a failed attempt at
replacing? etc) on a JIRA ticket, someone might have a chance to look into
this further.

On Wed, Oct 12, 2016 at 9:48 AM, Kasper Petersen <kas...@sybogames.com>
wrote:

> Hi,
>
> I've recently upgraded our Cassandra cluster from 2.1 to 3.9. By
> default(?) 3.9 creates a debug.log file containing a ton of lines (a new
> one every second) with:
>
> DEBUG [GossipTasks:1] 2016-10-12 14:43:38,761 Gossiper.java:337 -
>> Convicting /172.31.137.65 with status hibernate - alive false
>
>
> That node has not been around for a very long time now.
>
> It does not show up in nodetool status and nodetool gossipinfo returns
> the following output about that node:
>
> /172.31.137.65
>>   generation:1433571405
>>   heartbeat:232
>>   STATUS:3:hibernate,true
>>   LOAD:225:96445.0
>>   SCHEMA:53:e2d1a288-581c-3f35-b492-1b9d5a803631
>>   DC:9:us-east
>>   RACK:11:1b
>>   RELEASE_VERSION:7:2.1.5
>>   RPC_ADDRESS:6:172.31.137.65
>>   SEVERITY:231:0.2512562870979309
>>   NET_VERSION:4:8
>>   HOST_ID:5:7988d3c9-dec8-4b71-b5a9-0b962aad0680
>>   TOKENS:2:<hidden>
>
>
> nodetool removenode 7988d3c9-dec8-4b71-b5a9-0b962aad0680 resulted in:
>
> error: Host ID not found.
>>
>
> Now my questions are:
>
>    1. What does it mean for a node to be "hibernating"? How does it end
>    up in that state?
>    2. How do I get rid of it? Its not coming back.
>
>
>
> --
> Best regards,
> Kasper Middelboe Petersen
>
> *Lead Backend Developer*
>
> *SYBO Games ApS*
> Jorcks Passage 1A, 4th.
> 1162 Copenhagen K
>

Reply via email to