One last update:
After kicking it a bit more, the node finally joined the cluster fully. After the third reboot of the server, it eventually reached the UN
state. I wish I had kept the link, but I had read about someone with a
similar issue joining a node to a 4.1.x cluster, and the answer there was
to restart the service.
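For anyone who finds this thread later: the restart was just the usual service bounce (assuming the packaged systemd unit name), followed by watching for UN:

    sudo systemctl restart cassandra
    nodetool status    # wait for the new node to show UN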
On 5/23/25 10:46 PM, Courtney wrote:
Some updates after getting back to this. I did hardware tests and
could not find any hardware issues. Instead of trying a replace, I
went the route of removing the dead node entirely and then adding in a
new node.
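For reference, the removal itself was roughly the standard nodetool sequence; the host ID below is a placeholder for whatever `nodetool status` reports for the DN entry, nothing specific to my cluster:

    nodetool status                    # note the Host ID shown for the DN node
    nodetool removenode <host-id>      # remove it and re-replicate its ranges
    nodetool removenode status         # check progress
    # nodetool assassinate <dead-node-ip>   # last resort only, skips re-replication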
The new node is still joining, but I am hitting some oddities in the
log. While it joins, the process seems to get stuck "rescheduling" around
the time these log entries appear:
INFO [Messaging-EventLoop-3-25] 2025-05-24 05:20:38,203 NoSpamLogger.java:105 - /<new-node>:7000->/<dead-node>:7000-SMALL_MESSAGES-[no-channel] failed to connect
io.netty.channel.ConnectTimeoutException: connection timed out: /<dead-node>:7000
WARN [Messaging-EventLoop-3-25] 2025-05-24 05:20:46,200 NoSpamLogger.java:108 - /<new-node>:7000->/<dead-node>:7000-SMALL_MESSAGES-[no-channel] dropping message of type PING_REQ whose timeout expired before reaching the network
INFO [Messaging-EventLoop-3-26] 2025-05-24 05:32:27,606 NoSpamLogger.java:105 - /<new-node>:7000->/<dead-node>:7000-LARGE_MESSAGES-[no-channel] failed to connect
io.netty.channel.ConnectTimeoutException: connection timed out: /<dead-node>:7000
INFO [GossipStage:1] 2025-05-24 05:33:15,538 Gossiper.java:1428 - InetAddress /<dead-node>:7000 is now DOWN
The cluster knows nothing about the old node, and the old node is completely
powered off. I don't know why the new node is attempting to connect to a dead
node that has no presence in the cluster.
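For what it's worth, these are the sort of checks that statement is based on, all stock tooling (cqlsh auth flags omitted, addresses are placeholders); system.peers_v2 is the 4.x peers table:

    nodetool status                                         # no DN entry for the old node
    nodetool gossipinfo | grep -A 5 '<dead-node>'           # no gossip state for it either
    cqlsh -e "SELECT peer, host_id FROM system.peers_v2;"   # not listed in the peers table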
Before that, I start getting these messages:
INFO [OptionalTasks:1] 2025-05-24 05:27:46,189 NoSpamLogger.java:105 - "Cannot read from a bootstrapping node" while executing SELECT * FROM system_auth.roles WHERE role = 'cassandra' ALLOW FILTERING
WARN [OptionalTasks:1] 2025-05-24 05:32:16,240 CassandraRoleManager.java:359 - CassandraRoleManager skipped default role setup: some nodes were not ready
INFO [OptionalTasks:1] 2025-05-24 05:32:16,240 CassandraRoleManager.java:395 - Setup task failed with error, rescheduling
I've had to stop and start Cassandra to get past this, but I am
afraid I will hit it again soon.
On 5/16/25 11:54 PM, Sebastian Marsching wrote:
To add on to what Bowen already wrote, if you cannot find any reason
in the logs at all, I would retry using different hardware.
In the recent past I have seen two cases where strange Cassandra
problems were actually caused by broken hardware (in both cases, a
faulty memory module caused the issues). In one case, there were log
messages, but misleading ones about SSTable corruption (the SSTables
were fine, but when loaded into memory, the data got corrupted). In
the other case, there were no log messages at all. The Cassandra
process simply stopped without a good reason. Eventually I found the
crash dumps in the Cassandra data directory (whether they are written
depends on the JVM setting), which indicated that the JVM experienced
a segmentation fault.
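As an aside, it can help to pin down where those crash dumps go; something along these lines in cassandra-env.sh (the path is only an example) writes them to the log directory instead of whatever the working directory happens to be:

    JVM_OPTS="$JVM_OPTS -XX:ErrorFile=/var/log/cassandra/hs_err_pid%p.log"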
So, if you have any spare hardware lying around (it’s best if it is
from a different batch to exclude common-mode failures), using a
different piece of hardware and trying with that one might make
sense. If the problem stays, you can at least be sure that it isn’t
related to the hardware, and if it vanishes, you can further inspect
the original hardware.
On 5/17/25 4:27 AM, Bowen Song via user
<user@cassandra.apache.org> wrote:
In my experience, a failed bootstrap / node replacement always leaves
some traces in the logs. At the very minimum, there will be
logs about streaming sessions failing or aborting. I have never seen
it silently fail or stop without leaving any trace in the logs, and I
can't think of anything that could cause the process to fail without
leaving one. BTW, the relevant logs can be
hours before the symptom becomes visible, because a failed streaming
session does not cause Cassandra to immediately abort the other active
streaming sessions, and the remaining active sessions can take a
while to complete.
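A rough way to surface those traces across the whole time window is to grep the logs for the streaming classes; the paths below are the Debian/Ubuntu package defaults, adjust as needed:

    grep -iE 'stream.*(fail|abort|error)' /var/log/cassandra/system.log*
    grep -iE 'StreamSession|StreamResultFuture' /var/log/cassandra/debug.log*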
If the process repeatedly fails at the same point, I would suspect
some sort of data corruption or disk error resulting in data that
cannot be read or deserialised correctly. But this is just a guess,
and I could be wrong.
On 16/05/2025 01:14, Courtney wrote:
I checked all the logs and really couldn't find anything. I
couldn't find any sort of error in dmesg, system.log, debug.log,
gc.log (maybe I should raise the log level?), or the systemd journal... the logs are
totally clean. It just stops gossiping all of a sudden at 22 GB of
data each time, and then the old node returns to the DN state. What is
`nodetool bootstrap resume` going to do? Is there a risk in running
resume when the replacement node is no longer in the cluster? Could
too high a tombstone ratio cause this?
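For context, tombstone levels can at least be quantified with the stock tools; the keyspace, table, and data path below are placeholders:

    nodetool tablestats <keyspace>.<table> | grep -i tombstone
    sstablemetadata /var/lib/cassandra/data/<keyspace>/<table>-*/*-Data.db | grep -i droppable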
On 5/15/25 5:08 PM, Bowen Song via user wrote:
The dead node being replaced went back to DN state indicating the
new replacement node failed to join the cluster, usually because
the streaming was interrupted (e.g. by network issues, or long STW
GC pauses). I would start looking for red flags in the logs,
including Cassandra's logs, GC logs, dmesg, systemd journal, etc.,
on the new node, and other nodes in the cluster too. Also, I would
try `nodetool bootstrap resume` on the replacement node.
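If you do go that route, it is just the stock commands on the replacement node, nothing cluster-specific; netstats is a convenient way to see whether streaming actually resumes:

    nodetool bootstrap resume    # restart the interrupted streaming
    nodetool netstats -H         # watch the streaming sessions and bytes transferred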
On 12/05/2025 09:53, Courtney wrote:
Hello everyone,
I have a cluster with 2 datacenters. I am using
GossipingPropertyFileSnitch as my endpoint snitch, on Cassandra
4.1.8. One datacenter is entirely Ubuntu 24.04 with OpenJDK 11, and
the other is Ubuntu 20.04 with OpenJDK 8. A seed node died in
my second DC, the one running the Ubuntu 20.04 hosts. I ordered a new
dedicated server, updated my seeds to forget the dead seed node, and
followed the steps to replace a dead node:
JVM_OPTS="$JVM_OPTS $JVM_EXTRA_OPTS -Dcassandra.replace_address_first_boot=<dead_node_ip>"
Configs between the old and new nodes are identical apart from the IP
addresses and that line above in the env file to replace the dead node. I
started the node, it began replacing the old one, and it showed up in
the `UJ` state. Not long into the process, the new node stops
processing data and the cluster forgets the new node and
remembers the old one in its `DN` state (even though that machine is
powered off). There are no errors in the logs. I've tried several
times hoping to solve the issue. I upped my ROOT logging level to
DEBUG and also set "org.apache.cassandra.gms.Gossiper TRACE". No
errors.
With TRACE set for the Gossiper, I notice that gossiping stops and
data stops streaming at about the same time. I cannot run any
nodetool commands on the new node. The process doesn't die, and it
keeps open connections to the nodes that were streaming data, but I
don't see any data actually streaming.
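Since nodetool is unresponsive on the new node itself, the picture above is pieced together roughly like this from the other nodes and from the sockets on the new node; all stock tooling:

    nodetool status               # on a peer: watch the UJ entry vanish and DN come back
    nodetool netstats -H          # on the streaming source nodes: sessions sit idle
    ss -tn | grep ':7000'         # on the new node: connections stay open but quiet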
I've thought through a lot of possibilities. Space isn't an issue, and
ulimits are set high in /etc/security/limits.conf; checking
/proc/<pid>/limits shows the values are high. I've replaced nodes like
this before without issue, but this one is causing me grief. Is there
anything more I can do?
Courtney