[ClusterLabs] [rabbitmq] Maximum Number of Sessions (8192) Reached

Eugen Block Tue, 27 Aug 2024 05:22:28 -0700

Hi,

I'm newly subscribed to the list, hoping to find some pointers. Ican't seem to find much about rabbitmq and logind, so I wanted to askthe list if anyone has encountered the same and if so, how they dealtwith it.

We're supporting a Victoria cluster (installed with our own deploymentmethod) mostly controlled by pacemaker. And on two of the threecontrol nodes I see this warning constantly:


---snip---

2024-07-29T14:09:23.552576+02:00 control01 su: pam_unix(su:session):session opened for user rabbitmq by (uid=0)2024-07-29T14:09:24.450657+02:00 control01 su: pam_unix(su:session):session closed for user rabbitmq

2024-07-29T14:09:24.500356+02:00 control01 su: (to rabbitmq) root on none

2024-07-29T14:09:24.502370+02:00 control01 su:pam_systemd(su:session): Failed to create session: Maximum number ofsessions (8192) reached, refusing further sessions.2024-07-29T14:09:24.502681+02:00 control01 su: pam_unix(su:session):session opened for user rabbitmq by (uid=0)2024-07-29T14:09:25.565203+02:00 control01 su: pam_unix(su:session):session closed for user rabbitmq

2024-07-29T14:09:25.609613+02:00 control01 su: (to rabbitmq) root on none
---snip---

This is obviously initiated by pacemaker (just grabbed newer logs):

Aug 27 13:16:06 control03 lrmd[297534]: INFO: rabbitmq[296363]:su_rabbit_cmd(): the invoked command exited 0: /usr/sbin/rabbitmqctlnode_health_check -t 128Aug 27 13:16:06 control03 lrmd[297542]: INFO: rabbitmq[296363]:get_monitor(): get_monitor function ready to return 0

Looking into loginctl list-sessions, almost all of them belong torabbitmq and they have a very old timestamp (2023). I'm aware of oldersystemd versions which can't handle closing sessions correctly [0],but we already use a version newer than required according to [0]. Iincreased the SessionsMax to 16384 on one of the nodes, and again,rabbitmq uses almost all available sessions:


control03:~ # loginctl list-sessions | grep -c rabbit
16325

But everything seems to be working okay, it's just filling up the logsapparently. And it seems as if all new sessions are closed properly:

control03:~ # journalctl --since 2024-08-14 | grep -c "session openedfor user rabbitmq"

control03:~ # journalctl --since 2024-08-14 | grep -c "session closedfor user rabbitmq"

What I'm wondering about is why only two out of three control nodesreach the SessionsMax limit while the third (which joined the clusterlater) only has 2 rabbitmq sessions. I seem to overlook something, butI don't know what it is yet. And I'm curious if this is working "asdesigned". This is a cluster with 3 control nodes and 36 computenodes. What do other operators see in their HA clouds regardingrabbitmq?

Or could this be a rabbitmq issue since the ocf ha resource is fromthe rabbitmq-server package?


rpm -qf /usr/lib/ocf/resource.d/rabbitmq/rabbitmq-server-ha
rabbitmq-server-3.8.3-lp152.2.3.1.x86_64

Thanks for any pointers!
Eugen

[0] https://www.suse.com/support/kb/doc/?id=000020549


_______________________________________________
Manage your subscription:
https://lists.clusterlabs.org/mailman/listinfo/users

ClusterLabs home: https://www.clusterlabs.org/

[ClusterLabs] [rabbitmq] Maximum Number of Sessions (8192) Reached

Reply via email to