Hello all,
I would appreciate some help resolving an issue that appeared on my installation
a few days ago.
I am experiencing a deadlock in the directory server, about 5 to 10 minutes
after restarting FreeIPA.
My installation is a dual-master FreeIPA setup, with replication
in place between the two masters.
When the deadlock occurs, the directory server times out even on a
simple ldapsearch -x request.
I can't even restart the directory server using systemctl; I need to kill the
process first.
I took a stack trace of the server while it was deadlocked, and I see
some threads waiting in __pthread_mutex_lock_full, called from replication
threads.
So perhaps something is wrong with my replication?
Thread 34 (Thread 0x7f6d0e75a6c0 (LWP 3980434) "ns-slapd"):
#0 0x00007f6d14081332 in __pthread_mutex_lock_full () at target:/lib64/libc.so.6
#1 0x00007f6d0f6b6bd2 in mdb_txn_renew0 () at target:/lib64/liblmdb.so.0.0.0
#2 0x00007f6d0f6b73c4 in mdb_txn_begin () at target:/lib64/liblmdb.so.0.0.0
#3 0x00007f6d0ef88b55 in dbmdb_start_txn () at target:/usr/lib64/dirsrv/plugins/libback-ldbm.so
#4 0x00007f6d0ef8b93a in dbmdb_dbi_txn_begin () at target:/usr/lib64/dirsrv/plugins/libback-ldbm.so
#5 0x00007f6d0ec43eae in _cl5Iterate () at target:/usr/lib64/dirsrv/plugins/libreplication-plugin.so
#6 0x00007f6d0ec49581 in _cl5TrimReplica () at target:/usr/lib64/dirsrv/plugins/libreplication-plugin.so
#7 0x00007f6d0ec49761 in _cl5TrimMain () at target:/usr/lib64/dirsrv/plugins/libreplication-plugin.so
#8 0x00007f6d13e4d3d7 in _pt_root () at target:/lib64/libnspr4.so
#9 0x00007f6d1407e168 in start_thread () at target:/lib64/libc.so.6
#10 0x00007f6d1410214c in __clone3 () at target:/lib64/libc.so.6
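
To see which threads are stuck on a lock and who called into it, a saved stack dump can be summarised with a short script. The following is only a minimal sketch: it assumes the dump is in the gdb backtrace layout shown above, and the function names parse_threads and blocked_on_mutex are hypothetical helpers introduced for illustration, not part of any 389-ds or FreeIPA tooling.

```python
import re

# Header of a thread section, e.g. 'Thread 34 (Thread 0x... (LWP 3980434) "ns-slapd"):'
THREAD_RE = re.compile(r'^Thread (\d+) \(')
# One frame, e.g. '#1 0x00007f6d0f6b6bd2 in mdb_txn_renew0 () at ...'
FRAME_RE = re.compile(r'^#(\d+)\s+(?:0x[0-9a-f]+ in )?(\S+) \(')

# Excerpt of the trace above, used as sample input.
SAMPLE = """\
Thread 34 (Thread 0x7f6d0e75a6c0 (LWP 3980434) "ns-slapd"):
#0 0x00007f6d14081332 in __pthread_mutex_lock_full () at
target:/lib64/libc.so.6
#1 0x00007f6d0f6b6bd2 in mdb_txn_renew0 () at target:/lib64/liblmdb.so.0.0.0
#2 0x00007f6d0f6b73c4 in mdb_txn_begin () at target:/lib64/liblmdb.so.0.0.0
#3 0x00007f6d0ef88b55 in dbmdb_start_txn () at
target:/usr/lib64/dirsrv/plugins/libback-ldbm.so
"""


def parse_threads(dump):
    """Map thread number -> list of function names, innermost frame first."""
    threads = {}
    current = None
    for raw in dump.splitlines():
        line = raw.strip()
        header = THREAD_RE.match(line)
        if header:
            current = int(header.group(1))
            threads.setdefault(current, [])
            continue
        frame = FRAME_RE.match(line)
        if frame and current is not None:
            threads[current].append(frame.group(2))
        # Wrapped continuation lines (e.g. 'target:/lib64/...') match neither
        # pattern and are skipped.
    return threads


def blocked_on_mutex(threads, callers=3):
    """Return {thread: nearest callers} for threads whose innermost frame
    is a pthread mutex lock function."""
    blocked = {}
    for tid, frames in threads.items():
        if frames and "pthread_mutex_lock" in frames[0]:
            blocked[tid] = frames[1:1 + callers]
    return blocked


for tid, chain in blocked_on_mutex(parse_threads(SAMPLE)).items():
    print(f"thread {tid} waits on a mutex, called via: {' <- '.join(chain)}")
```

Run against a full dump of all threads, this lists every thread parked in a mutex lock, which makes it easier to tell whether they all arrive through the same LMDB transaction path.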
In the error logs, I can see a lot of errors like this one:
[01/Mar/2025:10:53:22.462107359 +0100] - ERR - NSMMReplicationPlugin - changelog program - repl_plugin_name_cl - agmt="cn=meToserver2.mydomain.com" (server2:389): CSN 67c2c570000500040000 not found, we aren't as up to date, or we purged
And also a lot of lines like this one:
[01/Mar/2025:10:53:19.683683661 +0100] - INFO - idl_lmdb_range_fetch - flag_err=0
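
To put a number on "a lot", the error log can be tallied by severity and reporting component. This is a rough sketch that assumes every entry starts with the "[timestamp] - LEVEL - component -" prefix seen in the two lines above; count_by_source is a hypothetical helper name, and the sample input is taken from those two entries.

```python
import re
from collections import Counter

# Prefix of one log entry: '[timestamp] - LEVEL - component -'
LOG_RE = re.compile(r'^\[([^\]]+)\] - ([A-Z]+) - (\S+) -')

# The two entries quoted above, including their wrapped continuation lines.
SAMPLE_LINES = [
    '[01/Mar/2025:10:53:22.462107359 +0100] - ERR - NSMMReplicationPlugin -',
    'changelog program - repl_plugin_name_cl - agmt="cn=meToserver2.mydomain.com"',
    '[01/Mar/2025:10:53:19.683683661 +0100] - INFO - idl_lmdb_range_fetch -',
    'flag_err=0',
]


def count_by_source(lines):
    """Count log entries per (level, component) pair.

    Lines that do not start with the bracketed timestamp prefix (wrapped
    continuation lines, for example) are ignored.
    """
    counts = Counter()
    for line in lines:
        match = LOG_RE.match(line)
        if match:
            counts[(match.group(2), match.group(3))] += 1
    return counts


for (level, source), n in count_by_source(SAMPLE_LINES).most_common():
    print(f"{n:6d}  {level:5s} {source}")
```

The function accepts any iterable of lines, so an open file object for the instance's error log can be passed in directly.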
I even tried reinstalling FreeIPA on these two servers a few days ago.
After a clean install, everything worked well for 1 or 2 days, but then the
deadlock came back.
Thanks,
Laurent.