Hello, we have recently upgraded our 389-ds setup to version 2.6.1 running on AlmaLinux 9.6 (installed from the official AlmaLinux appstream repo). Or upgrade approach was to build a completely new setup, import all the data and afterwards switch the IP addresses of the old and new servers. Our setup consists of 2 suppliers (mdir01 + mdir02) and 2 consumers (sdir01 + sdir02). Both suppliers each have a replication agreement between each other, aswell as agreements to both consumers. Our provisioning system is designed to only write changes to mdir01, it just uses mdir02 in case it can't reach mdir01. Our clients (DHCP servers) use all 4 directory servers.
Since the upgrade we have that problem that we observe the following messages in the error logs of both suppliers: <code> [10/Jul/2025:09:58:55.479913820 +0200] - ERR - agmt="cn=agreement-mdir01-to-sdir02" (10:1389) - clcache_load_buffer - Can't locate CSN 686f72bf00050f4b0000 in the changelog (DB rc=-12797). If replication stops, the consumer may need to be reinitialized. [10/Jul/2025:09:58:55.484629809 +0200] - ERR - agmt="cn=agreement-mdir01-to-mdir02" (10:1389) - clcache_load_buffer - Can't locate CSN 686f72bf00050f4b0000 in the changelog (DB rc=-12797). If replication stops, the consumer may need to be reinitialized. [10/Jul/2025:09:58:55.484868342 +0200] - ERR - agmt="cn=agreement-mdir01-to-sdir01" (10:1389) - clcache_load_buffer - Can't locate CSN 686f72bf00050f4b0000 in the changelog (DB rc=-12797). If replication stops, the consumer may need to be reinitialized. [10/Jul/2025:10:01:07.738009372 +0200] - ERR - agmt="cn=agreement-mdir01-to-sdir02" (10:1389) - clcache_load_buffer - Can't locate CSN 686f734300000f4b0000 in the changelog (DB rc=-12797). If replication stops, the consumer may need to be reinitialized. [10/Jul/2025:10:01:07.741023198 +0200] - ERR - agmt="cn=agreement-mdir01-to-mdir02" (10:1389) - clcache_load_buffer - Can't locate CSN 686f734300000f4b0000 in the changelog (DB rc=-12797). If replication stops, the consumer may need to be reinitialized. [09/Jul/2025:12:18:14.429187195 +0200] - ERR - agmt="cn=agreement-mdir02-to-sdir01" (10:1389) - clcache_load_buffer - Can't locate CSN 686eb26600000f4b0000 in the changelog (DB rc=-12797). If replication stops, the consumer may need to be reinitialized. [09/Jul/2025:12:18:14.430628311 +0200] - ERR - agmt="cn=agreement-mdir02-to-sdir02" (10:1389) - clcache_load_buffer - Can't locate CSN 686eb26600000f4b0000 in the changelog (DB rc=-12797). If replication stops, the consumer may need to be reinitialized. [09/Jul/2025:12:18:16.909625172 +0200] - ERR - agmt="cn=agreement-mdir02-to-sdir01" (10:1389) - clcache_load_buffer - Can't locate CSN 686eb26800000f4b0000 in the changelog (DB rc=-12797). If replication stops, the consumer may need to be reinitialized. [09/Jul/2025:12:18:16.913147068 +0200] - ERR - agmt="cn=agreement-mdir02-to-sdir02" (10:1389) - clcache_load_buffer - Can't locate CSN 686eb26800000f4b0000 in the changelog (DB rc=-12797). If replication stops, the consumer may need to be reinitialized. [09/Jul/2025:12:42:30.255121122 +0200] - ERR - agmt="cn=agreement-mdir02-to-sdir01" (10:1389) - clcache_load_buffer - Can't locate CSN 686f60d500010f4b0000 in the changelog (DB rc=-12797). If replication stops, the consumer may need to be reinitialized. </code> On mdir01 these messages appear on average every 5 minutes during peak hours. On mdir02 much more infrequently, on average every 15-20 minutes. However, looking through the changelog I can find all the CSNs which it apparently can't locate: <code> changetype: delete replgen: 62b5bf320000010f0000 csn: 686f72bf00050f4b0000 nsuniqueid: da0de304-593411f0-aef0dc83-b57f4cfd dn: ClientIdentifier=00:00:00:00:d9:05,ou=dhcpldap,o=customer,dc=domain,dc=net changetype: delete replgen: 62b5bf320000010f0000 csn: 686f734300000f4b0000 nsuniqueid: 4c315e08-5d6111f0-aef0dc83-b57f4cfd dn: ClientIdentifier=00:00:00:00:c6:f4,ou=dhcpldap,o=customer,dc=domain,dc=net changetype: delete replgen: 62b5bf320000010f0000 csn: 686eb26600000f4b0000 nsuniqueid: 54f4e05b-04a311f0-9233a0e8-dc56aea6 dn: ClientIdentifier=00:00:00:00:26:99,ou=dhcpldap,o=customer,dc=domain,dc=net changetype: delete replgen: 62b5bf320000010f0000 csn: 686eb26800000f4b0000 nsuniqueid: 463ba84b-0af711f0-a597a0e8-dc56aea6 dn: ClientIdentifier=00:00:00:00:f7:a0,ou=dhcpldap,o=customer,dc=domain,dc=net changetype: add replgen: 62b5bf320000010f0000 csn: 686f60d500010f4b0000 nsuniqueid: e9d45f81-5d5811f0-aef0dc83-b57f4cfd parentuniqueid: 6167af03-f3c311ec-862ceac3-35201d04 dn: ClientIdentifier=00:00:00:00:16:68,ou=dhcpldap,o=customer,dc=domain,dc=net change:: ... </code> All the changes are populated to all directory servers in the cluster. So there is no real problem visible. In general, we have not seen any problems with replication whatsoever, we just have these seemingly "false" messages in the error log. Changelog trim is currently set to the following values: <code> nsslapd-changelogmaxage: 30d nsslapd-changelogtrim-interval: 3600 </code> Does anybody know why these error messages appear? And if / how we can get rid of them? I just want to make sure that there is really no underlying issue somewhere. And if those messages really falsely appear, I would like to get rid of them if possible to avoid confusion and to stop spamming the error logs. Thank you very much in advance. -- _______________________________________________ 389-users mailing list -- 389-users@lists.fedoraproject.org To unsubscribe send an email to 389-users-le...@lists.fedoraproject.org Fedora Code of Conduct: https://docs.fedoraproject.org/en-US/project/code-of-conduct/ List Guidelines: https://fedoraproject.org/wiki/Mailing_list_guidelines List Archives: https://lists.fedoraproject.org/archives/list/389-users@lists.fedoraproject.org Do not reply to spam, report it: https://pagure.io/fedora-infrastructure/new_issue