Hello Shilen,

   I wonder if it is not a transient issue.
   If you add an other entry on the supplier, is it replicated  and the
   warning messages do no long occur ?

   thanks
   theirry

On 08/22/2014 03:59 PM, Shilen Patel wrote:
Once the init is done, the following is repeated on the consumer (when logging is turned up). Note that the ADD was done while the init was happening and after the init is done, the entry does exist on the consumer but, from what I can tell, the supplier wants to still send it to the consumer and the consumer is rejecting it.

Also, the other issue that I'm noticing is that a couple of my consumers seem to be crashing (about once a day). Right now, they are not getting any traffic at all other than updates from replication. I'm debugging that at the moment and will report what I find.

[22/Aug/2014:14:48:34 +0100] - import test: Import complete. Processed 282 entries in 145 seconds. (1.94 entries/sec) [22/Aug/2014:14:48:34 +0100] NSMMReplicationPlugin - multimaster_be_state_change: replica ou=test,dc=duke,dc=edu is coming online; enabling replication [22/Aug/2014:14:48:34 +0100] NSMMReplicationPlugin - _replica_configure_ruv: No ruv tombstone found for replica ou=test,dc=duke,dc=edu. Created a new one [22/Aug/2014:14:48:34 +0100] NSMMReplicationPlugin - conn=0 op=0 repl="ou=test,dc=duke,dc=edu": Released replica held by locking_purl=conn=15 id=4 [22/Aug/2014:14:48:34 +0100] NSMMReplicationPlugin - replica_enable_replication: replica ou=test,dc=duke,dc=edu is relinquished [22/Aug/2014:14:48:34 +0100] NSMMReplicationPlugin - conn=15 op=287 repl="ou=test,dc=duke,dc=edu": Replica not in use [22/Aug/2014:14:48:34 +0100] NSMMReplicationPlugin - conn=15 op=287 Relinquishing consumer connection extension [22/Aug/2014:14:48:36 +0100] NSMMReplicationPlugin - conn=13 op=69 Acquired consumer connection extension [22/Aug/2014:14:48:36 +0100] NSMMReplicationPlugin - conn=13 op=69 repl="ou=test,dc=duke,dc=edu": Begin incremental protocol [22/Aug/2014:14:48:36 +0100] - csngen_adjust_time: gen state before 53f74a330001:1408715314:0:1 [22/Aug/2014:14:48:36 +0100] - _csngen_adjust_local_time: gen state before 53f74a330001:1408715314:0:1 [22/Aug/2014:14:48:36 +0100] - _csngen_adjust_local_time: gen state after 53f74a350000:1408715316:0:1 [22/Aug/2014:14:48:36 +0100] NSMMReplicationPlugin - conn=13 op=69 repl="ou=test,dc=duke,dc=edu": Acquired replica [22/Aug/2014:14:48:36 +0100] NSMMReplicationPlugin - conn=13 op=69 repl="ou=test,dc=duke,dc=edu": StartNSDS50ReplicationRequest: response=0 rc=0 [22/Aug/2014:14:48:36 +0100] NSMMReplicationPlugin - conn=13 op=69 Relinquishing consumer connection extension [22/Aug/2014:14:48:36 +0100] NSMMReplicationPlugin - ruv_add_csn_inprogress: successfully inserted csn 53f749bb000000050000 into pending list [22/Aug/2014:14:48:36 +0100] £ - urp_add (uid=shilen3,ou=people,ou=test,dc=duke,dc=edu): an entry with this uniqueid already exists. [22/Aug/2014:14:48:36 +0100] NSMMReplicationPlugin - conn=13 op=70 csn=53f749bb000000050000 process postop: canceling operation csn [22/Aug/2014:14:48:37 +0100] NSMMReplicationPlugin - conn=16 op=63 Acquired consumer connection extension [22/Aug/2014:14:48:37 +0100] NSMMReplicationPlugin - conn=16 op=63 repl="ou=test,dc=duke,dc=edu": Begin incremental protocol [22/Aug/2014:14:48:37 +0100] - csngen_adjust_time: gen state before 53f74a350000:1408715316:0:1 [22/Aug/2014:14:48:37 +0100] - _csngen_adjust_local_time: gen state before 53f74a350000:1408715316:0:1 [22/Aug/2014:14:48:37 +0100] - _csngen_adjust_local_time: gen state after 53f74a360000:1408715317:0:1 [22/Aug/2014:14:48:37 +0100] - csngen_adjust_time: gen state after 53f74a360000:1408715317:0:1 [22/Aug/2014:14:48:37 +0100] NSMMReplicationPlugin - conn=16 op=63 repl="ou=test,dc=duke,dc=edu": Replica in use locking_purl=conn=13 id=69 [22/Aug/2014:14:48:37 +0100] NSMMReplicationPlugin - conn=16 op=63 replica="ou=test,dc=duke,dc=edu": Unable to acquire replica: error: replica busy locked by conn=13 id=69 for incremental update [22/Aug/2014:14:48:37 +0100] NSMMReplicationPlugin - conn=16 op=63 repl="ou=test,dc=duke,dc=edu": StartNSDS50ReplicationRequest: response=1 rc=0 [22/Aug/2014:14:48:37 +0100] NSMMReplicationPlugin - conn=16 op=63 Relinquishing consumer connection extension [22/Aug/2014:14:48:37 +0100] NSMMReplicationPlugin - conn=18 op=4 Acquired consumer connection extension [22/Aug/2014:14:48:37 +0100] NSMMReplicationPlugin - conn=18 op=4 repl="ou=test,dc=duke,dc=edu": Begin incremental protocol [22/Aug/2014:14:48:37 +0100] - csngen_adjust_time: gen state before 53f74a360000:1408715317:0:1 [22/Aug/2014:14:48:37 +0100] - csngen_adjust_time: gen state after 53f74a360000:1408715317:0:1 [22/Aug/2014:14:48:37 +0100] NSMMReplicationPlugin - conn=18 op=4 repl="ou=test,dc=duke,dc=edu": Replica in use locking_purl=conn=13 id=69 [22/Aug/2014:14:48:37 +0100] NSMMReplicationPlugin - conn=18 op=4 replica="ou=test,dc=duke,dc=edu": Unable to acquire replica: error: replica busy locked by conn=13 id=69 for incremental update [22/Aug/2014:14:48:37 +0100] NSMMReplicationPlugin - conn=18 op=4 repl="ou=test,dc=duke,dc=edu": StartNSDS50ReplicationRequest: response=1 rc=0 [22/Aug/2014:14:48:37 +0100] NSMMReplicationPlugin - conn=18 op=4 Relinquishing consumer connection extension [22/Aug/2014:14:48:37 +0100] NSMMReplicationPlugin - conn=13 op=71 Acquired consumer connection extension [22/Aug/2014:14:48:37 +0100] NSMMReplicationPlugin - conn=13 op=71 repl="ou=test,dc=duke,dc=edu": Released replica held by locking_purl=conn=13 id=69 [22/Aug/2014:14:48:37 +0100] NSMMReplicationPlugin - conn=13 op=71 Relinquishing consumer connection extension [22/Aug/2014:14:48:38 +0100] NSMMReplicationPlugin - conn=17 op=63 Acquired consumer connection extension [22/Aug/2014:14:48:38 +0100] NSMMReplicationPlugin - conn=17 op=63 repl="ou=test,dc=duke,dc=edu": Begin incremental protocol [22/Aug/2014:14:48:38 +0100] - csngen_adjust_time: gen state before 53f74a360000:1408715317:0:1 [22/Aug/2014:14:48:38 +0100] - _csngen_adjust_local_time: gen state before 53f74a360000:1408715317:0:1 [22/Aug/2014:14:48:38 +0100] - _csngen_adjust_local_time: gen state after 53f74a370000:1408715318:0:1 [22/Aug/2014:14:48:38 +0100] NSMMReplicationPlugin - conn=17 op=63 repl="ou=test,dc=duke,dc=edu": Acquired replica [22/Aug/2014:14:48:38 +0100] NSMMReplicationPlugin - conn=17 op=63 repl="ou=test,dc=duke,dc=edu": StartNSDS50ReplicationRequest: response=0 rc=0 [22/Aug/2014:14:48:38 +0100] NSMMReplicationPlugin - conn=17 op=63 Relinquishing consumer connection extension [22/Aug/2014:14:48:38 +0100] NSMMReplicationPlugin - ruv_add_csn_inprogress: successfully inserted csn 53f749bb000000050000 into pending list [22/Aug/2014:14:48:38 +0100] £ - urp_add (uid=shilen3,ou=people,ou=test,dc=duke,dc=edu): an entry with this uniqueid already exists.

Thanks!

-- Shilen


From: thierry bordaz <[email protected] <mailto:[email protected]>>
Date: Thursday, August 21, 2014 10:20 AM
To: "[email protected] <mailto:[email protected]>" <[email protected] <mailto:[email protected]>> Cc: "General discussion list for the 389 Directory server project." <[email protected] <mailto:[email protected]>>, Shilen Patel <[email protected] <mailto:[email protected]>>
Subject: Re: [389-users] Replication error after initializing consumer

    Hello,

        This logs are coming from the supplier. It confirms what you
        said. The supplier retrieves a starting point for sending
        updates (53f5e4b5000000050000) then sends the next update
        53f5e647000000050000
        (uid=shilen3,ou=people,ou=test,dc=duke,dc=edu). For some
        reason the consumerhost:636 is unwilling to apply the update
        (is the backend readonly ?).
        Have you got the error logs from consumerhost (with
        replication logging enabled) ?.

        thanks
        theirry

    On 08/21/2014 02:45 PM, Shilen Patel wrote:
    Here's the additional logging for the init issue:

    [21/Aug/2014:13:31:55 +0100] NSMMReplicationPlugin - Finished
    total update of replica "agmt="cn=test5toSing1"
    (consumerhost:636)". Sent 278 entries.
    [21/Aug/2014:13:31:55 +0100] NSMMReplicationPlugin -
    agmt="cn=test5toSing1" (consumerhost:636): Cancelling linger on
    the connection
    [21/Aug/2014:13:31:55 +0100] NSMMReplicationPlugin -
    agmt="cn=test5toSing1" (consumerhost:636): Disconnected from the
    consumer
    [21/Aug/2014:13:31:55 +0100] NSMMReplicationPlugin -
    agmt="cn=test5toSing1" (consumerhost:636): State: start ->
    ready_to_acquire_replica
    [21/Aug/2014:13:31:55 +0100] NSMMReplicationPlugin -
    agmt="cn=test5toSing1" (consumerhost:636): Trying secure
    slapi_ldap_init_ext
    [21/Aug/2014:13:31:55 +0100] NSMMReplicationPlugin -
    agmt="cn=test5toSing1" (consumerhost:636): binddn = cn=xxx
     passwd = xxx
    [21/Aug/2014:13:31:56 +0100] NSMMReplicationPlugin -
    agmt="cn=test5toSing1" (consumerhost:636): No linger to cancel on
    the connection
    [21/Aug/2014:13:31:57 +0100] - _csngen_adjust_local_time: gen
    state before 53f5e64c0000:1408624203:0:1
    [21/Aug/2014:13:31:57 +0100] - _csngen_adjust_local_time: gen
    state after 53f5e6be0000:1408624317:0:1
    [21/Aug/2014:13:31:58 +0100] NSMMReplicationPlugin -
    agmt="cn=test5toSing1" (consumerhost:636): Replica was
    successfully acquired.
    [21/Aug/2014:13:31:58 +0100] NSMMReplicationPlugin -
    agmt="cn=test5toSing1" (consumerhost:636): State:
    ready_to_acquire_replica -> sending_updates
    [21/Aug/2014:13:31:58 +0100] - csngen_adjust_time: gen state
    before 53f5e6be0001:1408624317:0:1
    [21/Aug/2014:13:31:58 +0100] - _csngen_adjust_local_time: gen
    state before 53f5e6be0001:1408624317:0:1
    [21/Aug/2014:13:31:58 +0100] - _csngen_adjust_local_time: gen
    state after 53f5e6bf0000:1408624318:0:1
    [21/Aug/2014:13:31:58 +0100] NSMMReplicationPlugin - changelog
    program - _cl5GetDBFile: found DB object 2b9c0c0 for database
    
/var/lib/dirsrv/slapd-idms-authdir/changelogdb/e9c04c06-1c0711e4-a066d327-58be45f0_53e0f14e000000050000.db4
    [21/Aug/2014:13:31:58 +0100] - _cl5PositionCursorForReplay
    (agmt="cn=test5toSing1" (consumerhost:636)): Consumer RUV:
    [21/Aug/2014:13:31:58 +0100] NSMMReplicationPlugin -
    agmt="cn=test5toSing1" (consumerhost:636): {replicageneration}
    53e0f14e000000050000
    [21/Aug/2014:13:31:58 +0100] NSMMReplicationPlugin -
    agmt="cn=test5toSing1" (consumerhost:636): {replica 5
    ldap://xxx:389} 53e0f3cc000000050000 53f5e4b5000000050000 00000000
    [21/Aug/2014:13:31:58 +0100] NSMMReplicationPlugin -
    agmt="cn=test5toSing1" (consumerhost:636): {replica 8 ldap://xxx:389}
    [21/Aug/2014:13:31:58 +0100] NSMMReplicationPlugin -
    agmt="cn=test5toSing1" (consumerhost:636): {replica 7
    ldap://xxx:389} 53f3a401000000070000 53f3a401000000070000 00000000
    [21/Aug/2014:13:31:58 +0100] NSMMReplicationPlugin -
    agmt="cn=test5toSing1" (consumerhost:636): {replica 6
    ldap://xxx:389} 53e3b64f000000060000 53f3a68e000000060000 00000000
    [21/Aug/2014:13:31:58 +0100] - _cl5PositionCursorForReplay
    (agmt="cn=test5toSing1" (consumerhost:636)): Supplier RUV:
    [21/Aug/2014:13:31:58 +0100] NSMMReplicationPlugin -
    agmt="cn=test5toSing1" (consumerhost:636): {replicageneration}
    53e0f14e000000050000
    [21/Aug/2014:13:31:58 +0100] NSMMReplicationPlugin -
    agmt="cn=test5toSing1" (consumerhost:636): {replica 5
    ldap://xxx:389} 53e0f3cc000000050000 53f5e647000000050000 53f5e646
    [21/Aug/2014:13:31:58 +0100] NSMMReplicationPlugin -
    agmt="cn=test5toSing1" (consumerhost:636): {replica 8 ldap://xxx:389}
    [21/Aug/2014:13:31:58 +0100] NSMMReplicationPlugin -
    agmt="cn=test5toSing1" (consumerhost:636): {replica 7
    ldap://xxx:389} 53f3a401000000070000 53f3a401000000070000 00000000
    [21/Aug/2014:13:31:58 +0100] NSMMReplicationPlugin -
    agmt="cn=test5toSing1" (consumerhost:636): {replica 6
    ldap://xxx:389} 53e3b64f000000060000 53f3a68e000000060000 00000000
    [21/Aug/2014:13:31:58 +0100] agmt="cn=test5toSing1"
    (consumerhost:636) - clcache_get_buffer: found thread private
    buffer cache 7fac100773d0
    [21/Aug/2014:13:31:58 +0100] agmt="cn=test5toSing1"
    (consumerhost:636) - clcache_get_buffer: _pool is 2b8b6f0
    _pool->pl_busy_lists is 7fac140012e0
    _pool->pl_busy_lists->bl_buffers is 7fac04126e60
    [21/Aug/2014:13:31:58 +0100] agmt="cn=test5toSing1"
    (consumerhost:636) - session start: anchorcsn=53f5e4b5000000050000
    [21/Aug/2014:13:31:58 +0100] NSMMReplicationPlugin - changelog
    program - agmt="cn=test5toSing1" (consumerhost:636): CSN
    53f5e4b5000000050000 found, position set for replay
    [21/Aug/2014:13:31:58 +0100] agmt="cn=test5toSing1"
    (consumerhost:636) - load=1 rec=1 csn=53f5e647000000050000
    [21/Aug/2014:13:31:58 +0100] NSMMReplicationPlugin -
    agmt="cn=test5toSing1" (consumerhost:636): replay_update: Sending
    add operation (dn="uid=shilen3,ou=people,ou=test,dc=duke,dc=edu"
    csn=53f5e647000000050000)
    [21/Aug/2014:13:31:58 +0100] NSMMReplicationPlugin -
    agmt="cn=test5toSing1" (consumerhost:636): replay_update:
    Consumer successfully sent operation with csn 53f5e647000000050000
    [21/Aug/2014:13:31:58 +0100] NSMMReplicationPlugin -
    agmt="cn=test5toSing1" (consumerhost:636): Consumer failed to
    replay change (uniqueid b92e2101-292e11e4-a066d327-58be45f0, CSN
    53f5e647000000050000): Server is unwilling to perform (53). Will
    retry later.
    [21/Aug/2014:13:31:58 +0100] agmt="cn=test5toSing1"
    (consumerhost:636) - session end: state=0 load=1 sent=1 skipped=0
    skipped_new_rid=0 skipped_csn_gt_cons_maxcsn=0
    skipped_up_to_date=0 skipped_csn_gt_ruv=0 skipped_csn_covered=0
    [21/Aug/2014:13:31:59 +0100] NSMMReplicationPlugin -
    agmt="cn=test5toSing1" (consumerhost:636): Successfully released
    consumer
    [21/Aug/2014:13:31:59 +0100] NSMMReplicationPlugin -
    agmt="cn=test5toSing1" (consumerhost:636): Beginning linger on
    the connection
    [21/Aug/2014:13:31:59 +0100] NSMMReplicationPlugin -
    agmt="cn=test5toSing1" (consumerhost:636): State: sending_updates
    -> start_backoff

    Thanks!

    -- Shilen

    From: Mark Reynolds <[email protected]
    <mailto:[email protected]>>
    Reply-To: "[email protected] <mailto:[email protected]>"
    <[email protected] <mailto:[email protected]>>
    Date: Tuesday, August 19, 2014 3:32 PM
    To: "General discussion list for the 389 Directory server
    project." <[email protected]
    <mailto:[email protected]>>, Shilen Patel
    <[email protected] <mailto:[email protected]>>
    Subject: Re: [389-users] Replication error after initializing
    consumer


        On 08/19/2014 03:05 PM, Shilen Patel wrote:
        Hi Mark,

        So with this issue, I'm not adding a password at all to the
        entry.  See the LDIF in (2) below.
No, I was referring to your first email, not this issue. Sorry I should of responded separately.
         I'll open a ticket.
        For what issue?  For the initialization issue I want you to
        grab the logging first.  For the first issue(userpassword not
        being replicated), if you can reproduce it when only adding
        already hashed passwords, then please file a ticket with a
        reproducible testcase.

        I hope that clears things up.

        Regards,
        Mark



        Thanks!

        -- Shilen

        From: Mark Reynolds <[email protected]
        <mailto:[email protected]>>
        Reply-To: "[email protected]
        <mailto:[email protected]>" <[email protected]
        <mailto:[email protected]>>
        Date: Tuesday, August 19, 2014 2:58 PM
        To: "General discussion list for the 389 Directory server
        project." <[email protected]
        <mailto:[email protected]>>, Shilen Patel
        <[email protected] <mailto:[email protected]>>
        Subject: Re: [389-users] Replication error after
        initializing consumer

            Shilen,

            A few things, you should not be adding a prehashed
            password (e.g. {SSHA}DMK4S6PK6+rKSLNOL1Hl01mVJmgGi5jH) -
            but that should not break replication.  Can you confirm
            that only prehashed passwords are causing the issue?  If
            so, please files a ticket with a reproducible testcase:
            https://fedorahosted.org/389/newticket

            As for this replication issue, enabling replication
            logging might provide more information:

            
https://access.redhat.com/documentation/en-US/Red_Hat_Directory_Server/9.0/html/Configuration_Command_and_File_Reference/error-logs.html#error-logs-levels

            Then post the logging output back on this list.  You
            should also undo this logging level once you're done -
            as this logging level can impact performance.

            Thanks,
            Mark


            On 08/19/2014 01:08 PM, Shilen Patel wrote:
            Hi,

            I'm not sure if this is related to my previous email,
            but I'm also seeing issues when adding an entry while a
            suffix on a consumer is being initialized.  Again, I'm
            running 1.2.11.30.  Here are the details:

            1.  Using the console, I started to initialize a suffix.

            [19/Aug/2014:17:46:35 +0100] NSMMReplicationPlugin -
            Beginning total update of replica
            "agmt="cn=test5toSing1" (consumerhost:636)".

            2.  While that was happening, I added an entry to the
            suffix on the master.

            dn: uid=shilen3,ou=people,ou=test,dc=duke,dc=edu
            changetype: add
            objectclass: top
            objectclass: person
            objectclass: inetorgperson
            objectclass: organizationalperson
            cn: test
            sn: test
            uid: shilen3


            [19/Aug/2014:17:47:04 +0100] conn=155 op=1 ADD
            dn="uid=shilen3,ou=people,ou=test,dc=duke,dc=edu"
            [19/Aug/2014:17:47:04 +0100] conn=155 op=1 RESULT err=0
            tag=105 nentries=0 etime=0 csn=53f37f89000000050000

            3.  The init later finished.

            [19/Aug/2014:17:48:55 +0100] NSMMReplicationPlugin -
            Finished total update of replica
            "agmt="cn=test5toSing1" (consumerhost:636)". Sent 270
            entries.

            At this point, the entry does exist on the consumer.
             Presumably it was added as part of the init.


            4.  However, the master still wants to send the ADD to
            the consumer.  On the consumer, I have the following
            repeated every few seconds:

            [19/Aug/2014:17:48:54 +0100] conn=2002 op=56 ADD
            dn="uid=shilen3,ou=people,ou=test,dc=duke,dc=edu"
            [19/Aug/2014:17:48:54 +0100] conn=2002 op=56 RESULT
            err=53 tag=105 nentries=0 etime=0 csn=53f37f89000000050000

            And on the master, I have this:

            [19/Aug/2014:17:48:58 +0100] NSMMReplicationPlugin -
            agmt="cn=test5toSing1" (consumerhost:636): Consumer
            failed to replay change (uniqueid
            5d458a02-27c011e4-a066d327-58be45f0, CSN
            53f37f89000000050000): Server is unwilling to perform
            (53). Will retry later.

            A cl-dump shows the following:

            changetype: add
            replgen: 53e0f14e000000050000
            csn: 53f37f89000000050000
            nsuniqueid: 5d458a02-27c011e4-a066d327-58be45f0
            parentuniqueid: bcd34401-21ba11df-80799838-3cd697e9
            dn: uid=shilen3,ou=people,ou=test,dc=duke,dc=edu
            change::
            add: objectClass
            objectClass: top
            objectClass: person
            objectClass: inetorgperson
            objectClass: organizationalperson
            -
            add: cn
            cn: test
            -
            add: sn
            sn: test
            -
            add: uid
            uid: shilen3
            -
            add: creatorsName
            creatorsName: cn=directory manager
            -
            add: modifiersName
            modifiersName: cn=directory manager
            -
            add: createTimestamp
            createTimestamp: 20140819164704Z
            -
            add: modifyTimestamp
            modifyTimestamp: 20140819164704Z
            -
            add: nsUniqueId
            nsUniqueId: 5d458a02-27c011e4-a066d327-58be45f0
            -
            add: parentid
            parentid: 2
            -
            add: entryid
            entryid: 277
            -
            add: entrydn
            entrydn: uid=shilen3,ou=people,ou=test,dc=duke,dc=edu
            -


            If anyone has any thoughts on this one, that would be
            appreciated.

            Thanks!

            -- Shilen


            --
            389 users mailing list
            
[email protected]https://admin.fedoraproject.org/mailman/listinfo/389-users



        --
        389 users mailing list
        
[email protected]https://admin.fedoraproject.org/mailman/listinfo/389-users



    --
    389 users mailing list
    
[email protected]https://admin.fedoraproject.org/mailman/listinfo/389-users


--
389 users mailing list
[email protected]
https://admin.fedoraproject.org/mailman/listinfo/389-users

Reply via email to