On Mon, Jan 11, 2010 at 11:37 AM, Steven Dake <[email protected]> wrote:
> On Fri, 2010-01-08 at 16:54 -0700, hj lee wrote: > > Hi, > > > > My simple two-node cluster is configured with two ring interfaces > > (ring 0 and ring 1). If I disconnect one of cable at one of nodes, > > then openais enters to GATHER mode and returns to OPERATIONAL mode. > > This takes 200 - 300 msec. In GATHER mode, the openais does not seem > > to send and receive any message from Pacemaker like messages for > > updating cib files. > > > > What is the reason behind of this behavior while the openais is still > > connected with other ring interface? Is it possible to disable this > > behavior? > > > > In my configuration, the eth0 is used both for openais and pingd > > monitoring. So when eth0 is disconnected, pingd fails and also openais > > enters GATHER mode at the same time. The pingd failure trigger the > > resource migration, but this migration is delayed by 200 - 300 msec. I > > want to get rid of this delay when pingd fails. > > > > This seems quite odd. Gather should not be entered unless the network > interface is actually changed. It may be that you are running a tool > such as network manager which is downing the interface. > > Running network manager with corosync is likely to provide bad results > because network manager destroys interfaces within the kernel. > > Please report back if this is your issue. > > I am not sure what you mean "network manager", I do not run any thing like that.. I just unplug one of two ring NIC cables, sometimes it enters GATHER mode. If I do ifdown one of two ring interfaces, sometimes it enters GATHER mode. What I want is the Openais enters GATHER mode only when both ring interfaces are disconnected. 2010-01-11 16:18:09.954914 silverthorne2-openais[3789]: [totemrrp.c:0803] Marking seqid 1710799 ringid 1 interface 172.16.127.30 FAULTY - adminisrtative intervention required. 2010-01-11 16:18:10.478828 silverthorne2-pingd: [4000]: info: stand_alone_ping: Node 111.16.127.254 is unreachable (read) 2010-01-11 16:18:10.900847 silverthorne2-vmre[18099]: vmre_send_keepalive: 2010-01-11 16:18:10.900875 silverthorne2-vmre[18099]: vmre_send: 2010-01-11 16:18:10.900882 silverthorne2-vmre[18099]: vmre_send_keepalive: vm_state 4 sent 8 bytes num_ckpts, ckpt_seq since last ka=73 current ckpt_seq=527662 2010-01-11 16:18:11.138074 silverthorne2-openais[3789]: [totemsrp.c:3339] FAILED TO RECEIVE 2010-01-11 16:18:11.138217 silverthorne2-openais[3789]: [totemsrp.c:1732] entering GATHER state from 6. 2010-01-11 16:18:11.138226 silverthorne2-openais[3789]: [totemsrp.c:2788] Creating commit token because I am the rep. 2010-01-11 16:18:11.138235 silverthorne2-openais[3789]: [totemsrp.c:1303] Saving state aru 4d high seq received 4d 2010-01-11 16:18:11.138267 silverthorne2-openais[3789]: [totemsrp.c:2949] Storing new sequence id for ring 134e0 2010-01-11 16:18:11.138276 silverthorne2-openais[3789]: [totemsrp.c:1771] entering COMMIT state. 2010-01-11 16:18:11.138744 silverthorne2-openais[3789]: [totemsrp.c:1803] entering RECOVERY state. 2010-01-11 16:18:11.138874 silverthorne2-openais[3789]: [totemsrp.c:1832] position [0] member 192.168.10.21: 2010-01-11 16:18:11.139056 silverthorne2-openais[3789]: [totemsrp.c:1836] previous ring seq 79068 rep 192.168.10.21
_______________________________________________ Openais mailing list [email protected] https://lists.linux-foundation.org/mailman/listinfo/openais
