Hi,

On Sep 14, 2007, at 10:07 , Gérard BUNEL wrote:

Hello,

Setting up a Raid-b1 configuration, my 2 controllers can't see each other Looking at the logs I can see messages exchanged between the 2 controllers but for some reason I don't know the 2 controllers can't add the other in the group. I've set all log messages to DEBUG level
to be able to analyze. Can someone give some help on this problem ?


Looking at Appia, they can see each other. This is the start of the view change process, when this group member detects that there is another member to join:

2007-09-14 11:02:36,716 DEBUG group.intra.IntraSession Started view change due to ViewChange
2007-09-14 11:02:36,717 DEBUG group.sync.VSyncSession Started blocking
2007-09-14 11:02:36,717 DEBUG group.bottom.GroupBottomSession Trying to send to somenone but i'm the only one. Discarding Event 2007-09-14 11:02:36,717 DEBUG continuent.hedera.gms Appia reported block()


This is the new view, with 2 members in the group:

2007-09-14 11:02:38,227 DEBUG continuent.hedera.appia Received new view:
version: 1
group: [Group:MATISSEDB]
id: [ViewID:1;[AppiaHedera@/192.168.87.54:32768]]
previous: [[ViewID:0;[AppiaHedera@/192.168.87.54:32768]],[ViewID:0; [AppiaHedera@/192.168.87.64:32768]],] view: [[AppiaHedera@/192.168.87.54:32768],[AppiaHedera@/ 192.168.87.64:32768],]
addresses: [/192.168.87.54:32768,/192.168.87.64:32768,]

2007-09-14 11:02:38,228 DEBUG group.heal.GossipOutSession placing my address /192.168.87.54:32768 on message to /224.1.1.5:45566 2007-09-14 11:02:38,228 DEBUG continuent.hedera.gms Appia reported new view:
version: 1
group: [Group:MATISSEDB]
id: [ViewID:1;[AppiaHedera@/192.168.87.54:32768]]
previous: [[ViewID:0;[AppiaHedera@/192.168.87.54:32768]],[ViewID:0; [AppiaHedera@/192.168.87.64:32768]],] view: [[AppiaHedera@/192.168.87.54:32768],[AppiaHedera@/ 192.168.87.64:32768],]
addresses: [/192.168.87.54:32768,/192.168.87.64:32768,]

2007-09-14 11:02:38,229 DEBUG continuent.hedera.gms Member(address=/ 192.168.87.64:32768, uid=192.168.87.64:32768) joins Group (gid=MATISSEDB).

And then we can see a lot of control messages on Appia.
After a while, we can see some messages in the from the Sequoia controller being exchanged. The AppiaGroupEvent is a message produced by Sequoia.


2007-09-14 11:03:37,284 DEBUG protocols.udpsimple.UdpSimpleSession $UdpSimpleReader :run: PtP datagram received. Size = 1661 2007-09-14 11:03:37,285 DEBUG protocols.udpsimple.UdpSimpleSession $UdpSimpleReader :receiveAndFormat: Reader, creating org.continuent.hedera.appia.AppiaGroupEvent event. 2007-09-14 11:03:37,285 DEBUG protocols.udpsimple.UdpSimpleSession $UdpSimpleReader :receiveAndFormat: UDP p2p TOKEN Channel (1135015211) 2007-09-14 11:03:37,285 DEBUG protocols.udpsimple.UdpSimpleSession :handleSendable: [EMAIL PROTECTED] 2007-09-14 11:03:37,285 DEBUG group.bottom.GroupBottomSession Received message [EMAIL PROTECTED] from 1

And then you see a message from Sequoia like this:

2007-09-14 11:03:37,305 DEBUG controller.virtualdatabase.MATISSEDB handleMessageSingleThreaded (class org.continuent.sequoia.controller.virtualdatabase.protocol.SuspendActi vity): org.continuent.sequoia.controller.virtualdatabase.protocol.SuspendActi [EMAIL PROTECTED] 2007-09-14 11:03:37,309 DEBUG controller.virtualdatabase.MATISSEDB handleMessageMultiThreaded (class org.continuent.sequoia.controller.virtualdatabase.protocol.SuspendActi vity): org.continuent.sequoia.controller.virtualdatabase.protocol.SuspendActi [EMAIL PROTECTED] 2007-09-14 11:03:37,309 DEBUG sequoia.controller.scheduler Suspending new persistent connections 2007-09-14 11:03:37,309 DEBUG sequoia.controller.scheduler Suspending new transactions and writes 2007-09-14 11:03:37,309 DEBUG sequoia.controller.scheduler All writes suspended 2007-09-14 11:03:37,309 DEBUG sequoia.controller.scheduler All transactions suspended 2007-09-14 11:03:37,309 DEBUG sequoia.controller.scheduler All persistent connections closed 2007-09-14 11:03:37,310 DEBUG continuent.hedera.adapters Replying to Member(address=/192.168.87.64:32768, uid=192.168.87.64:32768) for message 0


(...)

2007-09-14 11:03:37,813 DEBUG sequoia.controller.loadbalancer Removed org.continuent.sequoia.controller.virtualdatabase.protocol.SuspendWrit [EMAIL PROTECTED] from total order queue 2007-09-14 11:03:37,816 INFO controller.virtualdatabase.MATISSEDB Checkpoint disable all backends-192.168.87.64:25322-20070914031034930+0200 was stored 2007-09-14 11:03:37,816 WARN controller.virtualdatabase.MATISSEDB No information has been found for remote controller Member(address=/ 192.168.87.64:32768, uid=192.168.87.64:32768) 2007-09-14 11:03:37,817 WARN controller.virtualdatabase.MATISSEDB Updating backend list with unknown backend sd01 disabled on controller Member(address=/192.168.87.64:32768, uid=192.168.87.64:32768) 2007-09-14 11:03:37,817 DEBUG continuent.hedera.adapters Replying to Member(address=/192.168.87.64:32768, uid=192.168.87.64:32768) for message 1 2007-09-14 11:03:37,819 DEBUG sequoia.controller.recoverylog Storing checkpoint disable all backends-192.168.87.64:25322-20070914031034930+0200 at request id 0

After a while, this group member receives a LeaveEvent from the other controller. This somehow was forced by the other controller and will cause this one leave the group:


2007-09-14 11:03:40,374 DEBUG protocols.udpsimple.UdpSimpleSession $UdpSimpleReader :receiveAndFormat: Reader, creating org.continuent.appia.protocols.group.leave.LeaveEvent event. 2007-09-14 11:03:40,375 DEBUG protocols.udpsimple.UdpSimpleSession $UdpSimpleReader :receiveAndFormat: UDP p2p TOKEN Channel (1135015211) 2007-09-14 11:03:40,375 DEBUG protocols.udpsimple.UdpSimpleSession :handleSendable: [EMAIL PROTECTED] 2007-09-14 11:03:40,375 DEBUG group.bottom.GroupBottomSession Received message [EMAIL PROTECTED] from 1 2007-09-14 11:03:40,375 DEBUG group.stable.StableSession Received 147 from 1 2007-09-14 11:03:40,376 DEBUG total.token.TotalTokenSession My rank = 0 hasToken = 1 2007-09-14 11:03:40,376 DEBUG total.token.TotalTokenSession Received Group Sendable from the network with seq = 294 token = true 2007-09-14 11:03:40,376 DEBUG total.token.TotalTokenSession Delivering message with seq = 294 2007-09-14 11:03:40,376 DEBUG total.token.TotalTokenSession I'll try to send some messages 2007-09-14 11:03:40,376 DEBUG total.token.TotalTokenSession I do not have any messages. Rotanting token. My rank is 0 2007-09-14 11:03:40,376 DEBUG group.intra.IntraSession Started view change due to ViewChange 2007-09-14 11:03:40,376 DEBUG group.bottom.GroupBottomSession Sending message (isSend=false) [EMAIL PROTECTED] 2007-09-14 11:03:40,376 DEBUG protocols.udpsimple.UdpSimpleSession :handleSendable: [EMAIL PROTECTED] 2007-09-14 11:03:40,377 DEBUG protocols.udpsimple.UdpSimpleSession :formatAndSend: 106 bytes datagram sent to 192.168.87.64 (port 32768)
2007-09-14 11:03:40,377 DEBUG group.sync.VSyncSession Started blocking
2007-09-14 11:03:40,377 DEBUG group.bottom.GroupBottomSession Sending message (isSend=true) [EMAIL PROTECTED] 2007-09-14 11:03:40,377 DEBUG protocols.udpsimple.UdpSimpleSession :handleSendable: [EMAIL PROTECTED] 2007-09-14 11:03:40,377 DEBUG protocols.udpsimple.UdpSimpleSession :formatAndSend: 89 bytes datagram sent to 192.168.87.64 (port 32768) 2007-09-14 11:03:40,378 DEBUG continuent.hedera.gms Appia reported block() 2007-09-14 11:03:40,378 DEBUG group.sync.VSyncSession {0} Sending my_casts and my_sends_sent to coordinator
2007-09-14 11:03:40,378 DEBUG group.sync.VSyncSession my_casts=[,,]
2007-09-14 11:03:40,378 DEBUG group.sync.VSyncSession my_sends_sent= [,,] 2007-09-14 11:03:40,379 DEBUG protocols.udpsimple.UdpSimpleSession $UdpSimpleReader :run: PtP datagram received. Size = 110 2007-09-14 11:03:40,379 DEBUG protocols.udpsimple.UdpSimpleSession $UdpSimpleReader :receiveAndFormat: Reader, creating org.continuent.appia.protocols.total.token.TokenEvent event. 2007-09-14 11:03:40,379 DEBUG protocols.udpsimple.UdpSimpleSession $UdpSimpleReader :receiveAndFormat: UDP p2p TOKEN Channel (1135015211) 2007-09-14 11:03:40,380 DEBUG protocols.udpsimple.UdpSimpleSession :handleSendable: [EMAIL PROTECTED] 2007-09-14 11:03:40,380 DEBUG group.bottom.GroupBottomSession Received message [EMAIL PROTECTED] from 1 2007-09-14 11:03:40,380 DEBUG group.stable.StableSession Received gossip information from 1 (doClean=true) 2007-09-14 11:03:40,380 DEBUG group.stable.StableSession Received 148 from 1 2007-09-14 11:03:40,380 DEBUG group.stable.StableStorage Cleaned 116 from 1 2007-09-14 11:03:40,380 DEBUG group.stable.StableStorage Cleaned 117 from 1 2007-09-14 11:03:40,380 DEBUG group.stable.StableStorage Cleaned 118 from 1 2007-09-14 11:03:40,380 DEBUG group.stable.StableStorage Cleaned 119 from 1 2007-09-14 11:03:40,380 DEBUG group.stable.StableStorage Cleaned 120 from 1 2007-09-14 11:03:40,381 DEBUG group.stable.StableStorage Cleaned 121 from 1 2007-09-14 11:03:40,381 DEBUG group.stable.StableStorage Cleaned 122 from 1 2007-09-14 11:03:40,381 DEBUG group.stable.StableStorage Cleaned 123 from 1 2007-09-14 11:03:40,381 DEBUG group.stable.StableStorage Cleaned 124 from 1 2007-09-14 11:03:40,381 DEBUG protocols.udpsimple.UdpSimpleSession $UdpSimpleReader :run: PtP datagram received. Size = 129 2007-09-14 11:03:40,381 DEBUG group.stable.StableStorage Cleaned 125 from 1 2007-09-14 11:03:40,381 DEBUG protocols.udpsimple.UdpSimpleSession $UdpSimpleReader :receiveAndFormat: Reader, creating org.continuent.appia.protocols.group.sync.Block event. 2007-09-14 11:03:40,381 DEBUG group.stable.StableStorage Cleaned 126 from 1 2007-09-14 11:03:40,381 DEBUG group.stable.StableStorage Cleaned 127 from 1 2007-09-14 11:03:40,381 DEBUG protocols.udpsimple.UdpSimpleSession $UdpSimpleReader :receiveAndFormat: UDP p2p TOKEN Channel (1135015211) 2007-09-14 11:03:40,381 DEBUG group.stable.StableStorage Cleaned 128 from 1 2007-09-14 11:03:40,382 DEBUG group.stable.StableStorage Cleaned 129 from 1 2007-09-14 11:03:40,382 DEBUG group.stable.StableStorage Cleaned 130 from 1 2007-09-14 11:03:40,382 DEBUG group.stable.StableStorage Cleaned 131 from 1 2007-09-14 11:03:40,382 DEBUG group.stable.StableStorage Cleaned 132 from 1 2007-09-14 11:03:40,382 DEBUG group.stable.StableStorage Cleaned 133 from 1 2007-09-14 11:03:40,382 DEBUG group.stable.StableStorage Cleaned 134 from 1 2007-09-14 11:03:40,382 DEBUG group.stable.StableStorage Cleaned 135 from 1 2007-09-14 11:03:40,382 DEBUG group.stable.StableStorage Cleaned 136 from 1 2007-09-14 11:03:40,382 DEBUG group.stable.StableStorage Cleaned 137 from 1 2007-09-14 11:03:40,383 DEBUG group.stable.StableStorage Cleaned 138 from 1 2007-09-14 11:03:40,383 DEBUG group.stable.StableStorage Cleaned 139 from 1 2007-09-14 11:03:40,383 DEBUG group.stable.StableStorage Cleaned 140 from 1 2007-09-14 11:03:40,383 DEBUG group.stable.StableStorage Cleaned 141 from 1 2007-09-14 11:03:40,383 DEBUG group.stable.StableStorage Cleaned 142 from 1 2007-09-14 11:03:40,383 DEBUG group.stable.StableStorage Cleaned 143 from 1 2007-09-14 11:03:40,383 DEBUG group.stable.StableStorage Cleaned 144 from 1 2007-09-14 11:03:40,383 DEBUG group.stable.StableStorage Cleaned 145 from 1 2007-09-14 11:03:40,384 DEBUG group.stable.StableStorage Cleaned 146 from 1 2007-09-14 11:03:40,384 DEBUG group.stable.StableStorage Cleaned 147 from 1 2007-09-14 11:03:40,384 DEBUG group.stable.StableStorage Cleaned 148 from 1 2007-09-14 11:03:40,384 DEBUG protocols.udpsimple.UdpSimpleSession :handleSendable: [EMAIL PROTECTED] 2007-09-14 11:03:40,384 DEBUG group.bottom.GroupBottomSession Received message [EMAIL PROTECTED] from 1 2007-09-14 11:03:40,384 DEBUG group.sync.VSyncSession {0} Ended blocking started synching 2007-09-14 11:03:40,384 DEBUG group.sync.VSyncSession global_casts= [,,] 2007-09-14 11:03:40,385 DEBUG group.sync.VSyncSession sends_table[0] =[,,] 2007-09-14 11:03:40,385 DEBUG group.sync.VSyncSession sends_table[1] =[,,]
2007-09-14 11:03:40,385 DEBUG group.sync.VSyncSession my_casts=[,,]
2007-09-14 11:03:40,385 DEBUG group.sync.VSyncSession testing in_sync
2007-09-14 11:03:40,385 DEBUG group.sync.VSyncSession in_sync: my_casts==global_casts -> true 2007-09-14 11:03:40,385 DEBUG group.sync.VSyncSession in_sync: my_sends_recv== global_sends -> true 2007-09-14 11:03:40,385 DEBUG total.token.TotalTokenSession My rank = 0 hasToken = 1 2007-09-14 11:03:40,385 DEBUG total.token.TotalTokenSession Received Group Sendable from the network with seq = 296 token = true 2007-09-14 11:03:40,385 DEBUG total.token.TotalTokenSession Delivering message with seq = 296 2007-09-14 11:03:40,386 DEBUG group.bottom.GroupBottomSession Sending message (isSend=true) [EMAIL PROTECTED] 2007-09-14 11:03:40,386 DEBUG protocols.udpsimple.UdpSimpleSession :handleSendable: [EMAIL PROTECTED] 2007-09-14 11:03:40,386 DEBUG protocols.udpsimple.UdpSimpleSession :formatAndSend: 128 bytes datagram sent to 192.168.87.64 (port 32768) 2007-09-14 11:03:40,389 DEBUG protocols.udpsimple.UdpSimpleSession $UdpSimpleReader :run: PtP datagram received. Size = 108 2007-09-14 11:03:40,389 DEBUG protocols.udpsimple.UdpSimpleSession $UdpSimpleReader :receiveAndFormat: Reader, creating org.continuent.appia.protocols.group.sync.Sync event. 2007-09-14 11:03:40,389 DEBUG protocols.udpsimple.UdpSimpleSession $UdpSimpleReader :receiveAndFormat: UDP p2p TOKEN Channel (1135015211) 2007-09-14 11:03:40,389 DEBUG protocols.udpsimple.UdpSimpleSession :handleSendable: [EMAIL PROTECTED] 2007-09-14 11:03:40,390 DEBUG group.bottom.GroupBottomSession Received message [EMAIL PROTECTED] from 1 2007-09-14 11:03:40,390 DEBUG protocols.udpsimple.UdpSimpleSession :handleSendable: [EMAIL PROTECTED] 2007-09-14 11:03:40,390 DEBUG protocols.udpsimple.UdpSimpleSession :formatAndSend: 85 bytes datagram sent to 192.168.87.64 (port 32768) 2007-09-14 11:03:40,390 DEBUG group.bottom.GroupBottomSession Sending message (isSend=false) [EMAIL PROTECTED] 2007-09-14 11:03:40,391 DEBUG protocols.udpsimple.UdpSimpleSession :handleSendable: [EMAIL PROTECTED] 2007-09-14 11:03:40,391 DEBUG protocols.udpsimple.UdpSimpleSession :formatAndSend: 248 bytes datagram sent to 192.168.87.64 (port 32768) 2007-09-14 11:03:40,391 DEBUG group.bottom.GroupBottomSession Recived new view: group=-711498822 view=1146261631 2007-09-14 11:03:40,391 DEBUG group.heal.HealSession Sending Gossip to Server 2007-09-14 11:03:40,391 DEBUG total.token.TotalTokenSession Received new view with 1 members 2007-09-14 11:03:40,391 DEBUG total.token.TotalTokenSession I'll try to send some messages 2007-09-14 11:03:40,392 DEBUG continuent.hedera.appia Received new view:
version: 1
group: [Group:MATISSEDB]
id: [ViewID:2;[AppiaHedera@/192.168.87.54:32768]]
previous: [[ViewID:1;[AppiaHedera@/192.168.87.54:32768]],]
view: [[AppiaHedera@/192.168.87.54:32768],]
addresses: [/192.168.87.54:32768,]

2007-09-14 11:03:40,392 DEBUG group.heal.GossipOutSession placing my address /192.168.87.54:32768 on message to /224.1.1.5:45566 2007-09-14 11:03:40,392 DEBUG continuent.hedera.gms Appia reported new view:
version: 1
group: [Group:MATISSEDB]
id: [ViewID:2;[AppiaHedera@/192.168.87.54:32768]]
previous: [[ViewID:1;[AppiaHedera@/192.168.87.54:32768]],]
view: [[AppiaHedera@/192.168.87.54:32768],]
addresses: [/192.168.87.54:32768,]

2007-09-14 11:03:40,392 DEBUG protocols.udpsimple.UdpSimpleSession :handleSendable: [EMAIL PROTECTED] 2007-09-14 11:03:40,392 DEBUG continuent.hedera.gms Member(address=/ 192.168.87.64:32768, uid=192.168.87.64:32768) leaves Group (gid=MATISSEDB).

At this point, they are separated again.

So, in my eyes they see each other and join. What happen after that causes Sequoia to suspend the activity and separate the group again, I do not understand. Maybe you get some clues if you check the Sequoia debug logs before the LeaveEvent.

Hope this helps.

Regards,
--
Nuno Carvalho
University of Lisbon, Portugal
http://lasige.di.fc.ul.pt/~ncarvalho



_______________________________________________
Sequoia mailing list
[email protected]
https://forge.continuent.org/mailman/listinfo/sequoia

Reply via email to