[jira] Commented: (QPID-2983) Broker in cluster goes down

Gordon Sim (JIRA) Mon, 28 Feb 2011 09:34:04 -0800

    [ 
https://issues.apache.org/jira/browse/QPID-2983?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13000418#comment-13000418
 ]


Gordon Sim commented on QPID-2983:
----------------------------------

This sounds like an issue with the underlying CPG. Perhaps turning on debug 
logging there would shed some light? Or perhaps test the routing of multicast 
packets between the two nodes?
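
One way to check multicast routing independently of corosync and qpidd is a small send/receive probe on the same group address. The following is only a sketch under stated assumptions: the group 226.94.32.36 is the mcastaddr from the corosync config quoted below, while the port 15405 and the function names are made up for illustration (the port is chosen so it does not collide with corosync's own mcastport).

```python
import socket
import struct

GROUP = "226.94.32.36"  # mcastaddr taken from the quoted corosync config
PORT = 15405            # assumption: any spare UDP port, NOT corosync's 5405


def membership_request(group, iface="0.0.0.0"):
    """Build the 8-byte ip_mreq structure used to join a multicast group."""
    return struct.pack("4s4s", socket.inet_aton(group), socket.inet_aton(iface))


def listen(group=GROUP, port=PORT, timeout=10.0):
    """Join the group and return (data, sender) for the first datagram,
    or None if nothing arrives before the timeout."""
    sock = socket.socket(socket.AF_INET, socket.SOCK_DGRAM, socket.IPPROTO_UDP)
    try:
        sock.setsockopt(socket.SOL_SOCKET, socket.SO_REUSEADDR, 1)
        sock.bind(("", port))
        sock.setsockopt(socket.IPPROTO_IP, socket.IP_ADD_MEMBERSHIP,
                        membership_request(group))
        sock.settimeout(timeout)
        try:
            return sock.recvfrom(1024)
        except socket.timeout:
            return None
    finally:
        sock.close()


def send_probe(payload=b"mcast-probe", group=GROUP, port=PORT, ttl=1):
    """Send one datagram to the group; ttl=1 keeps it on the local subnet."""
    sock = socket.socket(socket.AF_INET, socket.SOCK_DGRAM, socket.IPPROTO_UDP)
    try:
        sock.setsockopt(socket.IPPROTO_IP, socket.IP_MULTICAST_TTL, ttl)
        sock.sendto(payload, (group, port))
    finally:
        sock.close()
```

Running listen() on one broker host and send_probe() on the other, then swapping roles, would show whether multicast traffic passes in both directions. A None result from listen() suggests the packets are being dropped somewhere (firewall, switch, or routing), though it does not by itself prove that this is what takes the broker down.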

> Broker in cluster goes down
> ---------------------------
>
>                 Key: QPID-2983
>                 URL: https://issues.apache.org/jira/browse/QPID-2983
>             Project: Qpid
>          Issue Type: Bug
>          Components: C++ Broker, Dot Net Client
>    Affects Versions: 0.8
>         Environment: broker on CentOS 5.3,
> client on Windows 7
>            Reporter: Adam
>
> I have 2 .NET clients. They send messages to a fanout or direct exchange, and 
> both consume the messages that were sent. From time to time one of the brokers 
> goes down with this message in the log:
> 2010-12-20 23:54:19 debug 
> [email protected]: receiver marked 
> completed: 32 incomplete: { } unknown-completed: { [1,32] }
> 2010-12-20 23:54:19 trace cluster(192.168.44.135:3927 READY) DLVR 6058:  
> Frame[BEbe; channel=0; {ClusterConnectionDeliverDoOutputBody: limit=2048; }] 
> control 192.168.44.134:457-267
> 2010-12-20 23:54:19 debug Sufficient credit for 
> 15c72e661-1a54d-14663-1b3cf-1dfc4e7d90e0c on 
> [email protected], have bytes: 4294967295 
> msgs: 9884, need 147 bytes
> 2010-12-20 23:54:19 debug Credit allocated for 
> 15c72e661-1a54d-14663-1b3cf-1dfc4e7d90e0c on 
> [email protected], was  bytes: 4294967295 
> msgs: 9884 now bytes: 4294967295 msgs: 9883
> 2010-12-20 23:54:19 trace 
> [email protected]: sent cmd 116: 
> {MessageTransferBody: destination=15c72e661-1a54d-14663-1b3cf-1dfc4e7d90e0c; 
> accept-mode=0; acquire-mode=0; }
> 2010-12-20 23:54:19 trace 
> [email protected]: sent cmd 116: header 
> (84 bytes); properties={{MessageProperties: 
> message-id=49568fee-5825-49b6-beb7-7ca4ee942b44; 
> content-type=SerializableObject; content-encoding=SerializableObject; 
> application-headers={}; }{DeliveryProperties: exchange=test; routing-key=; }}
> 2010-12-20 23:54:19 trace 
> [email protected]: sent cmd 116: content 
> (63 bytes) 
> \x00\x01\x00\x00\x00\xFF\xFF\xFF\xFF\x01\x00\x00\x00\x00\x00\x00\x00\x06\x01\x00\x00\x00'Message
>  I...
> 2010-12-20 23:54:19 debug No messages to dispatch on queue 
> '15c72e661-1a54d-14663-1b3cf-1dfc4e7d90e0c'
> 2010-12-20 23:54:19 trace cluster(192.168.44.135:3927 READY) DLVR 6059:  
> Frame[BEbe; channel=1; {MessageAcceptBody: transfers={ [116,116] }; }] data 
> 192.168.44.134:457-267 read-credit=1
> 2010-12-20 23:54:19 trace 
> [email protected]: recv cmd 123: 
> {MessageAcceptBody: transfers={ [116,116] }; }
> 2010-12-20 23:54:19 debug DeliveryRecord::setEnded() id=116
> 2010-12-20 23:54:19 debug Accepted 116
> 2010-12-20 23:54:19 debug 
> [email protected]: receiver marked 
> completed: 123 incomplete: { } unknown-completed: { [1,123] }
> 2010-12-20 23:54:20 trace Sending cluster timer wakeup 
> ManagementAgent::periodicProcessing
> 2010-12-20 23:54:20 trace MCAST Event[192.168.44.135:3927-0 Frame[BEbe; 
> channel=0; {ClusterTimerWakeupBody: name=ManagementAgent::periodicProcessing; 
> }]]
> 2010-12-20 23:54:23 warning LinkRegistry timer woken up 2998ms late
> 2010-12-20 23:54:25 debug Exception constructed: Cannot mcast to CPG group 
> klaster: library (2)
> 2010-12-20 23:54:25 critical Multicast error: Cannot mcast to CPG group 
> klaster: library (2)
> 2010-12-20 23:54:25 notice cluster(192.168.44.135:3927 LEFT) leaving cluster 
> klaster
> 2010-12-20 23:54:25 trace SEND raiseEvent (v1) 
> class=org.apache.qpid.broker.clientDisconnect
> 2010-12-20 23:54:25 trace SEND raiseEvent (v1) 
> class=org.apache.qpid.broker.clientDisconnect
> 2010-12-20 23:54:25 trace SEND raiseEvent (v1) 
> class=org.apache.qpid.broker.clientDisconnect
> 2010-12-20 23:54:25 debug Shutting down CPG
> 2010-12-20 23:54:26 notice Shut down 
> I have 2 C++ brokers running in a cluster. 
> I always start them with:
> qpidd --auth no --trace --log-to-file /var/log/QPIDLOG.log --daemon 
> --cluster-name=klaster
> Corosync config:
> totem {
>         version: 2
>         secauth: off
>         threads: 0
>         interface {
>                 ringnumber: 0
>                 ## You must change this address ##
>                 bindnetaddr: 192.168.44.0
>                 mcastaddr: 226.94.32.36
>                 mcastport: 5405
>         }
> }
> logging {
>         debug: off
>         timestamp: on
>         to_file: yes
>         logfile: /tmp/aisexec.log
> }
> amf {
>         mode: disabled
> }
> Log from corosync:
> Dec 20 21:56:47 corosync [MAIN  ] Completed service synchronization, ready to 
> provide service.
> Dec 20 23:54:25 corosync [TOTEM ] Process pause detected for 3632 ms, 
> flushing membership messages.
> Dec 20 23:54:25 corosync [TOTEM ] A processor failed, forming new 
> configuration.
> Dec 20 23:54:27 corosync [TOTEM ] A processor joined or left the membership 
> and a new membership was formed.
> Dec 20 23:54:27 corosync [MAIN  ] Completed service synchronization, ready to 
> provide service.
> Dec 20 23:54:29 corosync [TOTEM ] A processor joined or left the membership 
> and a new membership was formed.
> Dec 20 23:54:29 corosync [MAIN  ] Completed service synchronization, ready to 
> provide service.
> Dec 21 00:03:04 corosync [TOTEM ] A processor failed, forming new 
> configuration.
> Dec 21 00:03:05 corosync [TOTEM ] A processor joined or left the membership 
> and a new membership was formed.
> Dec 21 00:03:05 corosync [MAIN  ] Completed service synchronization, ready to 
> provide service.
> Dec 21 00:13:06 corosync [TOTEM ] A processor failed, forming new 
> configuration.
> Dec 21 00:13:07 corosync [TOTEM ] A processor joined or left the membership 
> and a new membership was formed.
> Dec 21 00:13:07 corosync [MAIN  ] Completed service synchronization, ready to 
> provide service.
> Dec 21 03:13:24 corosync [TOTEM ] A processor failed, forming new 
> configuration.
> Dec 21 03:13:25 corosync [TOTEM ] A processor joined or left the membership 
> and a new membership was formed.
> Dec 21 03:13:25 corosync [TOTEM ] A processor joined or left the membership 
> and a new membership was formed.
> Dec 21 03:13:25 corosync [MAIN  ] Completed service synchronization, ready to 
> provide service.
> Dec 21 07:30:59 corosync [TOTEM ] Process pause detected for 5544 ms, 
> flushing membership messages.
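
The "Process pause detected" entries in the corosync log above can be pulled out and compared with the broker log; the one at Dec 20 23:54:25 carries the same timestamp as the broker's "Cannot mcast to CPG group" error. A minimal sketch for extracting them follows, assuming each entry is a single line in the real log file (the lines are wrapped in this mail). The function name find_pauses is hypothetical.

```python
import re

# Matches e.g.
# "Dec 20 23:54:25 corosync [TOTEM ] Process pause detected for 3632 ms, ..."
PAUSE_RE = re.compile(
    r"(?P<stamp>\w{3} +\d{1,2} \d{2}:\d{2}:\d{2}) corosync \[TOTEM\s*\] "
    r"Process pause detected for (?P<ms>\d+) ms"
)


def find_pauses(lines):
    """Return (timestamp, pause_ms) for every 'Process pause detected' line."""
    pauses = []
    for line in lines:
        match = PAUSE_RE.search(line)
        if match:
            pauses.append((match.group("stamp"), int(match.group("ms"))))
    return pauses


# Sample lines copied from the quoted log, unwrapped to one line each.
sample = [
    "Dec 20 21:56:47 corosync [MAIN  ] Completed service synchronization, "
    "ready to provide service.",
    "Dec 20 23:54:25 corosync [TOTEM ] Process pause detected for 3632 ms, "
    "flushing membership messages.",
    "Dec 20 23:54:25 corosync [TOTEM ] A processor failed, forming new "
    "configuration.",
    "Dec 21 07:30:59 corosync [TOTEM ] Process pause detected for 5544 ms, "
    "flushing membership messages.",
]
pauses = find_pauses(sample)
```

Pauses of several seconds, together with the broker's "LinkRegistry timer woken up 2998ms late" warning, would be consistent with the whole host stalling, but that is an inference from the logs rather than a confirmed cause.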

-- 
This message is automatically generated by JIRA.
-
For more information on JIRA, see: http://www.atlassian.com/software/jira

        

---------------------------------------------------------------------
Apache Qpid - AMQP Messaging Implementation
Project:      http://qpid.apache.org
Use/Interact: mailto:[email protected]
