I've been experimented with failover on a cluster of two brokers, and I often 
see this log item when a broker fails:

2009-mar-04 17:11:19 debug Exception constructed: Attempted size underflow on 
dequeue(21): size: max=104857600, current=0; count: unli
mited; type=flow_to_disk (qpid/broker/QueuePolicy.cpp:54)

What does underflow mean here?

The broker seems to have died:

[prat...@hsvrhm5 qpidd]$ sudo /sbin/service qpidd status
qpidd dead but pid file exists

The test I was running failed over to the other broker and completed after a 
timeout expired.

A subsequent test immediately failed over to the other broker and completed 
(which makes sense because qpid on the first broker was probably dead before it 
started).

In a general sense, what are the steps required to recover from a broker 
failure?  What I am looking for is step #3 below:

Assume a cluster of two brokers, A and B
1) A dies
2) clients fail over to B
3) do something to recover A without interrupting clients of B
4) A and B are again interchangeable

I've looked through the docs and haven't seen anything about this.  Apologies 
if I missed it.  I also tried simply restarting A, which doesn't seem to work.

Thanks,

Sandy

Reply via email to