Re: [ClusterLabs] Regular pengine warnings after a transient failure

2016-03-08 Thread Ferenc Wágner
Ken Gaillot writes: > On 03/07/2016 02:03 PM, Ferenc Wágner wrote: > >> The transition-keys match, does this mean that the above is a late >> result from the monitor operation which was considered timed-out >> previously? How did it reach vhbl07, if the DC at that time was vhbl03? >> >>> The pe

Re: [ClusterLabs] Regular pengine warnings after a transient failure

2016-03-08 Thread Ferenc Wágner
Andrew Beekhof writes: > On Tue, Mar 8, 2016 at 7:03 AM, Ferenc Wágner wrote: > >> Ken Gaillot writes: >> >>> On 03/07/2016 07:31 AM, Ferenc Wágner wrote: >>> 12:55:13 vhbl07 crmd[8484]: notice: Transition aborted by vm-eiffel_monitor_6 'create' on vhbl05: Foreign event (ma

Re: [ClusterLabs] Regular pengine warnings after a transient failure

2016-03-07 Thread Ken Gaillot
On 03/07/2016 02:03 PM, Ferenc Wágner wrote: > Ken Gaillot writes: > >> On 03/07/2016 07:31 AM, Ferenc Wágner wrote: >> >>> 12:55:13 vhbl07 crmd[8484]: notice: Transition aborted by >>> vm-eiffel_monitor_6 'create' on vhbl05: Foreign event >>> (magic=0:0;521:0:0:634eef05-39c1-4093-94d4-8d62

Re: [ClusterLabs] Regular pengine warnings after a transient failure

2016-03-07 Thread Andrew Beekhof
On Tue, Mar 8, 2016 at 7:03 AM, Ferenc Wágner wrote: > Ken Gaillot writes: > > > On 03/07/2016 07:31 AM, Ferenc Wágner wrote: > > > >> 12:55:13 vhbl07 crmd[8484]: notice: Transition aborted by > vm-eiffel_monitor_6 'create' on vhbl05: Foreign event > (magic=0:0;521:0:0:634eef05-39c1-4093-94d

Re: [ClusterLabs] Regular pengine warnings after a transient failure

2016-03-07 Thread Ferenc Wágner
Ken Gaillot writes: > On 03/07/2016 07:31 AM, Ferenc Wágner wrote: > >> 12:55:13 vhbl07 crmd[8484]: notice: Transition aborted by >> vm-eiffel_monitor_6 'create' on vhbl05: Foreign event >> (magic=0:0;521:0:0:634eef05-39c1-4093-94d4-8d624b423bb7, cib=0.613.98, >> source=process_graph_even

Re: [ClusterLabs] Regular pengine warnings after a transient failure

2016-03-07 Thread Ken Gaillot
On 03/07/2016 07:31 AM, Ferenc Wágner wrote: > Hi, > > A couple of days ago the nodes of our Pacemaker 1.1.14 cluster > (vhbl0[3-7]) experienced temporary storage outage, leading to processes > stucking randomly for a couple of minutes and big load spikes. There > were 30 monitor operation timeou

[ClusterLabs] Regular pengine warnings after a transient failure

2016-03-07 Thread Ferenc Wágner
Hi, A couple of days ago the nodes of our Pacemaker 1.1.14 cluster (vhbl0[3-7]) experienced temporary storage outage, leading to processes stucking randomly for a couple of minutes and big load spikes. There were 30 monitor operation timeouts altogether on vhbl05, and an internal error on the DC.