Greetings,

I have a cluster with one resource group, containing two resources. If the second resource in the group can't start (the start op fails), I end up with one resource started, and the other stopped, i.e. we have the following sequence of events:

  1) res-1 start (succeeds)
  2) res-2 start (fails)
  3) res-2 stop  (succeeds)
  4) res-1 stop  (succeeds)
  5) res-1 start (succeeds)
  6) res-2 start (fails)
  7) ....and now we're stuck with res-1 started and res-2 stopped.
     No further starts/stops occur.

Given that resource groups behave otherwise like a set of co-located resources, I'd have expected that if res-2 continually failed to start, that res-1 would also be forced to stop.

I tried a similar test with two separate resources, with ordering & co-location constraints in place, and got the behaviour I expected:

  1) res-1 start (succeeds)
  2) res-2 start (fails)
  3) res-2 stop  (succeeds)
  4) res-1 stop  (succeeds)
  5) res-1 start (succeeds)
  6) res-2 start (fails)
  7) res-2 stop  (succeeds)
  8) res-1 stop  (succeeds)
  9) ....and now everything is stopped, because res-2 can't run.

I'm using heartbeat 2.1.3. Can anyone shed any light on likely reasons for this discrepancy in behaviour?

Thanks,

Tim
_______________________________________________
Linux-HA mailing list
[email protected]
http://lists.linux-ha.org/mailman/listinfo/linux-ha
See also: http://linux-ha.org/ReportingProblems

Reply via email to