Hi,

We have a simple configuration (HA v2):
- 2 nodes <active-active>
- 6 FILESYSTEM resources as one group + 1 Stonith resource on each node
  (we use constraints to score them)
- Default Resource stickiness is 0
- Default Resource failover stickiness is 0

As we have seen in the past, when a filesystem fails the whole group is
moved to the other node, and the failing node is STONITH'ed because the
filesystem could not be unmounted properly.

In this case, however, the filesystem could no longer be mounted on
either node. So the group kept moving from one node to the other, and
the nodes were reset over and over. Both start and stop of the
Filesystem resource always ended in a timeout (120 s).

How can we handle this situation within HA? What happens to a resource
that cannot run anymore? When will HA give up trying to run it?

My understanding is that the default resource failover stickiness will
not help us in such a situation, because the failcount gets reset when
the node is reset.
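
To make that concern concrete, here is a toy Python model of the
bookkeeping as I understand it. It is not Heartbeat code: the function
name simulate, the scoring rule (base score plus failcount times a
negative failover stickiness, with a node excluded once its score drops
below zero), and all numbers are assumptions made up for illustration.

```python
def simulate(max_rounds, failover_stickiness, base_score, reset_on_fence):
    """Toy model: return the nodes on which a start was attempted, in order.

    Assumed scoring rule (hypothetical, for illustration only):
        score(node) = base_score + failcount(node) * failover_stickiness
    A node may run the resource only while its score is >= 0.
    """
    nodes = ["node1", "node2"]
    failcount = {n: 0 for n in nodes}
    attempts = []
    last_failed = None

    for _ in range(max_rounds):
        eligible = [
            n for n in nodes
            if base_score + failcount[n] * failover_stickiness >= 0
        ]
        if not eligible:
            break  # no node may run the resource: the cluster gives up

        # Prefer a node other than the one that just failed, if there is one.
        candidates = [n for n in eligible if n != last_failed] or eligible
        node = candidates[0]
        attempts.append(node)

        failcount[node] += 1      # the start times out on this node
        if reset_on_fence:
            failcount[node] = 0   # stop times out, node is fenced, count is lost
        last_failed = node

    return attempts


if __name__ == "__main__":
    # Made-up values: base score 200, failover stickiness -100.
    print(len(simulate(10, -100, 200, reset_on_fence=False)))
    print(len(simulate(10, -100, 200, reset_on_fence=True)))
```

With these made-up values the model gives up after six start attempts
when the failcount survives a node reset, but uses up all ten rounds
when every reset clears it. That is the endless move-and-reset loop
described above.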

We will configure auto_failback off and set a high value for the
default resource stickiness. We have already removed the resource group
and configured single resources instead.

Best regards,

Danny
--
Danny Sternkopf http://www.nec.de/hpc       [EMAIL PROTECTED]
HPCE Division  Germany phone: +49-711-68770-35 fax: +49-711-6877145
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
NEC Deutschland GmbH, Hansaallee 101, 40549 Düsseldorf
Geschäftsführer Makoto Tsukakoshi
Handelsregister Düsseldorf HRB 57941; VAT ID DE129424743


_______________________________________________
Linux-HA mailing list
[email protected]
http://lists.linux-ha.org/mailman/listinfo/linux-ha
See also: http://linux-ha.org/ReportingProblems
