Hi,

On Wed, Mar 26, 2008 at 02:38:00PM +0100, Danny Sternkopf wrote:
> Hi,
> 
> we have a simple config: (hav2)
> - 2 nodes <active-active>
> - 6 FILESYSTEM resources as one group + 1 Stonith resource on each node
>   (Use contraints to score them)
> - Default Resource stickiness is 0
> - Default Resource failover stickiness is 0
> 
> As we have seen in past is that if a filesystem has failed the whole
> group is moved to the other node and the failing node is stonith'ed due
> to the filesystem could not be unmounted properly.
> 
> But this filesystem could not be mounted on both nodes anymore. So the
> group was moved from one node to another and nodes got reset all the
> time.

If it was not possible to mount the filesystem then how/why did
the cluster try to unmount it? Also, if the filesystem's not
mounted then the stop operation should've succeeded. Or did you
see different behaviour?

> Start and Stop of the Filesystem always ended up with a Timeout
> (of 120 s).
> 
> How can we treat that issue within HA? What happens with a resource
> which can not run anymore? When will HA give up to run it?

Depends on the start-failure-is-fatal crm_config parameter. If
it's set to true, the CRM should give up after the first failed
start operation. Of course, in case the machine is rebooted it
will try again.

> My understanding is that default resource failover stickiness will not
> help us in such situation because the failcount gets reset due to node
> reset.
> 
> We will configure auto_failback off with setting a high value for
> default resource stickiness. We already removed the resource group and
> configured single resources instead.

The configuration should reflect real relations between
resources. If a set of resources must run on one node but there
are no other dependencies, then use an unordered group.

Any news?

Thanks,

Dejan

> 
> Best regards,
> 
> Danny
> -- 
> Danny Sternkopf http://www.nec.de/hpc       [EMAIL PROTECTED]
> HPCE Division  Germany phone: +49-711-68770-35 fax: +49-711-6877145
> ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
> NEC Deutschland GmbH, Hansaallee 101, 40549 D?sseldorf
> Gesch?ftsf?hrer Makoto Tsukakoshi
> Handelsregister D?sseldorf HRB 57941; VAT ID DE129424743
> 
> 
> _______________________________________________
> Linux-HA mailing list
> [email protected]
> http://lists.linux-ha.org/mailman/listinfo/linux-ha
> See also: http://linux-ha.org/ReportingProblems

-- 
Dejan
_______________________________________________
Linux-HA mailing list
[email protected]
http://lists.linux-ha.org/mailman/listinfo/linux-ha
See also: http://linux-ha.org/ReportingProblems

Reply via email to