Hi,

we have a simple config (hav2):

- 2 nodes, active-active
- 6 Filesystem resources as one group, plus 1 STONITH resource on each node (placed using constraints with scores)
- Default resource stickiness is 0
- Default resource failure stickiness is 0

In the past we have seen that when a filesystem fails, the whole group is moved to the other node and the failing node is STONITH'ed because the filesystem could not be unmounted properly. In this case, however, the filesystem could no longer be mounted on either node. So the group kept moving from one node to the other, and the nodes were reset over and over. Start and stop of the Filesystem resource always ended in a timeout (120 s).

How can we handle this issue within HA? What happens to a resource that cannot run anymore? When will HA give up trying to run it?

My understanding is that the default resource failure stickiness will not help us in this situation, because the failcount is reset when the node is reset.

We will get the effect of auto_failback off by setting a high value for the default resource stickiness. We have already removed the resource group and configured the filesystems as single resources instead.

Best regards,
Danny

--
Danny Sternkopf
http://www.nec.de/hpc
HPCE Division Germany
phone: +49-711-68770-35
fax: +49-711-6877145
NEC Deutschland GmbH, Hansaallee 101, 40549 Düsseldorf
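
P.S. To make the loop described above concrete, here is a minimal toy model in Python. It is only a sketch of our reasoning, not Heartbeat's actual placement logic: the function name `simulate`, the `give_up_after` threshold, and the rule "stop on a node once its failcount reaches the threshold" are all assumptions made for illustration. It models a resource whose start and stop always time out, so every attempt ends with the node being fenced.

```python
def simulate(nodes, max_rounds, failcount_survives_reset, give_up_after=3):
    """Toy model of a resource that cannot start or stop on any node.

    Hypothetical rules (assumptions, not Heartbeat internals):
    - each start attempt times out and bumps the node's failcount
    - the following stop also times out, so the node is fenced (reset)
    - a node is no longer tried once its failcount reaches give_up_after
    Returns (number_of_node_resets, gave_up).
    """
    failcount = {n: 0 for n in nodes}
    resets = 0
    while resets < max_rounds:
        # Nodes that are still allowed to try running the resource.
        eligible = [n for n in nodes if failcount[n] < give_up_after]
        if not eligible:
            break
        node = eligible[resets % len(eligible)]
        failcount[node] += 1      # start timed out
        resets += 1               # stop timed out, node gets fenced
        if not failcount_survives_reset:
            failcount[node] = 0   # failcount is lost with the reset
    gave_up = all(failcount[n] >= give_up_after for n in nodes)
    return resets, gave_up


if __name__ == "__main__":
    print(simulate(["node1", "node2"], 20, failcount_survives_reset=False))
    print(simulate(["node1", "node2"], 20, failcount_survives_reset=True))
```

In this model, when the failcount is lost on every reset the cluster never gives up and simply uses up all allowed rounds, which matches what we observed. If the failcount survived the reset, the resource would be left stopped after a bounded number of resets.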
