Am Donnerstag, 24. Oktober 2013, 11:07:10 schrieb Karl Rößmann: > Hi, > > we have a two node HA cluster using SuSE SlES 11 HA Extension SP3 > > For some reason there was heavy I/O load on both nodes yesterday. > and one of the nodes went down. (Which was a serious problem)
Then you should thick about your system design. A cluster should be designed to provide the service even if one server fails. > Maybe we have to change a timeout value ? Depends why your node went down. Please see the logs for the reason, the node wnt down. If you found out the reason then you can optimize your cluster configuration. -- Mit freundlichen Grüßen, Michael Schwartzkopff -- [*] sys4 AG http://sys4.de, +49 (89) 30 90 46 64, +49 (162) 165 0044 Franziskanerstraße 15, 81669 München Sitz der Gesellschaft: München, Amtsgericht München: HRB 199263 Vorstand: Patrick Ben Koetter, Axel von der Ohe, Marc Schiffbauer Aufsichtsratsvorsitzender: Florian Kirstein
signature.asc
Description: This is a digitally signed message part.
_______________________________________________ Pacemaker mailing list: [email protected] http://oss.clusterlabs.org/mailman/listinfo/pacemaker Project Home: http://www.clusterlabs.org Getting started: http://www.clusterlabs.org/doc/Cluster_from_Scratch.pdf Bugs: http://bugs.clusterlabs.org
