Check the heartbeat logs, though -- they should tell you exactly why it
failed. Set "debugfile /var/log/ha-debug" in /etc/ha.d/ha.cf and look
through that log while or after it tries to start.
In fact, the node that is supposed to release the shared resources keeps
the DRBD resource locked, which leads to a reboot. How can I investigate
what is locking the DRBD resource?
This is my syslog file:
Feb 25 22:44:33 secondaire heartbeat: debug: /etc/init.d/monit-Inet-Secondaire stop done. RC=0
Feb 25 22:44:33 secondaire heartbeat: info: Running /etc/ha.d/resource.d/MailTo r...@localhost InetCluster stop
Feb 25 22:44:33 secondaire heartbeat: debug: Starting /etc/ha.d/resource.d/MailTo r...@localhost InetCluster stop
Feb 25 22:44:33 secondaire heartbeat: debug: /etc/ha.d/resource.d/MailTo r...@localhost InetCluster stop done. RC=0
Feb 25 22:44:33 secondaire heartbeat: info: Running /etc/ha.d/resource.d/Filesystem /dev/drbd0 /data ext3 stop
Feb 25 22:44:33 secondaire heartbeat: debug: Starting /etc/ha.d/resource.d/Filesystem /dev/drbd0 /data ext3 stop
Feb 25 22:44:33 secondaire heartbeat: ERROR: Couldn't unmount /data
Feb 25 22:44:33 secondaire heartbeat: debug: /etc/ha.d/resource.d/Filesystem /dev/drbd0 /data ext3 stop done. RC=1
Feb 25 22:44:33 secondaire heartbeat: ERROR: Return code 1 from /etc/ha.d/resource.d/Filesystem
...
Feb 25 22:44:35 secondaire heartbeat: info: Retrying failed stop operation [Filesystem::/dev/drbd0::/data::ext3]
Feb 25 22:44:35 secondaire heartbeat: info: Running /etc/ha.d/resource.d/Filesystem /dev/drbd0 /data ext3 stop
Feb 25 22:44:35 secondaire heartbeat: debug: Starting /etc/ha.d/resource.d/Filesystem /dev/drbd0 /data ext3 stop
Feb 25 22:44:35 secondaire heartbeat: ERROR: Couldn't unmount /data
Feb 25 22:44:35 secondaire heartbeat: debug: /etc/ha.d/resource.d/Filesystem /dev/drbd0 /data ext3 stop done. RC=1
Feb 25 22:44:35 secondaire heartbeat: ERROR: Return code 1 from /etc/ha.d/resource.d/Filesystem
Feb 25 22:44:43 secondaire heartbeat: info: Retrying failed stop operation [Filesystem::/dev/drbd0::/data::ext3]
Feb 25 22:44:43 secondaire heartbeat: info: Running /etc/ha.d/resource.d/Filesystem /dev/drbd0 /data ext3 stop
Feb 25 22:44:43 secondaire heartbeat: debug: Starting /etc/ha.d/resource.d/Filesystem /dev/drbd0 /data ext3 stop
Feb 25 22:44:43 secondaire heartbeat: ERROR: Couldn't unmount /data
Feb 25 22:44:43 secondaire heartbeat: debug: /etc/ha.d/resource.d/Filesystem /dev/drbd0 /data ext3 stop done. RC=1
Feb 25 22:44:43 secondaire heartbeat: ERROR: Return code 1 from /etc/ha.d/resource.d/Filesystem
Feb 25 22:44:43 secondaire heartbeat: CRIT: Resource STOP failure. Reboot required!
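
The "Couldn't unmount /data" errors above suggest that some process still
has a file or working directory on /data when heartbeat tries to stop the
Filesystem resource. Running "fuser -vm /data" or "lsof /data" just before
the failover normally lists such processes. As a rough sketch of the same
idea, and not something Heartbeat or DRBD provides, the script below walks
/proc and reports every process whose cwd, root, exe or open file
descriptors point under a mount point. The function name find_holders and
its proc_root parameter are made up for illustration. It does not see
memory-mapped files or kernel-level users such as an NFS export, and it
needs root to inspect other users' processes.

```python
import os


def find_holders(mount_point, proc_root="/proc"):
    """Return (pid, kind, target) for each process reference under mount_point.

    Hypothetical helper: it only follows the cwd, root, exe and fd symlinks
    in /proc, so mmap'ed files and in-kernel users are not detected.
    """
    mount_point = os.path.abspath(mount_point)
    prefix = mount_point.rstrip("/") + "/"
    holders = []
    for entry in os.listdir(proc_root):
        if not entry.isdigit():
            continue  # skip non-process entries such as "self" or "meminfo"
        pid_dir = os.path.join(proc_root, entry)
        links = [(name, os.path.join(pid_dir, name))
                 for name in ("cwd", "root", "exe")]
        fd_dir = os.path.join(pid_dir, "fd")
        try:
            for fd in os.listdir(fd_dir):
                links.append(("fd " + fd, os.path.join(fd_dir, fd)))
        except OSError:
            pass  # process exited meanwhile, or we lack permission
        for kind, link in links:
            try:
                target = os.readlink(link)
            except OSError:
                continue  # link missing or unreadable
            # Match the mount point itself or anything below it, but not
            # sibling paths that merely share the prefix (e.g. /database).
            if target == mount_point or target.startswith(prefix):
                holders.append((int(entry), kind, target))
    return holders
```

Calling find_holders("/data") on the node that is about to give up the
resources, right before the stop is attempted, should show which PIDs need
to be stopped earlier in the haresources order.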
Thanks,