Hello Andrew,
> Any change to the configuration section is automatically written to
> disk. The cluster only stops doing this if writing to disk fails at
> some point - but there would have been an error in your logs if that
> were the case.
Then I do not get it. Yesterday, when the nodes committed suicide, I lost
24 hours of configuration changes. I looked in /var/lib/heartbeat/crm and
there was no XML file, even though I had changed the configuration many
times, and three resource groups were gone:
apache-03-fencing (stonith:external/ipmi): Started apache-04
apache-04-fencing (stonith:external/ipmi): Started apache-03
Resource Group: routing
    router_ipv4 (ocf::heartbeat:IPaddr2): Started apache-03
    router_ipv6 (ocf::heartbeat:IPv6addr): Started apache-03
    openvpn_ipv4 (ocf::heartbeat:IPaddr2): Started apache-03
    router_ipv6_transfer (ocf::heartbeat:IPv6addr): Started apache-03
    openvpn_glanzmann (ocf::heartbeat:openvpn): Started apache-03
    openvpn_ipxechange (ocf::heartbeat:openvpn): Started apache-03
    openvpn_eclogic (ocf::heartbeat:openvpn): Started apache-03
    openvpn_einwahl (ocf::heartbeat:openvpn): Started apache-03
Resource Group: nfs
    gcl_fs (ocf::heartbeat:Filesystem): Started apache-04
    nfs-common (ocf::heartbeat:nfs-common): Started apache-04
    nfs-kernel-server (ocf::heartbeat:nfs-kernel-server): Started apache-04
    nfs_ipv4 (ocf::heartbeat:IPaddr2): Started apache-04
Master/Slave Set: ma-ms-drbd0 [drbd0]
    Masters: [ apache-04 ]
    Slaves: [ apache-03 ]
Resource Group: apache
    eccar_ipv4 (ocf::heartbeat:IPaddr2): Started apache-04
    apache_loadbalancer (lsb:apache2): Started apache-04
Master/Slave Set: ma-ms-drbd1 [drbd1]
    Masters: [ apache-04 ]
    Slaves: [ apache-03 ]
Resource Group: mail
    postfix_fs (ocf::heartbeat:Filesystem): Started apache-04
    postfix_ipv4 (ocf::heartbeat:IPaddr2): Started apache-04
    spamass (lsb:spamass-milter): Started apache-04
    clamav (lsb:clamav-daemon): Started apache-04
    postgrey (lsb:postgrey): Started apache-04
    dovecot (lsb:dovecot): Started apache-04
    postfix (ocf::heartbeat:postfix): Started apache-04
This is my cluster. The mail group was gone, drbd1 was gone, apache was
gone, and some resources of the routing group were missing. All of these
changes were committed in the last 24 hours. After the suicide I grepped
in /var/lib/heartbeat/crm and they had not been saved.
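For anyone who wants to repeat that check, here is a minimal sketch of how
it could be scripted. It only assumes that the on-disk configuration lives
as *.xml files in /var/lib/heartbeat/crm, as on my nodes. The helper names
list_cib_files and written_within are made up for this mail and are not
part of Heartbeat or Pacemaker.

```python
import time
from pathlib import Path

# Directory mentioned in this mail; adjust it if your cluster stores the
# configuration somewhere else.
CIB_DIR = Path("/var/lib/heartbeat/crm")


def list_cib_files(directory=CIB_DIR):
    """Return (path, mtime) pairs for the *.xml files in directory, newest first.

    Hypothetical helper for illustration, not a Heartbeat/Pacemaker API.
    """
    directory = Path(directory)
    if not directory.is_dir():
        return []
    files = [(p, p.stat().st_mtime) for p in directory.glob("*.xml")]
    return sorted(files, key=lambda item: item[1], reverse=True)


def written_within(directory=CIB_DIR, hours=24, now=None):
    """Return the *.xml files that were modified during the last `hours` hours."""
    now = time.time() if now is None else now
    cutoff = now - hours * 3600
    return [p for p, mtime in list_cib_files(directory) if mtime >= cutoff]
```

If written_within() returns an empty list although the configuration was
changed during that period, nothing was written to disk in that window,
which is what I saw.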
I have now rebooted both nodes and manually exported the configuration to
be on the safe side.
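A manual export like this could be automated roughly as follows. This is
only a sketch: it assumes that "cibadmin --query" prints the current
configuration as XML on stdout, and the function name export_cib and the
timestamped file name are my own choices, not anything the cluster stack
prescribes.

```python
import subprocess
import time
from pathlib import Path


def export_cib(dest_dir, command=("cibadmin", "--query")):
    """Run `command` and save its stdout to a timestamped file in dest_dir.

    Hypothetical helper for illustration. The default command assumes that
    `cibadmin --query` dumps the live configuration as XML.
    """
    dest_dir = Path(dest_dir)
    dest_dir.mkdir(parents=True, exist_ok=True)
    # check=True raises CalledProcessError if the command exits non-zero.
    result = subprocess.run(list(command), capture_output=True, text=True, check=True)
    if not result.stdout.strip():
        # Refuse to write an empty backup that would only look like a good one.
        raise RuntimeError("export command produced no output")
    target = dest_dir / time.strftime("cib-%Y%m%d-%H%M%S.xml")
    target.write_text(result.stdout)
    return target
```

Calling export_cib() with a backup directory of your choice, for example
from cron, would keep a copy that does not depend on the cluster writing
its own files.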
I'll collect the log files and provide them. crm_report doesn't work for
me, probably because my syslog location is non-default.
Cheers,
Thomas
_______________________________________________
Linux-HA mailing list
[email protected]
http://lists.linux-ha.org/mailman/listinfo/linux-ha
See also: http://linux-ha.org/ReportingProblems