On 05/08/2013, at 5:20 PM, Thomas Glanzmann <[email protected]> wrote:

> Hello Andrew,
> 
>> Any change to the configuration section is automatically written to
>> disk.  The cluster only stops doing this if writing to disk fails at
>> some point - but there would have been an error in your logs if that
>> were the case.
> 
> Then I do not get it. Yesterday, when the nodes committed suicide, I lost
> 24 hours of configuration,

Did they ensure everything was flushed to disk first?

> so I looked in /var/lib/heartbeat/crm and there

That's not where recent versions of Pacemaker keep the CIB by default.
Check /var/lib/pacemaker/cib too.
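
A quick way to check both locations at once is sketched below. It only
uses the two directories mentioned in this thread; the file name
"cib.xml" and the helper name find_cib_files are assumptions for
illustration, so adjust them if your installation differs.

```python
from pathlib import Path

# Directories named in this thread. The file name "cib.xml" is an assumption.
CANDIDATE_DIRS = ("/var/lib/pacemaker/cib", "/var/lib/heartbeat/crm")


def find_cib_files(dirs=CANDIDATE_DIRS, name="cib.xml"):
    """Return (path, mtime) pairs for every existing CIB file, newest first."""
    found = []
    for d in dirs:
        candidate = Path(d) / name
        # Missing directories or files are simply skipped.
        if candidate.is_file():
            found.append((candidate, candidate.stat().st_mtime))
    return sorted(found, key=lambda item: item[1], reverse=True)


if __name__ == "__main__":
    from datetime import datetime

    hits = find_cib_files()
    if not hits:
        print("no cib.xml found in", ", ".join(CANDIDATE_DIRS))
    for path, mtime in hits:
        # The modification time shows when the cluster last wrote the file.
        print(path, datetime.fromtimestamp(mtime).isoformat(timespec="seconds"))
```

If the newest file is older than your last configuration change, that
would suggest the write to disk did not happen.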

> was no XML file and I changed the configuration many times, but three
> resource groups were gone:
> 
> apache-03-fencing       (stonith:external/ipmi):        Started apache-04
> apache-04-fencing       (stonith:external/ipmi):        Started apache-03
> Resource Group: routing
>     router_ipv4        (ocf::heartbeat:IPaddr2):       Started apache-03
>     router_ipv6        (ocf::heartbeat:IPv6addr):      Started apache-03
>     openvpn_ipv4       (ocf::heartbeat:IPaddr2):       Started apache-03
>     router_ipv6_transfer       (ocf::heartbeat:IPv6addr):      Started apache-03
>     openvpn_glanzmann  (ocf::heartbeat:openvpn):       Started apache-03
>     openvpn_ipxechange (ocf::heartbeat:openvpn):       Started apache-03
>     openvpn_eclogic    (ocf::heartbeat:openvpn):       Started apache-03
>     openvpn_einwahl    (ocf::heartbeat:openvpn):       Started apache-03
> Resource Group: nfs
>     gcl_fs     (ocf::heartbeat:Filesystem):    Started apache-04
>     nfs-common (ocf::heartbeat:nfs-common):    Started apache-04
>     nfs-kernel-server  (ocf::heartbeat:nfs-kernel-server):     Started apache-04
>     nfs_ipv4   (ocf::heartbeat:IPaddr2):       Started apache-04
> Master/Slave Set: ma-ms-drbd0 [drbd0]
>     Masters: [ apache-04 ]
>     Slaves: [ apache-03 ]
> Resource Group: apache
>     eccar_ipv4 (ocf::heartbeat:IPaddr2):       Started apache-04
>     apache_loadbalancer        (lsb:apache2):  Started apache-04
> Master/Slave Set: ma-ms-drbd1 [drbd1]
>     Masters: [ apache-04 ]
>     Slaves: [ apache-03 ]
> Resource Group: mail
>     postfix_fs (ocf::heartbeat:Filesystem):    Started apache-04
>     postfix_ipv4       (ocf::heartbeat:IPaddr2):       Started apache-04
>     spamass    (lsb:spamass-milter):   Started apache-04
>     clamav     (lsb:clamav-daemon):    Started apache-04
>     postgrey   (lsb:postgrey): Started apache-04
>     dovecot    (lsb:dovecot):  Started apache-04
>     postfix    (ocf::heartbeat:postfix):       Started apache-04
> 
> This is my cluster. The mail group was gone, drbd1 was gone, apache was
> gone, and some resources of the routing group were missing. All of these
> changes were committed in the last 24 hours. After the suicide I grepped
> in /var/lib/heartbeat/crm and they had not been saved.
> 
> I have now rebooted both nodes and manually exported the configuration
> to be on the very safe side.
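
A manual export like that can also be scripted. The sketch below assumes
the cibadmin tool is on the PATH and that "cibadmin --query" prints the
current CIB as XML on stdout. The function name export_cib and its "run"
parameter are hypothetical and exist only so the helper can be exercised
without a running cluster.

```python
import subprocess
from pathlib import Path


def export_cib(dest, run=subprocess.run):
    """Write the output of `cibadmin --query` to dest and return the path."""
    # check=True raises CalledProcessError if cibadmin exits non-zero,
    # so a failed query never overwrites dest with an empty file.
    result = run(["cibadmin", "--query"], capture_output=True, text=True, check=True)
    dest = Path(dest)
    dest.write_text(result.stdout)
    return dest
```

Calling export_cib("cib-backup.xml") on a cluster node would then leave a
copy of the live configuration in the current directory.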
> 
> I'll collect the log files and provide them. crm_report doesn't work for
> me, probably because my syslog location is non-default.
> 
> Cheers,
>        Thomas
> _______________________________________________
> Linux-HA mailing list
> [email protected]
> http://lists.linux-ha.org/mailman/listinfo/linux-ha
> See also: http://linux-ha.org/ReportingProblems
