Folks,
It's just not clear to me how to clear a resource and get it to restart
without clearing the CIB and
rebooting.
I have the following setup:
debian lenny, 2.6.21.7-2.fc8xen, amazon AMI, no serial, heartbeat
2.1.3-6lenny1.
Crm_mon shows:
Resource Group: openvpn_nfs_lvm_ebs
ebs-1 (aws::ocf:ebs): Started ip-10-244-47-99
lvm-1 (heartbeat::ocf:LVM): Started ip-10-244-47-99
nfs-1 (heartbeat::ocf:Filesystem): Started ip-10-244-47-99
openvpn-1 (lsb:openvpn): Started ip-10-244-47-99
Failed actions:
ebs-1_start_0 (node=ip-164, call=6, rc=1): complete
Because I had a script bug while I was starting my disk resource (ebs-1).
# crm_failcount -G -U ip-164 -r ebs-1
name=fail-count-ebs-1 value=1
# crm_failcount -D -U ip-164 -r ebs-1
crm_failcount -G -U ip-164 -r ebs-1
name=fail-count-ebs-1 value=0
And yet, when I
crm_resource -r openvpn_nfs_lvm_ebs -M -H ip-164
the resource does not move and daemon.log shows:
Aug 6 01:14:36 ip-164 crm_resource: [1199]: ERROR: unpack_rsc_op: Remapping
ebs-1_start_0 (rc=1) on ip-164 to an ERROR
Aug 6 01:14:36 ip-164 crm_resource: [1199]: WARN: unpack_rsc_op: Processing
failed op ebs-1_start_0 on ip-164: Error
Aug 6 01:14:36 ip-164 crm_resource: [1199]: WARN: unpack_rsc_op:
Compatability handling for failed op ebs-1_start_0 on ip-164
Aug 6 01:14:36 ip-164 crm_resource: [1199]: WARN: main: here i am - 3
Questions:
1) How do I restart a failed resource, after I fix a script or other
problem?
2) Should this have happened?
Thanks, Brian
_______________________________________________
Linux-HA mailing list
[email protected]
http://lists.linux-ha.org/mailman/listinfo/linux-ha
See also: http://linux-ha.org/ReportingProblems