Hello all,

I have configured a Heartbeat server on an Openfiler system with the crm option enabled. The primary server runs successfully without any crm resource problems. But when I try to test failover, it does not work: during the failover, the crm resources stop at the point of mounting the LVM filesystem. I checked the resources file, but I could not understand why it is stopping. I have given the system information below.

1) Output of the crm_mon command:

*[r...@gtt5 ~]# crm_mon*
============
Last updated: Fri Aug 21 17:26:02 2009
Current DC: gtt5.linux.com (7d892d6c-d277-45c2-beb6-331fca5b3920)
2 Nodes configured.
1 Resources configured.
============

Node: gtt5.linux.com (7d892d6c-d277-45c2-beb6-331fca5b3920): online
Node: gtt4.linux.com (87dc2dcc-791b-4bfb-a971-b30fbd909255): online

Resource Group: group_1
    open-iscsi_1 (lsb:open-iscsi): Started gtt5.linux.com
    MailTo_2 (heartbeat::ocf:MailTo): Started gtt5.linux.com
    IPaddr_192_168_2_20 (heartbeat::ocf:IPaddr): Started gtt5.linux.com
    drbddisk_4 (heartbeat:drbddisk): Started gtt5.linux.com
    LVM_5 (heartbeat::ocf:LVM): Started gtt5.linux.com
    Filesystem_6 (heartbeat::ocf:Filesystem): Started gtt5.linux.com
    MakeMounts_7 (heartbeat:MakeMounts): Started gtt5.linux.com
    *Filesystem_8 (heartbeat::ocf:Filesystem): Stopped*
    *nfs_9 (lsb:nfs): Stopped*
    *smb_10 (lsb:smb): Stopped*
    *acpid_11 (lsb:acpid): Stopped*
    *openfiler_12 (lsb:openfiler): Stopped*

*Failed actions:*
    *Filesystem_8_start_0 (node=gtt5.linux.com, call=32, rc=1): Error*
    *IPaddr_192_168_2_20_start_0 (node=gtt4.linux.com, call=56, rc=1): Error*

2) This is the status of the DRBD resources I have created:

*[r...@gtt5 ~]# service drbd status*
drbd driver loaded OK; device status:
version: 8.2.7 (api:88/proto:86-88)
GIT-hash: 61b7f4c2fc34fe3d2acf7be6bcc1fc2684708a7d build by p...@fat-tyre, 2008-11-12 16:47:11
m:res               cs         st                 ds                 p  mounted            fstype
0:cluster_metadata  Connected  Primary/Secondary  UpToDate/UpToDate  C  /cluster_metadata  ext3
1:vg0_drbd          Connected  Primary/Secondary  Diskless/UpToDate  C
[r...@gtt5 ~]

3) In the mount list there is no LVM mount point; only /cluster_metadata is mounted:

*[r...@gtt5 ~]# mount*
/dev/sda5 on / type ext3 (rw)
/proc on /proc type proc (rw)
/sys on /sys type sysfs (rw)
devpts on /dev/pts type devpts (rw,gid=5,mode=620)
/dev/sda7 on /boot type ext3 (rw)
tmpfs on /dev/shm type tmpfs (rw)
none on /proc/sys/fs/binfmt_misc type binfmt_misc (rw)
sunrpc on /var/lib/nfs/rpc_pipefs type rpc_pipefs (rw)
*/dev/drbd0 on /cluster_metadata type ext3 (rw,noatime)*
[r...@gtt5 ~]#

4) The system log on the secondary node is as follows:

crmd[3828]: 2009/08/21_17:20:32 info: process_lrm_event: LRM operation drbddisk_4_start_0 (call=45, rc=0) complete
tengine[3833]: 2009/08/21_17:20:32 info: match_graph_event: Action drbddisk_4_start_0 (20) confirmed on gtt5.linux.com (rc=0)
LVM[5307]: 2009/08/21_17:20:32 INFO: Activating volume group vg0_drbd
LVM[5307]: 2009/08/21_17:20:33 INFO: File descriptor 4 left open
File descriptor 5 left open
File descriptor 6 left open
File descriptor 7 left open
File descriptor 8 left open
File descriptor 9 left open
File descriptor 10 left open
File descriptor 11 left open
File descriptor 13 left open
File descriptor 14 left open
File descriptor 16 left open
Device '/dev/drbd1' has been left open.
Reading all physical volumes. This may take a while...
Found volume group "vg0_drbd" using metadata type lvm2
LVM[5307]: 2009/08/21_17:20:33 INFO: File descriptor 4 left open
File descriptor 5 left open
File descriptor 6 left open
File descriptor 7 left open
File descriptor 8 left open
File descriptor 9 left open
File descriptor 10 left open
File descriptor 11 left open
File descriptor 13 left open
File descriptor 14 left open
File descriptor 16 left open
1 logical volume(s) in volume group "vg0_drbd" now active
lrmd[3825]: 2009/08/21_17:20:33 info: RA output: (LVM_5:start:stderr) File descriptor 4 left open
lrmd[3825]: 2009/08/21_17:20:33 info: RA output: (LVM_5:start:stderr) File descriptor 5 left open
lrmd[3825]: 2009/08/21_17:20:33 info: RA output: (LVM_5:start:stderr) File descriptor 6 left open
lrmd[3825]: 2009/08/21_17:20:33 info: RA output: (LVM_5:start:stderr) File descriptor 7 left open
lrmd[3825]: 2009/08/21_17:20:33 info: RA output: (LVM_5:start:stderr) File descriptor 8 left open
lrmd[3825]: 2009/08/21_17:20:33 info: RA output: (LVM_5:start:stderr) File descriptor 9 left open
lrmd[3825]: 2009/08/21_17:20:33 info: RA output: (LVM_5:start:stderr) File descriptor 10 left open
lrmd[3825]: 2009/08/21_17:20:33 info: RA output: (LVM_5:start:stderr) File descriptor 11 left open
lrmd[3825]: 2009/08/21_17:20:33 info: RA output: (LVM_5:start:stderr) File descriptor 13 left open
lrmd[3825]: 2009/08/21_17:20:33 info: RA output: (LVM_5:start:stderr) File descriptor 14 left open
lrmd[3825]: 2009/08/21_17:20:33 info: RA output: (LVM_5:start:stderr) File descriptor 16 left open
lrmd[3825]: 2009/08/21_17:20:33 info: RA output: (LVM_5:start:stderr)
lrmd[3825]: 2009/08/21_17:20:33 info: RA output: (LVM_5:start:stderr)
lrmd[3825]: 2009/08/21_17:20:33 info: RA output: (LVM_5:start:stderr) Using volume group(s) on command line
Finding volume group "vg0_drbd"
crmd[3828]: 2009/08/21_17:20:33 info: process_lrm_event: LRM operation LVM_5_start_0 (call=46, rc=0) complete
tengine[3833]: 2009/08/21_17:20:33 info: match_graph_event: Action LVM_5_start_0 (12) confirmed on gtt5.linux.com (rc=0)
tengine[3833]: 2009/08/21_17:20:33 info: send_rsc_command: Initiating action 22: LVM_5_start_120000 on gtt5.linux.com
tengine[3833]: 2009/08/21_17:20:33 info: send_rsc_command: Initiating action 11: Filesystem_6_start_0 on gtt5.linux.com
crmd[3828]: 2009/08/21_17:20:33 ERROR: construct_op: Start and Stop actions cannot have an interval
crmd[3828]: 2009/08/21_17:20:33 info: do_lrm_rsc_op: Performing op=LVM_5_start_0 key=22:6:b2620470-2f4f-4191-b958-fb74219903cf)
lrmd[3825]: 2009/08/21_17:20:33 info: rsc:LVM_5: start
crmd[3828]: 2009/08/21_17:20:33 info: do_lrm_rsc_op: Performing op=Filesystem_6_start_0 key=11:6:b2620470-2f4f-4191-b958-fb74219903cf)
lrmd[3825]: 2009/08/21_17:20:33 info: rsc:Filesystem_6: start
LVM[5382]: 2009/08/21_17:20:33 INFO: Activating volume group vg0_drbd
Filesystem[5383]: 2009/08/21_17:20:33 INFO: Running start for /dev/drbd0 on /cluster_metadata
crmd[3828]: 2009/08/21_17:20:33 info: process_lrm_event: LRM operation Filesystem_6_start_0 (call=48, rc=0) complete
tengine[3833]: 2009/08/21_17:20:33 info: match_graph_event: Action Filesystem_6_start_0 (11) confirmed on gtt5.linux.com (rc=0)
tengine[3833]: 2009/08/21_17:20:33 info: send_rsc_command: Initiating action 24: Filesystem_6_start_120000 on gtt5.linux.com
tengine[3833]: 2009/08/21_17:20:33 info: send_rsc_command: Initiating action 10: MakeMounts_7_start_0 on gtt5.linux.com
crmd[3828]: 2009/08/21_17:20:33 ERROR: construct_op: Start and Stop actions cannot have an interval
crmd[3828]: 2009/08/21_17:20:33 info: do_lrm_rsc_op: Performing op=Filesystem_6_start_0 key=24:6:b2620470-2f4f-4191-b958-fb74219903cf)
lrmd[3825]: 2009/08/21_17:20:33 info: rsc:Filesystem_6: start
crmd[3828]: 2009/08/21_17:20:33 info: do_lrm_rsc_op: Performing op=MakeMounts_7_start_0 key=10:6:b2620470-2f4f-4191-b958-fb74219903cf)
lrmd[3825]: 2009/08/21_17:20:33 info: rsc:MakeMounts_7: start
MakeMounts[5451]: 2009/08/21_17:20:33 Openfiler making mount paths...
Filesystem[5448]: 2009/08/21_17:20:33 INFO: Running start for /dev/drbd0 on /cluster_metadata
Filesystem[5448]: 2009/08/21_17:20:33 INFO: Filesystem /cluster_metadata is already mounted.
crmd[3828]: 2009/08/21_17:20:33 info: process_lrm_event: LRM operation Filesystem_6_start_0 (call=49, rc=0) complete
tengine[3833]: 2009/08/21_17:20:33 info: match_graph_event: Action Filesystem_6_start_0 (24) confirmed on gtt5.linux.com (rc=0)
crmd[3828]: 2009/08/21_17:20:33 info: process_lrm_event: LRM operation MakeMounts_7_start_0 (call=50, rc=0) complete
tengine[3833]: 2009/08/21_17:20:33 info: match_graph_event: Action MakeMounts_7_start_0 (10) confirmed on gtt5.linux.com (rc=0)
tengine[3833]: 2009/08/21_17:20:33 info: send_rsc_command: Initiating action 26: MakeMounts_7_start_120000 on gtt5.linux.com
crmd[3828]: 2009/08/21_17:20:33 ERROR: construct_op: Start and Stop actions cannot have an interval
crmd[3828]: 2009/08/21_17:20:33 info: do_lrm_rsc_op: Performing op=MakeMounts_7_start_0 key=26:6:b2620470-2f4f-4191-b958-fb74219903cf)
lrmd[3825]: 2009/08/21_17:20:33 info: rsc:MakeMounts_7: start
LVM[5382]: 2009/08/21_17:20:34 INFO: File descriptor 4 left open
File descriptor 5 left open
File descriptor 6 left open
File descriptor 7 left open
File descriptor 8 left open
File descriptor 9 left open
File descriptor 16 left open
Device '/dev/drbd1' has been left open.
Reading all physical volumes. This may take a while...
Found volume group "vg0_drbd" using metadata type lvm2
MakeMounts[5523]: 2009/08/21_17:20:34 Openfiler making mount paths...
crmd[3828]: 2009/08/21_17:20:34 info: process_lrm_event: LRM operation MakeMounts_7_start_0 (call=51, rc=0) complete
LVM[5382]: 2009/08/21_17:20:34 INFO: File descriptor 4 left open
File descriptor 5 left open
File descriptor 6 left open
File descriptor 7 left open
File descriptor 8 left open
File descriptor 9 left open
File descriptor 16 left open
1 logical volume(s) in volume group "vg0_drbd" now active
tengine[3833]: 2009/08/21_17:20:34 info: match_graph_event: Action MakeMounts_7_start_0 (26) confirmed on gtt5.linux.com (rc=0)
lrmd[3825]: 2009/08/21_17:20:34 info: RA output: (LVM_5:start:stderr) File descriptor 4 left open
File descriptor 5 left open
File descriptor 6 left open
File descriptor 7 left open
File descriptor 8 left open
File descriptor 9 left open
File descriptor 16 left open
lrmd[3825]: 2009/08/21_17:20:34 info: RA output: (LVM_5:start:stderr) Using volume group(s) on command line
lrmd[3825]: 2009/08/21_17:20:34 info: RA output: (LVM_5:start:stderr) Finding volume group "vg0_drbd"
crmd[3828]: 2009/08/21_17:20:34 info: process_lrm_event: LRM operation LVM_5_start_0 (call=47, rc=0) complete
tengine[3833]: 2009/08/21_17:20:34 info: match_graph_event: Action LVM_5_start_0 (22) confirmed on gtt5.linux.com (rc=0)
tengine[3833]: 2009/08/21_17:20:34 info: run_graph: Transition 6: (Complete=24, Pending=0, Fired=0, Skipped=0, Incomplete=0)
tengine[3833]: 2009/08/21_17:20:34 info: notify_crmd: Transition 6 status: te_complete - <null>
crmd[3828]: 2009/08/21_17:20:34 info: do_state_transition: State transition S_TRANSITION_ENGINE -> S_IDLE [ input=I_TE_SUCCESS cause=C_IPC_MESSAGE origin=route_message ]
heartbeat[3398]: 2009/08/21_17:22:03 info: Link gtt4.linux.com:eth0 dead.
cib[3824]: 2009/08/21_17:26:28 info: cib_stats: Processed 104 operations (12403.00us average, 0% utilization) in the last 10min
cib[3824]: 2009/08/21_17:36:28 info: cib_stats: Processed 40 operations (8250.00us average, 0% utilization) in the last 10min
cib[3824]: 2009/08/21_17:46:28 info: cib_stats: Processed 40 operations (7750.00us average, 0% utilization) in the last 10min
cib[3824]: 2009/08/21_17:56:28 info: cib_stats: Processed 14 operations (9285.00us average, 0% utilization) in the last 10min

Please tell me where I am going wrong and what I should do.

Thanks,
Prakash KH
