>>From: drbd-user-bounces at lists.linbit.com [mailto:drbd-user-bounces at lists.linbit.com] On Behalf Of Lars Ellenberg
>>On Wed, Sep 24, 2014 at 11:31:58PM +1000, Klint Gore wrote:
>>> Looks like it exists.  Same file exists on both nodes (md5 matches).
>>> Is there a way to tell what version it is? Should there be other
>>> files as well?
>>
>>rpm -qf /usr/lib/ocf/resource.d/linbit/drbd
>
>[root at hans0 log]# rpm -qf /usr/lib/ocf/resource.d/linbit/drbd
>drbd84-utils-8.9.1-1.el7.elrepo.x86_64
>
>> do the resources listed by "drbdadm dump" match the resource names used in the pacemaker configuration?
>
>yes
>
>> do you get something different for "drbdadm -c /etc/drbd.conf dump"?
>
>They're the same
>
>[root at hans0 tmp]# drbdadm dump >admdump
>[root at hans0 tmp]# drbdadm -c /etc/drbd.conf dump >confdump
>[root at hans0 tmp]# ll *dump
>-rw-r--r--. 1 root root 3256 Sep 25 09:30 admdump
>-rw-r--r--. 1 root root 3256 Sep 25 09:30 confdump
>[root at hans0 tmp]# diff admdump confdump
>[root at hans0 tmp]#
>
>
>In trying stuff yesterday, I seem to have caused a change in something that there's new error in the log
>
>Sep 25 09:48:06 [14956] hans0.une.edu.au lrmd: notice: operation_finished: drbd_homeagbu_notify_0:17296:stderr [ /usr/lib/ocf/lib/heartbeat/ocf-shellfuncs: line 226: /var/log/pacemaker.log: Permission denied ] >Sep 25 09:48:06 [14956] hans0.une.edu.au lrmd: notice: operation_finished: drbd_homeagbu_notify_0:17296:stderr [ Could not establish cib_rw connection: Permission denied (13) ] >Sep 25 09:48:06 [14956] hans0.une.edu.au lrmd: notice: operation_finished: drbd_homeagbu_notify_0:17296:stderr [ Error signing on to the CIB service: Transport endpoint is not connected ] >Sep 25 09:48:06 [14956] hans0.une.edu.au lrmd: notice: operation_finished: drbd_homeagbu_notify_0:17296:stderr [ /usr/lib/ocf/lib/heartbeat/ocf-shellfuncs: line 226: /var/log/pacemaker.log: Permission denied ] >Sep 25 09:48:06 [14956] hans0.une.edu.au lrmd: notice: operation_finished: drbd_homeagbu_notify_0:17296:stderr [ /usr/lib/ocf/lib/heartbeat/ocf-shellfuncs: line 226: /var/log/pacemaker.log: Permission denied ] >Sep 25 09:48:06 [14956] hans0.une.edu.au lrmd: notice: operation_finished: drbd_homeagbu_notify_0:17296:stderr [ /usr/lib/ocf/lib/heartbeat/ocf-shellfuncs: line 226: /var/log/pacemaker.log: Permission denied ] >Sep 25 09:48:06 [14956] hans0.une.edu.au lrmd: info: log_finished: finished - rsc:drbd_homeagbu action:notify call_id:117 pid:17296 exit-code:5 exec-time:15058ms queue-time:0ms
>
>Line 226 in ocf-shellfunc is in function ha_log and says
>    222         if
>    223           [ -n "$HA_LOGFILE" ]
>    224         then
>    225           : appending to $HA_LOGFILE
>    226           echo "$HA_LOGTAG:     "`hadate`"${*}" >> $HA_LOGFILE
>    227         fi
>
>The permissions on /usr/lib/ocf/lib/heartbeat/* were all 644 so I changed them to 755 (owner and group is root). The permission on /var/log/pacemaker.log is 660, owner is hacluster, group is haclient.
>I changed that to 666 but it doesn't seem to help.
>
>And now it's giving me the not installed message instead of the not configured. >Sep 25 14:26:03 [19406] hans0.une.edu.au pengine: error: unpack_rsc_op: No further recovery can be attempted for drbd_homeagbu:0: stop action failed with 'not installed' (5) >Sep 25 14:26:03 [19406] hans0.une.edu.au pengine: notice: unpack_rsc_op: Preventing master_drbd from re-starting on hans0: operation stop failed 'not installed' (rc=5) >Sep 25 14:26:03 [19406] hans0.une.edu.au pengine: warning: unpack_rsc_op: Processing failed op stop for drbd_homeagbu:0 on hans0: not installed (5)
>
>
>Attempting manual start on one of the drbd resources
>[root at hans0 log]# pcs resource debug-start drbd_homeagbu
>Operation start for drbd_homeagbu:0 (ocf:linbit:drbd) returned 0
> >  stdout:
> >  stdout:
> >  stdout:
> > stderr: WARNING: You may be disappointed: This RA is intended for pacemaker 1.0 or better!
> >  stderr: WARNING: homeagbu already Primary, demoting.
> > stderr: DEBUG: homeagbu: Calling drbdadm -c /etc/drbd.conf secondary homeagbu
> >  stderr: DEBUG: homeagbu: Exit code 0
> >  stderr: DEBUG: homeagbu: Command output:
> > stderr: DEBUG: homeagbu: Calling drbdadm -c /etc/drbd.conf adjust homeagbu
> >  stderr: DEBUG: homeagbu: Exit code 0
> >  stderr: DEBUG: homeagbu: Command output:
> > stderr: DEBUG: homeagbu: Calling /usr/sbin/crm_master -Q -l reboot -v 10000
> >  stderr: DEBUG: homeagbu: Exit code 0
> >  stderr: DEBUG: homeagbu: Command output:
>[root at hans0 log]# drbd-overview
> 1:homeagbu/0  Connected Secondary/Secondary UpToDate/UpToDate
> 2:backdesk/0  Connected Primary/Secondary UpToDate/UpToDate
> 3:genomics/0  Connected Primary/Secondary UpToDate/UpToDate
> 4:backserv/0  Connected Primary/Secondary UpToDate/UpToDate
> 5:agbudata/0  Connected Primary/Secondary UpToDate/UpToDate
>[root at hans0 log]# pcs resource debug-start drbd_homeagbu
>Operation start for drbd_homeagbu:0 (ocf:linbit:drbd) returned 0
> >  stdout:
> >  stdout:
> > stderr: WARNING: You may be disappointed: This RA is intended for pacemaker 1.0 or better! > > stderr: DEBUG: homeagbu: Calling drbdadm -c /etc/drbd.conf adjust homeagbu
> >  stderr: DEBUG: homeagbu: Exit code 0
> >  stderr: DEBUG: homeagbu: Command output:
> > stderr: DEBUG: homeagbu: Calling /usr/sbin/crm_master -Q -l reboot -v 10000
> >  stderr: DEBUG: homeagbu: Exit code 0
> >  stderr: DEBUG: homeagbu: Command output:
>[root at hans0 log]# less -S pacemaker.log
>[root at hans0 log]# drbd-overview
> 1:homeagbu/0  Connected Secondary/Secondary UpToDate/UpToDate
> 2:backdesk/0  Connected Primary/Secondary UpToDate/UpToDate
> 3:genomics/0  Connected Primary/Secondary UpToDate/UpToDate
> 4:backserv/0  Connected Primary/Secondary UpToDate/UpToDate
> 5:agbudata/0  Connected Primary/Secondary UpToDate/UpToDate
>
Have you managed to fix this issue? I got the exactly same problem as you and I can't find the solution.


cheers,

Steve

_______________________________________________
drbd-user mailing list
[email protected]
http://lists.linbit.com/mailman/listinfo/drbd-user

Reply via email to