On Fri, Dec 24, 2010 at 08:59:29AM +0200, Alexander Bodnarashik wrote:
>
> On Dec 23, 2010, at 18:53, Dejan Muhamedagic wrote:
>
> > On Fri, Dec 10, 2010 at 02:08:34PM +0200, Alexander Bodnarashik wrote:
> >> I've moved a little further with running the Linux-HA suite on FreeBSD,
> >> but there are a few more problems:
> >>
> >> 1. crm node standby/online is not working.
> >> Issuing the following command has no effect, and the node remains online/standby:
> >>> [r...@alice ~]# crm node standby alice.local reboot
> >>> scope=nodes name=standby value=off
> >>> [r...@alice ~]#
> >> debug log says:
> >>> Dec 10 13:51:09 alice cib: [50795]: debug: cib_process_xpath: Processing
> >>> cib_query op for
> >>> //cib/configuration/nodes//no...@id='alice.local']//nvpa...@name='standby']
> >>> (/cib/configuration/nodes/node[3]/instance_attributes/nvpair)
> >>> Dec 10 13:51:09 alice cib: [50795]: debug: cib_process_xpath: Processing
> >>> cib_query op for
> >>> //cib/configuration/nodes//no...@id='alice.local']//nvpa...@name='standby']
> >>> (/cib/configuration/nodes/node[3]/instance_attributes/nvpair)
> >> If the attribute does not exist, the output is as follows:
> >>> [r...@alice ~]# crm node standby alice reboot
> >>> Error performing operation: The object/attribute does not exist
> >>> scope=status name=standby value=off
> >> and logs:
> >>> Dec 10 14:07:36 alice cib: [50795]: debug: cib_process_xpath: cib_query:
> >>> //cib/configuration/nodes//no...@id='alice.local']//nvpa...@name='standby']
> >>> does not exist
> >>> Dec 10 14:07:36 alice cib: [50795]: debug: cib_process_xpath: cib_query:
> >>> //cib/status//node_sta...@id='alice.local']//nvpa...@name='standby'] does
> >>> not exist
> >>
> >> On the other hand, the following puts the node in standby mode:
> >>> [r...@alice ~]# crm node attribute alice.local set standby on
> >>> [r...@alice ~]#
> >
> > Best to open a bugzilla for this. It's funny because newer crm
> > shell version in 1.1 uses crm_attribute for standby/online. The
> > stable 1.0 series however still relies on crm_standby.
> I guess I will, but I want to try Pacemaker 1.1 first.
crm_standby is available in 1.1 too. Andrew modified the shell
to use crm_attribute in its stead. The log message says:
    Use crm_attribute directly instead of the
    deprecated crm_standby and crm_failcount wrappers
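For reference, a rough sketch of what a direct crm_attribute call for
standby could look like. The helper name standby_cmd is made up for
illustration, and the short options (-N node, -n name, -v value,
-l lifetime) are assumed from the crm_attribute help text, so check them
against your installed version. The function only prints the command
instead of running it:

```shell
#!/bin/sh
# Sketch only: build and print the crm_attribute call for standby.
# standby_cmd is a hypothetical helper, not part of the crm shell.
standby_cmd() {
    node=$1                  # node name, e.g. alice.local
    value=$2                 # "on" or "off"
    lifetime=${3:-forever}   # "reboot" or "forever" (assumed default)
    echo "crm_attribute -N $node -n standby -v $value -l $lifetime"
}

# Print the command for putting alice.local into standby until reboot.
standby_cmd alice.local on reboot
```

A lifetime of "reboot" should make the attribute transient (kept in the
status section), while "forever" stores it in the nodes section.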
> >> 2. hb_report basically does not work as expected:
> >>> [r...@alice ~]# hb_report -f 1pm /root/hb.log
> >>> alice.local: INFO: logd used but logd.cf not found: using defaults
> >>> alice.local: WARN: this is not a node and you didn't specify a list of
> >>> nodes using -n
> >>> alice.local: WARN: could not find on alice.local
> >
> > hb_report obviously cannot figure out that this node is a member
> > of the cluster. Was pacemaker running at that point in time? Did
> > you try to specify the nodes with -n?
> Yes, pacemaker was running on the node.
> I tried to specify nodes with -n:
> [r...@alice ~]# hb_report -f 1pm -n alice.local /root/hb.log
> alice.local: INFO: logd used but logd.cf not found: using defaults
> alice.local: WARN: could not find on alice.local
> usage: mktemp [-d] [-q] [-t prefix] [-u] template ...
> mktemp [-d] [-q] [-u] -t prefix
> alice.local: WARN: sorry, can't create temporary file for find_files
> usage: mktemp [-d] [-q] [-t prefix] [-u] template ...
> mktemp [-d] [-q] [-u] -t prefix
> alice.local: WARN: sorry, can't create temporary file for find_files
> usage: mktemp [-d] [-q] [-t prefix] [-u] template ...
> mktemp [-d] [-q] [-u] -t prefix
> alice.local: ERROR: cannot create temporary files
Oops. mktemp usage differs on *BSD: as the usage message above shows, it
wants either a template argument or -t prefix. hb_report needs to specify
the template explicitly.
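A minimal sketch of the portable form, assuming nothing about how
hb_report is actually structured: the helper names and the "hb_report"
prefix are hypothetical. Passing a full template ending in XXXXXX is
accepted by both the GNU and the BSD mktemp:

```shell
#!/bin/sh
# Hypothetical helpers (not taken from hb_report): always pass an
# explicit template so that both GNU and BSD mktemp accept the call.
tmp_base="${TMPDIR:-/tmp}"

make_tmpfile() {
    mktemp "$tmp_base/hb_report.XXXXXX"
}

make_tmpdir() {
    mktemp -d "$tmp_base/hb_report.XXXXXX"
}

f=$(make_tmpfile) || { echo "cannot create temporary file" >&2; exit 1; }
d=$(make_tmpdir) || { echo "cannot create temporary directory" >&2; exit 1; }
echo "file: $f"
echo "dir:  $d"
```

The caller is expected to remove the file and directory when done.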
> [r...@alice ~]# crm_mon -1
> ============
> Last updated: Fri Dec 24 08:52:38 2010
> Stack: openais
> Current DC: bob.local - partition with quorum
> Version: 1.0.10-b2e39d318fda501e2fcf223c2d039b721f3679a9
> 3 Nodes configured, 3 expected votes
> 1 Resources configured.
> ============
>
> Online: [ mike.local bob.local alice.local ]
>
> Resource Group: gr
> dummy1 (ocf::heartbeat:Dummy): Started mike.local
> dummy2 (ocf::heartbeat:Dummy): Started mike.local
> dummy3 (ocf::heartbeat:Dummy): Started mike.local
> dummy4 (ocf::heartbeat:Dummy): Started mike.local
> dummy5 (ocf::heartbeat:Dummy): Started mike.local
> dummy6 (ocf::heartbeat:Dummy): Started mike.local
> dummy7 (ocf::heartbeat:Dummy): Started mike.local
> dummy8 (ocf::heartbeat:Dummy): Started mike.local
> dummy9 (ocf::heartbeat:Dummy): Started mike.local
> dummy10 (ocf::heartbeat:Dummy): Started mike.local
> dummy11 (ocf::heartbeat:Dummy): Started mike.local
> dummy12 (ocf::heartbeat:Dummy): Started mike.local
> dummy13 (ocf::heartbeat:Dummy): Started mike.local
> dummy14 (ocf::heartbeat:Dummy): Started mike.local
> dummy15 (ocf::heartbeat:Dummy): Started mike.local
> [r...@alice ~]#
>
> At this point there is nothing suspicious in the logs:
> Dec 24 08:52:18 alice cib: [886]: WARN: adjust rcvbuf size to 1048576 failed:
> No buffer space available
> Dec 24 08:52:18 alice cib: [886]: WARN: adjust sndbuf size to 1048576 failed:
> No buffer space available
> Dec 24 08:52:18 alice cib: [886]: WARN: adjust rcvbuf size to 1048576 failed:
> No buffer space available
> Dec 24 08:52:18 alice cib: [886]: WARN: adjust sndbuf size to 1048576 failed:
> No buffer space available
>
> >
> >>> The report is saved in /root/hb.log.tar.bz2
> >>> [r...@alice ~]# tar tjf /root/hb.log.tar.bz2
> >>> hb.log/
> >>> hb.log/analysis.txt
> >>> hb.log/description.txt
> >>> [r...@alice ~]#
> >>
> >>
> >> My configuration:
> >>> [r...@alice ~]# uname -a
> >>> FreeBSD alice.local 8.1-RELEASE FreeBSD 8.1-RELEASE #0: Mon Jul 19
> >>> 02:55:53 UTC 2010
> >>> [email protected]:/usr/obj/usr/src/sys/GENERIC i386
> >>
> >>> [r...@alice ~]# cibadmin -$
> >>> cibadmin 1.0.10 for OpenAIS (Build:
> >>> b0266dd5ffa9c51377c68b1f29d6bc84367f51dd)
> >>>
> >>> Written by Andrew Beekhof
> >>
> >>> [r...@alice ~]# corosync -v
> >>> Corosync Cluster Engine, version '1.2.8' SVN revision '3035'
> >>> Copyright (c) 2006-2009 Red Hat, Inc.
> >>
> >>> [r...@alice ~]# crm configure show
> >>> node alice.local \
> >>> attributes standby="on"
> >>> node bob.local \
> >>> attributes standby="off"
> >>> node mike.local
> >>> property $id="cib-bootstrap-options" \
> >>> dc-version="1.0.10-b0266dd5ffa9c51377c68b1f29d6bc84367f51dd" \
> >>> cluster-infrastructure="openais" \
> >>> expected-quorum-votes="3" \
> >>> no-quorum-policy="ignore" \
> >>> stonith-enabled="true" \
> >>> last-lrm-refresh="1291296991" \
> >>> default-resource-stickiness="100" \
> >>> stonith-action="reboot"
> >>> rsc_defaults $id="rsc-options" \
> >>> allow-migrate="true"
> >>> op_defaults $id="op-options" \
> >>> on-fail="restart"
> >>
> >>> [r...@alice ~]# crm_mon -1
> >>> ============
> >>> Last updated: Fri Dec 10 14:02:28 2010
> >>> Stack: openais
> >>> Current DC: alice.local - partition WITHOUT quorum
> >>> Version: 1.0.10-b0266dd5ffa9c51377c68b1f29d6bc84367f51dd
> >>> 3 Nodes configured, 3 expected votes
> >>> 0 Resources configured.
> >>> ============
> >>>
> >>> Node alice.local: standby
> >>> OFFLINE: [ mike.local bob.local ]
> >>
> >> Please let me know if I should provide more info.
> >
> > Are those nodes really offline? Did you manage to form a
> > cluster?
> >
> Yes, they were really offline. I switched them off to minimize the test case.
> I've managed to form a cluster: the nodes see each other and failover/fencing
> (meatware) seems to work, but I have not tested the cluster much.
Great. If you feel like testing, there's CTS (the Cluster Test Suite).
Thanks,
Dejan
_______________________________________________
Linux-HA mailing list
[email protected]
http://lists.linux-ha.org/mailman/listinfo/linux-ha
See also: http://linux-ha.org/ReportingProblems