On Mon, Oct 12, 2009 at 1:45 PM, Dejan Muhamedagic <[email protected]>wrote:

> Hi,
>
> On Mon, Oct 12, 2009 at 01:25:31PM +0300, Stratos Zolotas wrote:
> > On Mon, Oct 12, 2009 at 10:16 AM, Andrew Beekhof <[email protected]>
> wrote:
> >
> > > On Mon, Oct 12, 2009 at 8:34 AM, Stratos Zolotas <[email protected]>
> wrote:
> > > > Hello to the list.
> > > >
> > > > Please excuse my ignorance, because it is the first time i try to
> built a
> > > > cluster.
> > > >
> > > > I'm trying to built a 2 node Active/Passive cluster with
> > > > DRBD+Pacemaker+Openais.
> > > >
> > > > I'm on the very beginning and i try to achieve initial communication
> > > between
> > > > the nodes.
> > > >
> > > > I'm getting the following:
> > > >
> > > > crm_mon[12156]: 2009/10/12_09:22:36 ERROR: unpack_resources: No
> STONITH
> > > > resources have been defined
> > > > crm_mon[12156]: 2009/10/12_09:22:36 ERROR: unpack_resources: Either
> > > > configure some or disable STONITH with the stonith-enabled option
> > > > crm_mon[12156]: 2009/10/12_09:22:36 ERROR: unpack_resources: NOTE:
> > > Clusters
> > > > with shared data need STONITH to ensure data integrity
> > > >
> > > >
> > > > ============
> > > > Last updated: Mon Oct 12 09:22:36 2009
> > > > Current DC: NONE
> > > > 0 Nodes configured, unknown expected votes
> > > > 0 Resources configured.
> > > > ============
> > >
> > > You might just need to wait a bit longer.
> > > But its hard to say without seeing all the logs.  If you attach them
> > > (compressed) we'll be able to help further.
> > >
> > > > I don't care for the first three messages, because i haven't
> configure
> > > > anything yet, but it seems that i don't have communication between
> the
> > > > nodes. There is no any firewall and the communication is on a
> dedicated
> > > LAN.
> > > >
> > > > My openais.conf (identical for the two systems) is:
> > > >
> > > > alpha:/etc/ais # crm_mon --one-shot -V
> > > > # Please read the openais.conf.5 manual page
> > > >
> > > > aisexec {
> > > >        # Run as root - this is necessary to be able to manage
> resources
> > > > with Pacemaker
> > > >        user:   root
> > > >        group:  root
> > > > }
> > > >
> > > > service {
> > > >        # Load the Pacemaker Cluster Resource Manager
> > > >        ver:       0
> > > >        name:      pacemaker
> > > >        use_mgmtd: yes
> > > >        use_logd:  yes
> > > > }
> > > >
> > > > totem {
> > > >        version: 2
> > > >
> > > >        # How long before declaring a token lost (ms)
> > > >        token:          5000
> > > >
> > > >        # How many token retransmits before forming a new
> configuration
> > > >        token_retransmits_before_loss_const: 10
> > > >
> > > >        # How long to wait for join messages in the membership
> protocol
> > > (ms)
> > > >        join:           1000
> > > >
> > > >        # How long to wait for consensus to be achieved before
> starting a
> > > > new round of membership conf$
> > > >        consensus:      2500
> > > >
> > > > # Turn off the virtual synchrony filter
> > > >        vsftype:        none
> > > >
> > > >        # Number of messages that may be sent by one processor on
> receipt
> > > of
> > > > the token
> > > >        max_messages:   20
> > > >
> > > >        # Stagger sending the node join messages by 1..send_join ms
> > > >        send_join: 45
> > > >
> > > >        # Limit generated nodeids to 31-bits (positive signed
> integers)
> > > >        clear_node_high_bit: yes
> > > >
> > > >        # Disable encryption
> > > >        secauth:        off
> > > >
> > > >        # How many threads to use for encryption/decryption
> > > >        threads:        0
> > > >
> > > >        # Optionally assign a fixed node id (integer)
> > > >        # nodeid:         1234
> > > >
> > > >        interface {
> > > >                ringnumber: 0
> > > >
> > > >                # The following values need to be set based on your
> > > > environment
> > > >                bindnetaddr: 192.168.67.0
> > > >                mcastaddr: 226.94.1.1
> > > >                mcastport: 5405
> > > >        }
> > > >
> > > > logging {
> > > >        debug: off
> > > >        fileline: off
> > > >        to_syslog: yes
> > > >        to_stderr: off
> > > >        syslog_facility: daemon
> > > >        timestamp: on
> > > > }
> > > >
> > > > amf {
> > > >        mode: disabled
> > > > }
> > > >
> > > >
> > > > The first node is on 192.168.67.10 and the second on 192.168.67.11.
> > > >
> > > > Am i missing something?
> > > >
> > > > Thank you in advance and please forgive my lack of knowledge.
> > > >
> > > > Stratos.
> >
> > Thank you for your immediate response.
> >
> > I think that something is wrong because i'm waiting for at least 2-3
> hours
> > for the nodes to appear.
>
> That seems to be a bit excessive :)
>
> > Please find the logs for the first machine (/var/log/messages) attached
> to
> > the message. If the logs from the second node are needed please ask me to
> > send them, but i think that the problem is common for both nodes.
> >
> > I'm sending only the logs after the last run of openais (rcopenais start
> on
> > Opensuse 11.1)
>
> There was a segfault in crmd/plumbing:
>
> Oct 12 09:13:04 alpha kernel: crmd[11007]: segfault at 18 ip
> 00007f40ea896eee sp 00007fff0336a960 error 4 in
> libplumb.so.2.0.0[7f40ea87a000+30000]
>
> You should capture the backtrace with gdb or use hb_report.
> Hopefully there's a core file.
>
> There won't be much of a cluster without crmd. Otherwise, openais
> seems to function fine.
>
> Thanks,
>
> Dejan
>
>
> > Thank you again.
> >
> >
> > Stratos
> >
> >
> >
> > --
> > Kernel IT Solutions Ltd
> > http://www.kernelit.gr
> >
> > Cyclades Wireless Network
> > http://www.cywn.gr
>
>
> > _______________________________________________
> > Linux-HA mailing list
> > [email protected]
> > http://lists.linux-ha.org/mailman/listinfo/linux-ha
> > See also: http://linux-ha.org/ReportingProblems
>
> _______________________________________________
> Linux-HA mailing list
> [email protected]
> http://lists.linux-ha.org/mailman/listinfo/linux-ha
> See also: http://linux-ha.org/ReportingProblems
>


Thank you for your response. I have post the problem to the pacemaker's list
and if i discovered something i will report back.

Thanks again.


-- 
Kernel IT Solutions Ltd
http://www.kernelit.gr

Cyclades Wireless Network
http://www.cywn.gr
_______________________________________________
Linux-HA mailing list
[email protected]
http://lists.linux-ha.org/mailman/listinfo/linux-ha
See also: http://linux-ha.org/ReportingProblems

Reply via email to