2010/12/11 Andrew Beekhof <and...@beekhof.net>

> On Fri, Dec 10, 2010 at 4:59 PM, ruslan usifov <ruslan.usi...@gmail.com> wrote:
> > and to me what to do?
>
> Nothing yet, there looks to be some memory corruption going on.
> With that file I've been able to reproduce locally. I'll let you know
> when there is a fix (hopefully very soon).
>
> > 2010/12/10 Andrew Beekhof <and...@beekhof.net>
> >>
> >> On Fri, Dec 10, 2010 at 11:16 AM, ruslan usifov <ruslan.usi...@gmail.com> wrote:
> >> > you mean some think like this:
> >> >
> >> > Dec 07 15:14:05 storage1 crmd: [16003]: notice: save_cib_contents: Saved CIB
> >> > contents after PE crash to /var/lib/pengine/pe-core-121f59f1-ca5c-4ce4-996c-53f4aa617ac3.bz2
> >>
> >> perfect
> >>
> >> > ??
> >> > If so, i attached it to this email
> >> >
> >> > 2010/12/10 Andrew Beekhof <and...@beekhof.net>
> >> >>
> >> >> On Fri, Dec 10, 2010 at 10:18 AM, ruslan usifov <ruslan.usi...@gmail.com> wrote:
> >> >> > I don't know how to see version of pacemaker, crm doesn't provide -v (or -V
> >> >> > or --version) option, but I got source from here
> >> >> > http://hg.clusterlabs.org/pacemaker/stable-1.0/archive/tip.tar.bz2, as
> >> >> > result I download Pacemaker-1-0-b0266dd5ffa9.tar.bz2
> >> >>
> >> >> Ah, b0266dd5ffa9 tells me exactly what you're running.
> >> >>
> >> >> I'm very concerned that can_run_resources() is crashing, looking at
> >> >> the code it should not be possible.
> >> >> Do you see any logs containing the text "save_cib_contents"? If so,
> >> >> it should mention a filename, could you attach it?
> >> >>
> >> >> > and here is my backtrace:
> >> >> >
> >> >> > gdb /usr/lib/heartbeat/pengine /tmp/pengine.940.core
> >> >> > (gdb) bt
> >> >> > #0 0xb76879b8 in can_run_resources () from /usr/lib/libpengine.so.3
> >> >> > #1 0xb768dcd7 in node_list_attr_score () from /usr/lib/libpengine.so.3
> >> >> > #2 0xb768de91 in ?? () from /usr/lib/libpengine.so.3
> >> >> > #3 0xb768e193 in rsc_merge_weights () from /usr/lib/libpengine.so.3
> >> >> > #4 0xb7681676 in native_merge_weights () from /usr/lib/libpengine.so.3
> >> >> > #5 0xb7698654 in group_merge_weights () from /usr/lib/libpengine.so.3
> >> >> > #6 0xb76a063a in ?? () from /usr/lib/libpengine.so.3
> >> >> > #7 0xb76a17ad in master_color () from /usr/lib/libpengine.so.3
> >> >> > #8 0xb7683f3e in stage5 () from /usr/lib/libpengine.so.3
> >> >> > #9 0xb76813aa in do_calculations () from /usr/lib/libpengine.so.3
> >> >> > #10 0xb7680991 in process_pe_message () from /usr/lib/libpengine.so.3
> >> >> > #11 0x08048ed5 in ?? ()
> >> >> > #12 0xb7611f4e in G_CH_dispatch_int () from /usr/lib/libplumb.so.2
> >> >> > #13 0xb727f5e5 in g_main_context_dispatch () from /lib/libglib-2.0.so.0
> >> >> > #14 0xb72832d8 in ?? () from /lib/libglib-2.0.so.0
> >> >> > #15 0xb7283817 in g_main_loop_run () from /lib/libglib-2.0.so.0
> >> >> > #16 0x080494f8 in ?? ()
> >> >> > #17 0xb7337bd6 in __libc_start_main () from /lib/tls/i686/cmov/libc.so.6
> >> >> > #18 0x08048dc1 in ?? ()

Thanks, but I have a question. I changed my configuration in the test environment (VMware) to the following:

r...@ubuntu:/home/ruslan# crm configure show
node ubuntu
node ubuntu1
primitive drbd_web ocf:linbit:drbd \
        params drbd_resource="web" \
        op monitor interval="15s" \
        op stop interval="0" timeout="300s" \
        op start interval="0" timeout="300s"
primitive iscsi_ip ocf:heartbeat:IPaddr2 \
        params ip="192.168.83.5" nic="eth1:1" cidr_netmask="24" \
        op monitor interval="10s" \
        meta target-role="Started"
primitive iscsi_lun1_web_target ocf:heartbeat:iSCSILogicalUnit \
        params lun="1" path="/dev/drbd0" target_iqn="iqn.2010-06.playrix.local:san.web" implementation="iet"
primitive iscsi_target lsb:iscsitarget
primitive iscsi_web_target ocf:heartbeat:iSCSITarget \
        params iqn="iqn.2010-06.playrix.local:san.web" implementation="iet" \
        op monitor interval="10s" timeout="30s" depth="0" \
        meta target-role="Started"
group iscsi iscsi_ip iscsi_target iscsi_web_target iscsi_lun1_web_target \
        meta target-role="Started"
ms ms_drbd_web drbd_web \
        meta master-max="1" master-node-max="1" clone-max="2" clone-node-max="1" notify="true" target-role="Started" is-managed="true"
colocation ms_drbd_with_iscsi inf: iscsi ms_drbd_web:Master
order o1 0: ms_drbd_web:promote iscsi_ip
order o2 inf: iscsi_ip iscsi_target:start
order o3 inf: iscsi_target iscsi_web_target
order o4 inf: iscsi_web_target iscsi_lun1_web_target
property $id="cib-bootstrap-options" \
        dc-version="1.0.10-b0266dd5ffa9c51377c68b1f29d6bc84367f51dd" \
        cluster-infrastructure="openais" \
        expected-quorum-votes="2" \
        stonith-enabled="false" \
        no-quorum-policy="ignore" \
        last-lrm-refresh="1292028155"
rsc_defaults $id="rsc-options" \
        resource-stickness="100"

When I start one node (without the second), I get the following output:

r...@ubuntu:/home/ruslan# crm_mon -1
Stack: openais
Current DC: ubuntu - partition WITHOUT quorum
Version: 1.0.10-b0266dd5ffa9c51377c68b1f29d6bc84367f51dd
2 Nodes configured, 2 expected votes
2 Resources configured.
============

Online: [ ubuntu ]
OFFLINE: [ ubuntu1 ]

 Master/Slave Set: ms_drbd_web
     Slaves: [ ubuntu ]
     Stopped: [ drbd_web:1 ]

Failed actions:
    iscsi_target_monitor_0 (node=ubuntu, call=4, rc=1, status=complete): unknown error
    iscsi_web_target_monitor_0 (node=ubuntu, call=5, rc=5, status=complete): not installed
    iscsi_lun1_web_target_monitor_0 (node=ubuntu, call=6, rc=5, status=complete): not installed

Hm, this is very strange. I expected these rules:

order o1 inf: ms_drbd_web:promote iscsi_ip
order o2 inf: iscsi_ip iscsi_target:start
order o3 inf: iscsi_target iscsi_web_target
order o4 inf: iscsi_web_target iscsi_lun1_web_target

to prevent the launch and monitoring of all resources that depend on DRBD, but the failed actions confuse me: why were these resources monitored?

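For reading the "Failed actions" entries above, here is a minimal Python sketch that splits each entry into resource, action, interval, node, and return code. It assumes the usual Pacemaker naming of operation keys as <resource>_<action>_<interval>, and that a monitor with interval 0 is a one-time probe of the resource's current state rather than a recurring monitor; both are assumptions about how this output is structured, not something confirmed in this thread. The names FAILED_ACTION, parse_failed_actions, and SAMPLE are hypothetical and exist only for this illustration.

```python
import re

# Hypothetical pattern for one entry under "Failed actions:" in `crm_mon -1`.
# Assumes operation keys look like <resource>_<action>_<interval>.
FAILED_ACTION = re.compile(
    r"(?P<rsc>\S+)_(?P<action>[a-z]+)_(?P<interval>\d+) "
    r"\(node=(?P<node>[^,]+), call=(?P<call>\d+), rc=(?P<rc>\d+), "
    r"status=(?P<status>[^)]+)\): (?P<reason>.+)"
)


def parse_failed_actions(text):
    """Return one dict per failed-action line found in `text`."""
    results = []
    for line in text.splitlines():
        match = FAILED_ACTION.search(line.strip())
        if match is None:
            continue  # headings and unrelated lines are skipped
        entry = match.groupdict()
        entry["interval"] = int(entry["interval"])
        entry["call"] = int(entry["call"])
        entry["rc"] = int(entry["rc"])
        # Assumption: an interval-0 monitor is a one-time probe.
        entry["is_probe"] = (
            entry["action"] == "monitor" and entry["interval"] == 0
        )
        results.append(entry)
    return results


# The three entries are copied from the crm_mon output in this message.
SAMPLE = """\
Failed actions:
    iscsi_target_monitor_0 (node=ubuntu, call=4, rc=1, status=complete): unknown error
    iscsi_web_target_monitor_0 (node=ubuntu, call=5, rc=5, status=complete): not installed
    iscsi_lun1_web_target_monitor_0 (node=ubuntu, call=6, rc=5, status=complete): not installed
"""

if __name__ == "__main__":
    for entry in parse_failed_actions(SAMPLE):
        kind = "probe" if entry["is_probe"] else "operation"
        print(f'{entry["rsc"]}: {kind} on {entry["node"]}, '
              f'rc={entry["rc"]} ({entry["reason"]})')
```

If that assumption holds, all three failures are probes (monitor with interval 0), not the recurring monitors defined with interval="10s" in the configuration.
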
_______________________________________________
Pacemaker mailing list: Pacemaker@oss.clusterlabs.org
http://oss.clusterlabs.org/mailman/listinfo/pacemaker

Project Home: http://www.clusterlabs.org
Getting started: http://www.clusterlabs.org/doc/Cluster_from_Scratch.pdf
Bugs: http://developerbugs.linux-foundation.org/enter_bug.cgi?product=Pacemaker