On 11/11/2010 02:35 AM, Andrew Beekhof wrote:
> On Wed, Oct 27, 2010 at 5:15 PM, Steven Dake <[email protected]> wrote:
>> On 10/26/2010 11:17 PM, Andrew Beekhof wrote:
>>>
>>> On Wed, Oct 27, 2010 at 7:32 AM, nozawat<[email protected]> wrote:
>>>>
>>>> Hi Andrew,
>>>>
>>>> I am sending two log files, terminal.log and ha.log.
>>>> The terminal log contains the output of the commands
>>>> "ps -ef|grep coro" and "crm_mon -f -1".
>>>>
>>>> I do not fully understand it, but judging from the log, processing
>>>> seems to complete normally even though corosync dumps core.
>>>
>>> Oct 27 10:53:12 hb0101 corosync[6695]: [pcmk ] plugin.c:1526 ERROR:
>>> send_cluster_msg_raw: Child 7016 spawned to record non-fatal assertion
>>> failure line 1526: rc == 0
>>>
>>> Oct 27 10:53:12 hb0101 corosync[6695]: [pcmk ] plugin.c:1526 ERROR:
>>> send_cluster_msg_raw: Message not sent (-1):<copy t="cib"
>>> cib_op="cib_replace" cib_delegated_from="hb0102"
>>> cib_clientname="hb0102" cib_isreplyto="hb0102" original_c
>>>
>>> For some reason
>>> rc = pcmk_api->totem_mcast(&iovec, 1, TOTEMPG_SAFE);
>>> is returning -1
>>>
>>>
>>> Steve: would this happen if membership was in flux?
>>> I thought only IPC got stopped.
>>>
>>
>> it could
>>
>> If api->totem_mcast sends many messages it can fill up the totem queue and
>> return -1. The best way to send messages outside of IPC is to use the
>> schedwrk api. It requests that a piece of work be done when the token is
>> sent (and hopefully there are more spots in the new message queue).
>> It will continue to schedule work until 0 is returned by the callback
>> registered with schedwrk.
>
> what about a while-loop with a sleep in it?
>
>>
That could cause all kinds of problems with the membership system timers,
resulting in weird behavior and bad membership states. That is why there is
a schedwrk api.

Regards
-steve

>> Regards
>> -steve
>>
>>>>
>>>> Regards,
>>>> Tomo
>>>>
>>>>
>>>> 2010/10/27 Andrew Beekhof<[email protected]>
>>>>>
>>>>> On Tue, Oct 26, 2010 at 11:22 AM, nozawat<[email protected]> wrote:
>>>>>>
>>>>>> Hi all,
>>>>>>
>>>>>> My environment is as follows.
>>>>>> * cluster-glue-1.0.6
>>>>>> * resource-agents-1.0.3
>>>>>> * corosync-1.2.8 (svn revision '3059')
>>>>>> * pacemaker-1.1.3-2f0326468a33acb1ada8fa744c7d36d0b315bd35
>>>>>>
>>>>>> corosync on the DC node dumped a core file when I loaded a crm file.
>>>>>>
>>>>>> The information from the core file is as follows.
>>>>>
>>>>> log file?
>>>>> you're tripping over an assertion, it would be good to know which one
>>>>>
>>>>>>
>>>>>> [r...@hb0101 ~]$ file /var/lib/corosync/core.32727
>>>>>> /var/lib/corosync/core.32727: ELF 64-bit LSB core file AMD x86-64, version 1 (SYSV), SVR4-style, from 'corosync'
>>>>>>
>>>>>> [r...@hb0101 ~]$ gdb /usr/sbin/corosync /var/lib/corosync/core.32727
>>>>>> GNU gdb Fedora (6.8-37.el5)
>>>>>> Copyright (C) 2008 Free Software Foundation, Inc.
>>>>>> License GPLv3+: GNU GPL version 3 or later <http://gnu.org/licenses/gpl.html>
>>>>>> This is free software: you are free to change and redistribute it.
>>>>>> There is NO WARRANTY, to the extent permitted by law. Type "show copying"
>>>>>> and "show warranty" for details.
>>>>>> This GDB was configured as "x86_64-redhat-linux-gnu"...
>>>>>> Reading symbols from /usr/lib64/libtotem_pg.so.4...done.
>>>>>> Loaded symbols for /usr/lib64/libtotem_pg.so.4
>>>>>> Reading symbols from /usr/lib64/liblogsys.so.4...done.
>>>>>> Loaded symbols for /usr/lib64/liblogsys.so.4
>>>>>> Reading symbols from /usr/lib64/libcoroipcs.so.4...done.
>>>>>> Loaded symbols for /usr/lib64/libcoroipcs.so.4
>>>>>> Reading symbols from /lib64/librt.so.1...done.
>>>>>> Loaded symbols for /lib64/librt.so.1
>>>>>> Reading symbols from /lib64/libpthread.so.0...done.
>>>>>> Loaded symbols for /lib64/libpthread.so.0
>>>>>> Reading symbols from /lib64/libdl.so.2...done.
>>>>>> Loaded symbols for /lib64/libdl.so.2
>>>>>> Reading symbols from /lib64/libc.so.6...done.
>>>>>> Loaded symbols for /lib64/libc.so.6
>>>>>> Reading symbols from /usr/lib64/libssl3.so...done.
>>>>>> Loaded symbols for /usr/lib64/libssl3.so
>>>>>> Reading symbols from /usr/lib64/libsmime3.so...done.
>>>>>> Loaded symbols for /usr/lib64/libsmime3.so
>>>>>> Reading symbols from /usr/lib64/libnss3.so...done.
>>>>>> Loaded symbols for /usr/lib64/libnss3.so
>>>>>> Reading symbols from /usr/lib64/libnssutil3.so...done.
>>>>>> Loaded symbols for /usr/lib64/libnssutil3.so
>>>>>> Reading symbols from /usr/lib64/libplds4.so...done.
>>>>>> Loaded symbols for /usr/lib64/libplds4.so
>>>>>> Reading symbols from /usr/lib64/libplc4.so...done.
>>>>>> Loaded symbols for /usr/lib64/libplc4.so
>>>>>> Reading symbols from /usr/lib64/libnspr4.so...done.
>>>>>> Loaded symbols for /usr/lib64/libnspr4.so
>>>>>> Reading symbols from /lib64/ld-linux-x86-64.so.2...done.
>>>>>> Loaded symbols for /lib64/ld-linux-x86-64.so.2
>>>>>> Reading symbols from /usr/libexec/lcrso/objdb.lcrso...done.
>>>>>> Loaded symbols for /usr/libexec/lcrso/objdb.lcrso
>>>>>> Reading symbols from /usr/libexec/lcrso/coroparse.lcrso...done.
>>>>>> Loaded symbols for /usr/libexec/lcrso/coroparse.lcrso
>>>>>> Reading symbols from /usr/libexec/lcrso/pacemaker.lcrso...done.
>>>>>> Loaded symbols for /usr/libexec/lcrso/pacemaker.lcrso
>>>>>> Reading symbols from /usr/lib64/libplumb.so.2...done.
>>>>>> Loaded symbols for /usr/lib64/libplumb.so.2
>>>>>> Reading symbols from /usr/lib64/libpils.so.2...done.
>>>>>> Loaded symbols for /usr/lib64/libpils.so.2
>>>>>> Reading symbols from /usr/lib64/libbz2.so.1...done.
>>>>>> Loaded symbols for /usr/lib64/libbz2.so.1
>>>>>> Reading symbols from /usr/lib64/libxslt.so.1...done.
>>>>>> Loaded symbols for /usr/lib64/libxslt.so.1
>>>>>> Reading symbols from /usr/lib/libxml2.so.2...done.
>>>>>> Loaded symbols for /usr/lib/libxml2.so.2
>>>>>> Reading symbols from /lib64/libuuid.so.1...done.
>>>>>> Loaded symbols for /lib64/libuuid.so.1
>>>>>> Reading symbols from /lib64/libpam.so.0...done.
>>>>>> Loaded symbols for /lib64/libpam.so.0
>>>>>> Reading symbols from /lib64/libglib-2.0.so.0...done.
>>>>>> Loaded symbols for /lib64/libglib-2.0.so.0
>>>>>> Reading symbols from /usr/lib64/libz.so.1...done.
>>>>>> Loaded symbols for /usr/lib64/libz.so.1
>>>>>> Reading symbols from /lib64/libm.so.6...done.
>>>>>> Loaded symbols for /lib64/libm.so.6
>>>>>> Reading symbols from /lib64/libaudit.so.0...done.
>>>>>> Loaded symbols for /lib64/libaudit.so.0
>>>>>> Reading symbols from /lib64/libnss_files.so.2...done.
>>>>>> Loaded symbols for /lib64/libnss_files.so.2
>>>>>> Reading symbols from /usr/libexec/lcrso/service_evs.lcrso...done.
>>>>>> Loaded symbols for /usr/libexec/lcrso/service_evs.lcrso
>>>>>> Reading symbols from /usr/libexec/lcrso/service_cfg.lcrso...done.
>>>>>> Loaded symbols for /usr/libexec/lcrso/service_cfg.lcrso
>>>>>> Reading symbols from /usr/libexec/lcrso/service_cpg.lcrso...done.
>>>>>> Loaded symbols for /usr/libexec/lcrso/service_cpg.lcrso
>>>>>> Reading symbols from /usr/libexec/lcrso/service_confdb.lcrso...done.
>>>>>> Loaded symbols for /usr/libexec/lcrso/service_confdb.lcrso
>>>>>> Reading symbols from /usr/libexec/lcrso/service_pload.lcrso...done.
>>>>>> Loaded symbols for /usr/libexec/lcrso/service_pload.lcrso
>>>>>> Reading symbols from /usr/libexec/lcrso/vsf_quorum.lcrso...done.
>>>>>> Loaded symbols for /usr/libexec/lcrso/vsf_quorum.lcrso
>>>>>> Core was generated by `corosync'.
>>>>>> Program terminated with signal 6, Aborted.
>>>>>> [New process 32727]
>>>>>> #0 0x0000003fff430265 in raise () from /lib64/libc.so.6
>>>>>> (gdb) where
>>>>>> #0 0x0000003fff430265 in raise () from /lib64/libc.so.6
>>>>>> #1 0x0000003fff431d10 in abort () from /lib64/libc.so.6
>>>>>> #2 0x00002aaaaabaea0e in send_cluster_msg_raw () from /usr/libexec/lcrso/pacemaker.lcrso
>>>>>> #3 0x00002aaaaabae4e2 in route_ais_message () from /usr/libexec/lcrso/pacemaker.lcrso
>>>>>> #4 0x00002aaaaabac13f in pcmk_ipc () from /usr/libexec/lcrso/pacemaker.lcrso
>>>>>> #5 0x00000039316026cc in pthread_ipc_consumer (conn=<value optimized out>) at coroipcs.c:727
>>>>>> #6 0x00000030000064a7 in start_thread () from /lib64/libpthread.so.0
>>>>>> #7 0x0000003fff4d3c2d in clone () from /lib64/libc.so.6
>>>>>> (gdb)
>>>>>>
>>>>>>
>>>>>> Regards,
>>>>>> Tomo
>>>>>>
>>>>>>
>>>>>>
>>>>>> _______________________________________________
>>>>>> Openais mailing list
>>>>>> [email protected]
>>>>>> https://lists.linux-foundation.org/mailman/listinfo/openais
>>>>>>
>>>>
>>>>
>>> _______________________________________________
>>> Openais mailing list
>>> [email protected]
>>> https://lists.linux-foundation.org/mailman/listinfo/openais
>>
>>
_______________________________________________
Openais mailing list
[email protected]
https://lists.linux-foundation.org/mailman/listinfo/openais

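For illustration only, the retry-on-token pattern discussed in this thread (send until the queue is full, stop, and get called again on the next token until the callback returns 0) can be sketched as below. This is a self-contained model, not corosync code: mock_queue, mock_mcast, send_job, send_pending and mock_schedwrk_run are all hypothetical names, and the real schedwrk and totem_mcast signatures should be checked against the corosync headers before adapting anything.

```c
#include <assert.h>
#include <stddef.h>

/* All names below are hypothetical stand-ins, not corosync symbols. */

/* A bounded outgoing queue; each token rotation frees its slots. */
struct mock_queue {
    int capacity;   /* slots available after each token (must be > 0) */
    int free_slots; /* slots currently free */
    int delivered;  /* messages accepted so far */
};

/* Models a multicast call: 0 on success, -1 when the queue is full. */
int mock_mcast(struct mock_queue *q)
{
    if (q->free_slots == 0) {
        return -1;
    }
    q->free_slots--;
    q->delivered++;
    return 0;
}

/* Work item: the queue to send on and how many messages remain. */
struct send_job {
    struct mock_queue *queue;
    int remaining;
};

/* Work callback: 0 means done, -1 means "call me again on the next token". */
int send_pending(void *context)
{
    struct send_job *job = context;

    assert(job != NULL && job->queue != NULL);
    while (job->remaining > 0) {
        if (mock_mcast(job->queue) != 0) {
            return -1; /* queue full: stop here, no sleeping or spinning */
        }
        job->remaining--;
    }
    return 0;
}

/* Models the scheduler: one callback invocation per token until the
 * callback returns 0. Returns the number of token rotations used. */
int mock_schedwrk_run(struct mock_queue *q, int (*fn)(void *), void *context)
{
    int tokens = 0;

    assert(q != NULL && q->capacity > 0);
    do {
        tokens++;
        q->free_slots = q->capacity; /* token arrival frees the queue */
    } while (fn(context) != 0);
    return tokens;
}
```

With a queue capacity of 4 and a job of 10 messages, mock_schedwrk_run(&q, send_pending, &job) invokes the callback on three tokens and all 10 messages are accepted. The point of the design is that the callback never blocks: when the send fails it returns immediately, so nothing in the model would hold up the timers the way a sleep loop could.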