On 11/11/2010 02:35 AM, Andrew Beekhof wrote:
> On Wed, Oct 27, 2010 at 5:15 PM, Steven Dake <[email protected]> wrote:
>> On 10/26/2010 11:17 PM, Andrew Beekhof wrote:
>>>
>>> On Wed, Oct 27, 2010 at 7:32 AM, nozawat<[email protected]>  wrote:
>>>>
>>>> Hi Andrew,
>>>>
>>>>  I send two log files of terminal.log and ha.log.
>>>>  The contents of the terminal log are command results of "ps -ef|grep
>>>> coro"
>>>> and "crm_mon -f -1".
>>>>
>>>>  It is what processing completes normally when what did not understand me
>>>> well watches log though corosync outputs core.
>>>
>>> Oct 27 10:53:12 hb0101 corosync[6695]:   [pcmk  ] plugin.c:1526 ERROR:
>>> send_cluster_msg_raw: Child 7016 spawned to record non-fatal assertion
>>> failure line 1526: rc == 0
>>>
>>> Oct 27 10:53:12 hb0101 corosync[6695]:   [pcmk  ] plugin.c:1526 ERROR:
>>> send_cluster_msg_raw: Message not sent (-1):<copy t="cib"
>>> cib_op="cib_replace" cib_delegated_from="hb0102"
>>> cib_clientname="hb0102" cib_isreplyto="hb0102" original_c
>>>
>>> For some reason
>>>     rc = pcmk_api->totem_mcast(&iovec, 1, TOTEMPG_SAFE);
>>> is returning -1
>>>
>>>
>>> Steve: would this happen if membership was in flux?
>>> I thought only IPC got stopped.
>>>
>>
>> it could
>>
>> If api->totem_mcast sends many messages it can fill up the totem queue and
>> return -1.  The best solution to handling sending messages outside of IPC is
>> to use the schedwrk api.  It will request a piece of work be done when the
>> token is sent (and hopefully there are more spots in the new message queue).
>>  It will continue to schedule work until 0 is retuned by the callback
>> registered with schedwrk.
> 
> what about a while-loop with a sleep in it?
> 
>>

That could cause all kinds of problems with the membership system timers
resulting in wierd behavior and bad membership states.  That is why
there is a schedwrk api.

Regards
-steve

>> Regards
>> -steve
>>
>>>>
>>>> Regards,
>>>> Tomo
>>>>
>>>>
>>>> 2010/10/27 Andrew Beekhof<[email protected]>
>>>>>
>>>>> On Tue, Oct 26, 2010 at 11:22 AM, nozawat<[email protected]>  wrote:
>>>>>>
>>>>>> Hi all,
>>>>>>
>>>>>> My environment is as follows.
>>>>>>  * cluster-glue-1.0.6
>>>>>>  * resource-agents-1.0.3
>>>>>>  * corosync-1.2.8 (svn revision '3059')
>>>>>>  * pacemaker-1.1.3-2f0326468a33acb1ada8fa744c7d36d0b315bd35
>>>>>>
>>>>>> Core file was output by corosync of the DC node when I load a crm file.
>>>>>>
>>>>>> It is the infomation of the core file as follows.
>>>>>
>>>>> log file?
>>>>> you're tripping over an assertion, it would be good to know which one
>>>>>
>>>>>>
>>>>>> [r...@hb0101 ~]$ file /var/lib/corosync/core.32727
>>>>>> /var/lib/corosync/core.32727: ELF 64-bit LSB core file AMD x86-64,
>>>>>> version 1
>>>>>> (SYSV), SVR4-style, from 'corosync'
>>>>>>
>>>>>> [r...@hb0101 ~]$ gdb /usr/sbin/corosync /var/lib/corosync/core.32727
>>>>>> GNU gdb Fedora (6.8-37.el5)
>>>>>> Copyright (C) 2008 Free Software Foundation, Inc.
>>>>>> License GPLv3+: GNU GPL version 3 or later
>>>>>> <http://gnu.org/licenses/gpl.html>
>>>>>> This is free software: you are free to change and redistribute it.
>>>>>> There is NO WARRANTY, to the extent permitted by law.  Type "show
>>>>>> copying"
>>>>>> and "show warranty" for details.
>>>>>> This GDB was configured as "x86_64-redhat-linux-gnu"...
>>>>>> Reading symbols from /usr/lib64/libtotem_pg.so.4...done.
>>>>>> Loaded symbols for /usr/lib64/libtotem_pg.so.4
>>>>>> Reading symbols from /usr/lib64/liblogsys.so.4...done.
>>>>>> Loaded symbols for /usr/lib64/liblogsys.so.4
>>>>>> Reading symbols from /usr/lib64/libcoroipcs.so.4...done.
>>>>>> Loaded symbols for /usr/lib64/libcoroipcs.so.4
>>>>>> Reading symbols from /lib64/librt.so.1...done.
>>>>>> Loaded symbols for /lib64/librt.so.1
>>>>>> Reading symbols from /lib64/libpthread.so.0...done.
>>>>>> Loaded symbols for /lib64/libpthread.so.0
>>>>>> Reading symbols from /lib64/libdl.so.2...done.
>>>>>> Loaded symbols for /lib64/libdl.so.2
>>>>>> Reading symbols from /lib64/libc.so.6...done.
>>>>>> Loaded symbols for /lib64/libc.so.6
>>>>>> Reading symbols from /usr/lib64/libssl3.so...done.
>>>>>> Loaded symbols for /usr/lib64/libssl3.so
>>>>>> Reading symbols from /usr/lib64/libsmime3.so...done.
>>>>>> Loaded symbols for /usr/lib64/libsmime3.so
>>>>>> Reading symbols from /usr/lib64/libnss3.so...done.
>>>>>> Loaded symbols for /usr/lib64/libnss3.so
>>>>>> Reading symbols from /usr/lib64/libnssutil3.so...done.
>>>>>> Loaded symbols for /usr/lib64/libnssutil3.so
>>>>>> Reading symbols from /usr/lib64/libplds4.so...done.
>>>>>> Loaded symbols for /usr/lib64/libplds4.so
>>>>>> Reading symbols from /usr/lib64/libplc4.so...done.
>>>>>> Loaded symbols for /usr/lib64/libplc4.so
>>>>>> Reading symbols from /usr/lib64/libnspr4.so...done.
>>>>>> Loaded symbols for /usr/lib64/libnspr4.so
>>>>>> Reading symbols from /lib64/ld-linux-x86-64.so.2...done.
>>>>>> Loaded symbols for /lib64/ld-linux-x86-64.so.2
>>>>>> Reading symbols from /usr/libexec/lcrso/objdb.lcrso...done.
>>>>>> Loaded symbols for /usr/libexec/lcrso/objdb.lcrso
>>>>>> Reading symbols from /usr/libexec/lcrso/coroparse.lcrso...done.
>>>>>> Loaded symbols for /usr/libexec/lcrso/coroparse.lcrso
>>>>>> Reading symbols from /usr/libexec/lcrso/pacemaker.lcrso...done.
>>>>>> Loaded symbols for /usr/libexec/lcrso/pacemaker.lcrso
>>>>>> Reading symbols from /usr/lib64/libplumb.so.2...done.
>>>>>> Loaded symbols for /usr/lib64/libplumb.so.2
>>>>>> Reading symbols from /usr/lib64/libpils.so.2...done.
>>>>>> Loaded symbols for /usr/lib64/libpils.so.2
>>>>>> Reading symbols from /usr/lib64/libbz2.so.1...done.
>>>>>> Loaded symbols for /usr/lib64/libbz2.so.1
>>>>>> Reading symbols from /usr/lib64/libxslt.so.1...done.
>>>>>> Loaded symbols for /usr/lib64/libxslt.so.1
>>>>>> Reading symbols from /usr/lib/libxml2.so.2...done.
>>>>>> Loaded symbols for /usr/lib/libxml2.so.2
>>>>>> Reading symbols from /lib64/libuuid.so.1...done.
>>>>>> Loaded symbols for /lib64/libuuid.so.1
>>>>>> Reading symbols from /lib64/libpam.so.0...done.
>>>>>> Loaded symbols for /lib64/libpam.so.0
>>>>>> Reading symbols from /lib64/libglib-2.0.so.0...done.
>>>>>> Loaded symbols for /lib64/libglib-2.0.so.0
>>>>>> Reading symbols from /usr/lib64/libz.so.1...done.
>>>>>> Loaded symbols for /usr/lib64/libz.so.1
>>>>>> Reading symbols from /lib64/libm.so.6...done.
>>>>>> Loaded symbols for /lib64/libm.so.6
>>>>>> Reading symbols from /lib64/libaudit.so.0...done.
>>>>>> Loaded symbols for /lib64/libaudit.so.0
>>>>>> Reading symbols from /lib64/libnss_files.so.2...done.
>>>>>> Loaded symbols for /lib64/libnss_files.so.2
>>>>>> Reading symbols from /usr/libexec/lcrso/service_evs.lcrso...done.
>>>>>> Loaded symbols for /usr/libexec/lcrso/service_evs.lcrso
>>>>>> Reading symbols from /usr/libexec/lcrso/service_cfg.lcrso...done.
>>>>>> Loaded symbols for /usr/libexec/lcrso/service_cfg.lcrso
>>>>>> Reading symbols from /usr/libexec/lcrso/service_cpg.lcrso...done.
>>>>>> Loaded symbols for /usr/libexec/lcrso/service_cpg.lcrso
>>>>>> Reading symbols from /usr/libexec/lcrso/service_confdb.lcrso...done.
>>>>>> Loaded symbols for /usr/libexec/lcrso/service_confdb.lcrso
>>>>>> Reading symbols from /usr/libexec/lcrso/service_pload.lcrso...done.
>>>>>> Loaded symbols for /usr/libexec/lcrso/service_pload.lcrso
>>>>>> Reading symbols from /usr/libexec/lcrso/vsf_quorum.lcrso...done.
>>>>>> Loaded symbols for /usr/libexec/lcrso/vsf_quorum.lcrso
>>>>>> Core was generated by `corosync'.
>>>>>> Program terminated with signal 6, Aborted.
>>>>>> [New process 32727]
>>>>>> #0  0x0000003fff430265 in raise () from /lib64/libc.so.6
>>>>>> (gdb) where
>>>>>> #0  0x0000003fff430265 in raise () from /lib64/libc.so.6
>>>>>> #1  0x0000003fff431d10 in abort () from /lib64/libc.so.6
>>>>>> #2  0x00002aaaaabaea0e in send_cluster_msg_raw () from
>>>>>> /usr/libexec/lcrso/pacemaker.lcrso
>>>>>> #3  0x00002aaaaabae4e2 in route_ais_message () from
>>>>>> /usr/libexec/lcrso/pacemaker.lcrso
>>>>>> #4  0x00002aaaaabac13f in pcmk_ipc () from
>>>>>> /usr/libexec/lcrso/pacemaker.lcrso
>>>>>> #5  0x00000039316026cc in pthread_ipc_consumer (conn=<value optimized
>>>>>> out>)
>>>>>> at coroipcs.c:727
>>>>>> #6  0x00000030000064a7 in start_thread () from /lib64/libpthread.so.0
>>>>>> #7  0x0000003fff4d3c2d in clone () from /lib64/libc.so.6
>>>>>> (gdb)
>>>>>>
>>>>>>
>>>>>> Regards,
>>>>>> Tomo
>>>>>>
>>>>>>
>>>>>>
>>>>>> _______________________________________________
>>>>>> Openais mailing list
>>>>>> [email protected]
>>>>>> https://lists.linux-foundation.org/mailman/listinfo/openais
>>>>>>
>>>>
>>>>
>>> _______________________________________________
>>> Openais mailing list
>>> [email protected]
>>> https://lists.linux-foundation.org/mailman/listinfo/openais
>>
>>

_______________________________________________
Openais mailing list
[email protected]
https://lists.linux-foundation.org/mailman/listinfo/openais

Reply via email to