On 11/12/2010 12:38 AM, Andrew Beekhof wrote:
> On Thu, Nov 11, 2010 at 4:39 PM, Steven Dake <[email protected]> wrote:
>> On 11/11/2010 02:35 AM, Andrew Beekhof wrote:
>>> On Wed, Oct 27, 2010 at 5:15 PM, Steven Dake <[email protected]> wrote:
>>>> On 10/26/2010 11:17 PM, Andrew Beekhof wrote:
>>>>>
>>>>> On Wed, Oct 27, 2010 at 7:32 AM, nozawat<[email protected]>  wrote:
>>>>>>
>>>>>> Hi Andrew,
>>>>>>
>>>>>>  I send two log files of terminal.log and ha.log.
>>>>>>  The contents of the terminal log are command results of "ps -ef|grep
>>>>>> coro"
>>>>>> and "crm_mon -f -1".
>>>>>>
>>>>>>  It is what processing completes normally when what did not understand me
>>>>>> well watches log though corosync outputs core.
>>>>>
>>>>> Oct 27 10:53:12 hb0101 corosync[6695]:   [pcmk  ] plugin.c:1526 ERROR:
>>>>> send_cluster_msg_raw: Child 7016 spawned to record non-fatal assertion
>>>>> failure line 1526: rc == 0
>>>>>
>>>>> Oct 27 10:53:12 hb0101 corosync[6695]:   [pcmk  ] plugin.c:1526 ERROR:
>>>>> send_cluster_msg_raw: Message not sent (-1):<copy t="cib"
>>>>> cib_op="cib_replace" cib_delegated_from="hb0102"
>>>>> cib_clientname="hb0102" cib_isreplyto="hb0102" original_c
>>>>>
>>>>> For some reason
>>>>>     rc = pcmk_api->totem_mcast(&iovec, 1, TOTEMPG_SAFE);
>>>>> is returning -1
>>>>>
>>>>>
>>>>> Steve: would this happen if membership was in flux?
>>>>> I thought only IPC got stopped.
>>>>>
>>>>
>>>> it could
>>>>
>>>> If api->totem_mcast sends many messages it can fill up the totem queue and
>>>> return -1.  The best solution to handling sending messages outside of IPC 
>>>> is
>>>> to use the schedwrk api.  It will request a piece of work be done when the
>>>> token is sent (and hopefully there are more spots in the new message 
>>>> queue).
>>>>  It will continue to schedule work until 0 is retuned by the callback
>>>> registered with schedwrk.
>>>
>>> what about a while-loop with a sleep in it?
>>>
>>>>
>>
>> That could cause all kinds of problems with the membership system timers
>> resulting in wierd behavior and bad membership states.  That is why
>> there is a schedwrk api.
> 
> Looks painful.
> I think I'd prefer people moved to the MCP instead.
> 

Agree,

All usage of the plugin should be discouraged in favor of MCP.

Regards
-steve

>>
>> Regards
>> -steve
>>
>>>> Regards
>>>> -steve
>>>>
>>>>>>
>>>>>> Regards,
>>>>>> Tomo
>>>>>>
>>>>>>
>>>>>> 2010/10/27 Andrew Beekhof<[email protected]>
>>>>>>>
>>>>>>> On Tue, Oct 26, 2010 at 11:22 AM, nozawat<[email protected]>  wrote:
>>>>>>>>
>>>>>>>> Hi all,
>>>>>>>>
>>>>>>>> My environment is as follows.
>>>>>>>>  * cluster-glue-1.0.6
>>>>>>>>  * resource-agents-1.0.3
>>>>>>>>  * corosync-1.2.8 (svn revision '3059')
>>>>>>>>  * pacemaker-1.1.3-2f0326468a33acb1ada8fa744c7d36d0b315bd35
>>>>>>>>
>>>>>>>> Core file was output by corosync of the DC node when I load a crm file.
>>>>>>>>
>>>>>>>> It is the infomation of the core file as follows.
>>>>>>>
>>>>>>> log file?
>>>>>>> you're tripping over an assertion, it would be good to know which one
>>>>>>>
>>>>>>>>
>>>>>>>> [r...@hb0101 ~]$ file /var/lib/corosync/core.32727
>>>>>>>> /var/lib/corosync/core.32727: ELF 64-bit LSB core file AMD x86-64,
>>>>>>>> version 1
>>>>>>>> (SYSV), SVR4-style, from 'corosync'
>>>>>>>>
>>>>>>>> [r...@hb0101 ~]$ gdb /usr/sbin/corosync /var/lib/corosync/core.32727
>>>>>>>> GNU gdb Fedora (6.8-37.el5)
>>>>>>>> Copyright (C) 2008 Free Software Foundation, Inc.
>>>>>>>> License GPLv3+: GNU GPL version 3 or later
>>>>>>>> <http://gnu.org/licenses/gpl.html>
>>>>>>>> This is free software: you are free to change and redistribute it.
>>>>>>>> There is NO WARRANTY, to the extent permitted by law.  Type "show
>>>>>>>> copying"
>>>>>>>> and "show warranty" for details.
>>>>>>>> This GDB was configured as "x86_64-redhat-linux-gnu"...
>>>>>>>> Reading symbols from /usr/lib64/libtotem_pg.so.4...done.
>>>>>>>> Loaded symbols for /usr/lib64/libtotem_pg.so.4
>>>>>>>> Reading symbols from /usr/lib64/liblogsys.so.4...done.
>>>>>>>> Loaded symbols for /usr/lib64/liblogsys.so.4
>>>>>>>> Reading symbols from /usr/lib64/libcoroipcs.so.4...done.
>>>>>>>> Loaded symbols for /usr/lib64/libcoroipcs.so.4
>>>>>>>> Reading symbols from /lib64/librt.so.1...done.
>>>>>>>> Loaded symbols for /lib64/librt.so.1
>>>>>>>> Reading symbols from /lib64/libpthread.so.0...done.
>>>>>>>> Loaded symbols for /lib64/libpthread.so.0
>>>>>>>> Reading symbols from /lib64/libdl.so.2...done.
>>>>>>>> Loaded symbols for /lib64/libdl.so.2
>>>>>>>> Reading symbols from /lib64/libc.so.6...done.
>>>>>>>> Loaded symbols for /lib64/libc.so.6
>>>>>>>> Reading symbols from /usr/lib64/libssl3.so...done.
>>>>>>>> Loaded symbols for /usr/lib64/libssl3.so
>>>>>>>> Reading symbols from /usr/lib64/libsmime3.so...done.
>>>>>>>> Loaded symbols for /usr/lib64/libsmime3.so
>>>>>>>> Reading symbols from /usr/lib64/libnss3.so...done.
>>>>>>>> Loaded symbols for /usr/lib64/libnss3.so
>>>>>>>> Reading symbols from /usr/lib64/libnssutil3.so...done.
>>>>>>>> Loaded symbols for /usr/lib64/libnssutil3.so
>>>>>>>> Reading symbols from /usr/lib64/libplds4.so...done.
>>>>>>>> Loaded symbols for /usr/lib64/libplds4.so
>>>>>>>> Reading symbols from /usr/lib64/libplc4.so...done.
>>>>>>>> Loaded symbols for /usr/lib64/libplc4.so
>>>>>>>> Reading symbols from /usr/lib64/libnspr4.so...done.
>>>>>>>> Loaded symbols for /usr/lib64/libnspr4.so
>>>>>>>> Reading symbols from /lib64/ld-linux-x86-64.so.2...done.
>>>>>>>> Loaded symbols for /lib64/ld-linux-x86-64.so.2
>>>>>>>> Reading symbols from /usr/libexec/lcrso/objdb.lcrso...done.
>>>>>>>> Loaded symbols for /usr/libexec/lcrso/objdb.lcrso
>>>>>>>> Reading symbols from /usr/libexec/lcrso/coroparse.lcrso...done.
>>>>>>>> Loaded symbols for /usr/libexec/lcrso/coroparse.lcrso
>>>>>>>> Reading symbols from /usr/libexec/lcrso/pacemaker.lcrso...done.
>>>>>>>> Loaded symbols for /usr/libexec/lcrso/pacemaker.lcrso
>>>>>>>> Reading symbols from /usr/lib64/libplumb.so.2...done.
>>>>>>>> Loaded symbols for /usr/lib64/libplumb.so.2
>>>>>>>> Reading symbols from /usr/lib64/libpils.so.2...done.
>>>>>>>> Loaded symbols for /usr/lib64/libpils.so.2
>>>>>>>> Reading symbols from /usr/lib64/libbz2.so.1...done.
>>>>>>>> Loaded symbols for /usr/lib64/libbz2.so.1
>>>>>>>> Reading symbols from /usr/lib64/libxslt.so.1...done.
>>>>>>>> Loaded symbols for /usr/lib64/libxslt.so.1
>>>>>>>> Reading symbols from /usr/lib/libxml2.so.2...done.
>>>>>>>> Loaded symbols for /usr/lib/libxml2.so.2
>>>>>>>> Reading symbols from /lib64/libuuid.so.1...done.
>>>>>>>> Loaded symbols for /lib64/libuuid.so.1
>>>>>>>> Reading symbols from /lib64/libpam.so.0...done.
>>>>>>>> Loaded symbols for /lib64/libpam.so.0
>>>>>>>> Reading symbols from /lib64/libglib-2.0.so.0...done.
>>>>>>>> Loaded symbols for /lib64/libglib-2.0.so.0
>>>>>>>> Reading symbols from /usr/lib64/libz.so.1...done.
>>>>>>>> Loaded symbols for /usr/lib64/libz.so.1
>>>>>>>> Reading symbols from /lib64/libm.so.6...done.
>>>>>>>> Loaded symbols for /lib64/libm.so.6
>>>>>>>> Reading symbols from /lib64/libaudit.so.0...done.
>>>>>>>> Loaded symbols for /lib64/libaudit.so.0
>>>>>>>> Reading symbols from /lib64/libnss_files.so.2...done.
>>>>>>>> Loaded symbols for /lib64/libnss_files.so.2
>>>>>>>> Reading symbols from /usr/libexec/lcrso/service_evs.lcrso...done.
>>>>>>>> Loaded symbols for /usr/libexec/lcrso/service_evs.lcrso
>>>>>>>> Reading symbols from /usr/libexec/lcrso/service_cfg.lcrso...done.
>>>>>>>> Loaded symbols for /usr/libexec/lcrso/service_cfg.lcrso
>>>>>>>> Reading symbols from /usr/libexec/lcrso/service_cpg.lcrso...done.
>>>>>>>> Loaded symbols for /usr/libexec/lcrso/service_cpg.lcrso
>>>>>>>> Reading symbols from /usr/libexec/lcrso/service_confdb.lcrso...done.
>>>>>>>> Loaded symbols for /usr/libexec/lcrso/service_confdb.lcrso
>>>>>>>> Reading symbols from /usr/libexec/lcrso/service_pload.lcrso...done.
>>>>>>>> Loaded symbols for /usr/libexec/lcrso/service_pload.lcrso
>>>>>>>> Reading symbols from /usr/libexec/lcrso/vsf_quorum.lcrso...done.
>>>>>>>> Loaded symbols for /usr/libexec/lcrso/vsf_quorum.lcrso
>>>>>>>> Core was generated by `corosync'.
>>>>>>>> Program terminated with signal 6, Aborted.
>>>>>>>> [New process 32727]
>>>>>>>> #0  0x0000003fff430265 in raise () from /lib64/libc.so.6
>>>>>>>> (gdb) where
>>>>>>>> #0  0x0000003fff430265 in raise () from /lib64/libc.so.6
>>>>>>>> #1  0x0000003fff431d10 in abort () from /lib64/libc.so.6
>>>>>>>> #2  0x00002aaaaabaea0e in send_cluster_msg_raw () from
>>>>>>>> /usr/libexec/lcrso/pacemaker.lcrso
>>>>>>>> #3  0x00002aaaaabae4e2 in route_ais_message () from
>>>>>>>> /usr/libexec/lcrso/pacemaker.lcrso
>>>>>>>> #4  0x00002aaaaabac13f in pcmk_ipc () from
>>>>>>>> /usr/libexec/lcrso/pacemaker.lcrso
>>>>>>>> #5  0x00000039316026cc in pthread_ipc_consumer (conn=<value optimized
>>>>>>>> out>)
>>>>>>>> at coroipcs.c:727
>>>>>>>> #6  0x00000030000064a7 in start_thread () from /lib64/libpthread.so.0
>>>>>>>> #7  0x0000003fff4d3c2d in clone () from /lib64/libc.so.6
>>>>>>>> (gdb)
>>>>>>>>
>>>>>>>>
>>>>>>>> Regards,
>>>>>>>> Tomo
>>>>>>>>
>>>>>>>>
>>>>>>>>
>>>>>>>> _______________________________________________
>>>>>>>> Openais mailing list
>>>>>>>> [email protected]
>>>>>>>> https://lists.linux-foundation.org/mailman/listinfo/openais
>>>>>>>>
>>>>>>
>>>>>>
>>>>> _______________________________________________
>>>>> Openais mailing list
>>>>> [email protected]
>>>>> https://lists.linux-foundation.org/mailman/listinfo/openais
>>>>
>>>>
>>
>>

_______________________________________________
Openais mailing list
[email protected]
https://lists.linux-foundation.org/mailman/listinfo/openais

Reply via email to