OK, so I believe we have a total of 3 defects:
cpg_leave followed by cpg_finalize (fixed with Chrissie's recent patch)
some other cpg issue relating to cpg_join (unfixed)
The possible racing in the totem callbacks with ckpt's lib_exit_fn (fixed by patch in this thread)

So what I'd like to know for sure is if the expiry_callback backtrace can be reproduced with this patch. Then we can direct our energies towards coming up with a test case for the remaining cpg issue.

Regards
-steve

On Sun, 2009-03-22 at 00:31 +0100, Lars Marowsky-Bree wrote:
> On 2009-03-21T12:45:57, Steven Dake <[email protected]> wrote:
>
> > see subject
>
> My current trace with this is back to square 1:
>
> (gdb) bt
> #0  0x00007f8aba35b13c in notify_lib_joinlist (gi=0x763e10, conn=0x0,
>     joined_list_entries=1, joined_list=0x7fffcb5ec490, left_list_entries=0,
>     left_list=0x0, id=4) at cpg.c:386
> #1  0x00007f8aba35c7c4 in do_proc_join (name=0x7fffcb5ec560, pid=9100,
>     nodeid=5, reason=1) at cpg.c:757
> #2  0x00007f8aba35ca0d in message_handler_req_exec_cpg_procjoin
>     (message=0x7fffcb5ec550, nodeid=5) at cpg.c:804
> #3  0x000000000041e42e in deliver_fn (nodeid=5, iovec=0x7fffcb5ec710,
>     iov_len=1, endian_conversion_required=0) at main.c:415
> #4  0x00000000004182ca in app_deliver_fn (nodeid=5, iovec=0x7fffcb5ec700,
>     iov_len=1, endian_conversion_required=0) at totempg.c:443
> #5  0x0000000000417f5e in totempg_deliver_fn (nodeid=5, iovec=0x758188,
>     iov_len=1, endian_conversion_required=0) at totempg.c:587
> #6  0x0000000000417232 in totemmrp_deliver_fn (nodeid=5, iovec=0x758188,
>     iov_len=3, endian_conversion_required=0) at totemmrp.c:82
> #7  0x0000000000415107 in messages_deliver_to_app (instance=0x7f8ac34bd010,
>     skip=0, end_point=2257) at totemsrp.c:3553
> #8  0x0000000000414cdd in message_handler_orf_token (instance=0x7f8ac34bd010,
>     msg=0x7f8abc007514, msg_len=70, endian_conversion_needed=0) at totemsrp.c:3425
> #9  0x000000000041706d in main_deliver_fn (context=0x7f8ac34bd010,
>     msg=0x7f8abc007514, msg_len=70) at totemsrp.c:4136
> #10 0x000000000040af4c in none_token_recv (rrp_instance=0x7f8abc006970,
>     iface_no=0, context=0x7f8ac34bd010, msg=0x7f8abc007514, msg_len=70,
>     token_seq=80683) at totemrrp.c:506
> #11 0x000000000040c91e in rrp_deliver_fn (context=0x7f8abc006e50,
>     msg=0x7f8abc007514, msg_len=70) at totemrrp.c:1308
> #12 0x0000000000409025 in net_deliver_fn (handle=0, fd=3, revents=1,
>     data=0x7f8abc006e90) at totemnet.c:676
> #13 0x000000000040738a in poll_run (handle=0) at aispoll.c:402
> ---Type <return> to continue, or q <return> to quit---
> #14 0x000000000041ed7d in main (argc=1, argv=0x7fffcb5efd58) at main.c:634
>
> I'll let it run a bit and see if I can get a different trace.
>
>
> Regards,
>     Lars
> _______________________________________________
Openais mailing list
[email protected]
https://lists.linux-foundation.org/mailman/listinfo/openais
