- **status**: assigned --> fixed
- **Comment**:
commit 8758c96eaf3d62ec99b99a7ae8d3ebf6884793c1
Author: Minh Chau <[email protected]>
Date: Mon Oct 26 13:12:07 2020 +1100
base: Use non-blocking socketpair in sysf_exc module V3 [#3222]
---
** [tickets:#3222] base: write() from socketpair in exec module causes amfnd
hanging**
**Status:** fixed
**Milestone:** 5.20.11
**Created:** Tue Sep 22, 2020 01:02 PM UTC by Minh Hon Chau
**Last Updated:** Tue Sep 22, 2020 01:06 PM UTC
**Owner:** Minh Hon Chau
Reproduction:
- Load amf_demo app with a modification that adds 800 components per SU
- Unlock-in/Unlock both SUs
- pkill amf_demo. Note, this pkill will terminate 800 components at once
Observation:
- amfnd's stuck, and node reboot eventually due to amfwd
- strace shows that all amfnd's threads are stuck at write()
root@SC-1:~# ps -ef | grep osafamfnd
root 329 1 0 22:59 ? 00:00:08
/usr/local/lib/opensaf/osafamfnd --tracemask=0xffffffff
root 18743 11375 0 23:00 pts/4 00:00:00 grep --color=auto osafamfnd
root@SC-1:~# strace -ffp 329
strace: Process 329 attached with 5 threads
[pid 334] write(32, "\0\0\0\0\0\0\0\0\0\0\0\0", 12 <unfinished ...>
[pid 333] write(32, "\0\0\0\0\0\0\0\0\0\0\0\0", 12 <unfinished ...>
[pid 332] write(32, "\0\0\0\0\0\0\0\0\0\0\0\0", 12 <unfinished ...>
[pid 331] write(32, "\0\0\0\0\0\0\0\0\0\0\0\0", 12 <unfinished ...>
[pid 329] write(32, "\0\0\0\0\0\0\0\0\0\0\0\0", 12
(gdb) info threads
Id Target Id Frame
* 1 Thread 0x7f78af005780 (LWP 322) 0x00007f78adb364bd in write ()
at ../sysdeps/unix/syscall-template.S:84
2 Thread 0x7f78abfb3700 (LWP 326) 0x00007f78adb364bd in write ()
at ../sysdeps/unix/syscall-template.S:84
3 Thread 0x7f78aefc2b00 (LWP 327) 0x00007f78adb364bd in write ()
at ../sysdeps/unix/syscall-template.S:84
4 Thread 0x7f78af002b00 (LWP 324) 0x00007f78adb364bd in write ()
at ../sysdeps/unix/syscall-template.S:84
5 Thread 0x7f78aefe2b00 (LWP 325) 0x00007f78adb364bd in write ()
at ../sysdeps/unix/syscall-template.S:84
(gdb) bt
0 0x00007f78adb364bd in write () at ../sysdeps/unix/syscall-template.S:84
1 0x00007f78ae31c742 in ncs_exec_module_signal_hdlr (signal=<optimised out>)
at src/base/sysf_exc_scr.c:111
2 <signal handler called>
---
Sent from sourceforge.net because [email protected] is
subscribed to https://sourceforge.net/p/opensaf/tickets/
To unsubscribe from further messages, a project admin can change settings at
https://sourceforge.net/p/opensaf/admin/tickets/options. Or, if this is a
mailing list, you can unsubscribe from the mailing list._______________________________________________
Opensaf-tickets mailing list
[email protected]
https://lists.sourceforge.net/lists/listinfo/opensaf-tickets