When os_dump_core() raises SIGTERM to bring down all
UML processes this would also trigger the quite complex
do_uml_exitcalls() routine.
This is why UML crashed often while panicking.

Let's make os_dump_core() short and painless by killing all UML
processes with SIGHUP and calling the only sane exit call in this
context (remove_umid_dir()) by hand.

Signed-off-by: Richard Weinberger <rich...@nod.at>
---
 arch/um/include/shared/os.h |    1 +
 arch/um/os-Linux/umid.c     |    2 +-
 arch/um/os-Linux/util.c     |   22 +++++++---------------
 3 files changed, 9 insertions(+), 16 deletions(-)

diff --git a/arch/um/include/shared/os.h b/arch/um/include/shared/os.h
index 83c7c2e..dedb345 100644
--- a/arch/um/include/shared/os.h
+++ b/arch/um/include/shared/os.h
@@ -226,6 +226,7 @@ extern int os_get_thread_area(user_desc_t *info, int pid);
 extern int umid_file_name(char *name, char *buf, int len);
 extern int set_umid(char *name);
 extern char *get_umid(void);
+extern void remove_umid_dir(void);
 
 /* signal.c */
 extern void timer_init(void);
diff --git a/arch/um/os-Linux/umid.c b/arch/um/os-Linux/umid.c
index a27defb..48312a4 100644
--- a/arch/um/os-Linux/umid.c
+++ b/arch/um/os-Linux/umid.c
@@ -382,7 +382,7 @@ __uml_setup("uml_dir=", set_uml_dir,
 "    The location to place the pid and umid files.\n\n"
 );
 
-static void remove_umid_dir(void)
+void remove_umid_dir(void)
 {
        char dir[strlen(uml_dir) + UMID_LEN + 1], err;
 
diff --git a/arch/um/os-Linux/util.c b/arch/um/os-Linux/util.c
index 42827ca..73d32f2 100644
--- a/arch/um/os-Linux/util.c
+++ b/arch/um/os-Linux/util.c
@@ -103,30 +103,21 @@ void os_dump_core(void)
        signal(SIGSEGV, SIG_DFL);
 
        /*
-        * We are about to SIGTERM this entire process group to ensure that
-        * nothing is around to run after the kernel exits.  The
-        * kernel wants to abort, not die through SIGTERM, so we
-        * ignore it here.
+        * Send SIGHUP to kill non-kernel processes, kernel processes
+        * ignore SIGHUP.
+        * We are using SIGHUP because it has the same effekt as SIGTERM
+        * but it does not trigger another signal handlers.
         */
+       kill(0, SIGHUP);
 
-       signal(SIGTERM, SIG_IGN);
-       kill(0, SIGTERM);
        /*
         * Most of the other processes associated with this UML are
         * likely sTopped, so give them a SIGCONT so they see the
-        * SIGTERM.
+        * SIGHUP.
         */
        kill(0, SIGCONT);
 
        /*
-        * Now, having sent signals to everyone but us, make sure they
-        * die by ptrace.  Processes can survive what's been done to
-        * them so far - the mechanism I understand is receiving a
-        * SIGSEGV and segfaulting immediately upon return.  There is
-        * always a SIGSEGV pending, and (I'm guessing) signals are
-        * processed in numeric order so the SIGTERM (signal 15 vs
-        * SIGSEGV being signal 11) is never handled.
-        *
         * Run a waitpid loop until we get some kind of error.
         * Hopefully, it's ECHILD, but there's not a lot we can do if
         * it's something else.  Tell os_kill_ptraced_process not to
@@ -137,5 +128,6 @@ void os_dump_core(void)
        while ((pid = waitpid(-1, NULL, WNOHANG | __WALL)) > 0)
                os_kill_ptraced_process(pid, 0);
 
+       remove_umid_dir();
        uml_abort();
 }
-- 
1.7.4.2


------------------------------------------------------------------------------
WhatsUp Gold - Download Free Network Management Software
The most intuitive, comprehensive, and cost-effective network 
management toolset available today.  Delivers lowest initial 
acquisition cost and overall TCO of any competing solution.
http://p.sf.net/sfu/whatsupgold-sd
_______________________________________________
User-mode-linux-devel mailing list
User-mode-linux-devel@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/user-mode-linux-devel

Reply via email to