Module Name: src Committed By: manu Date: Thu Sep 27 01:03:40 UTC 2018
Modified Files: src/sys/kern: vfs_trans.c Log Message: Work around deadlock between fstchg and fstcnt When suspending a filesystem in fstrans_setstate(), we wait on fstcnt for threads to finish transactions. While we do this, any thread trying to start a filesystem transaction will wait on fstchg in fstrans_start(), a situation which can deadlock. The wait for fstcnt in fstrans_setstate() can be interrupted by a signal, but the wait for fstchg in fstrans_start() cannot. Once most processes are stuck in fstchg, it is impossible to send a signal to the thread that waits on fstcnt, because no process respond anymore to user input. We fix that by adding a timeout to the wait on fstcnt in fstrans_setstate(). This means suspending a filesystem may fail, but it was already the case when the sleep was interupted by a signal, hence calling function must already handle a possible failure. Fixes kern/53624 To generate a diff of this commit: cvs rdiff -u -r1.48 -r1.49 src/sys/kern/vfs_trans.c Please note that diffs are not public domain; they are subject to the copyright notices on the relevant files.
Modified files: Index: src/sys/kern/vfs_trans.c diff -u src/sys/kern/vfs_trans.c:1.48 src/sys/kern/vfs_trans.c:1.49 --- src/sys/kern/vfs_trans.c:1.48 Sun Jun 18 14:00:17 2017 +++ src/sys/kern/vfs_trans.c Thu Sep 27 01:03:40 2018 @@ -1,4 +1,4 @@ -/* $NetBSD: vfs_trans.c,v 1.48 2017/06/18 14:00:17 hannken Exp $ */ +/* $NetBSD: vfs_trans.c,v 1.49 2018/09/27 01:03:40 manu Exp $ */ /*- * Copyright (c) 2007 The NetBSD Foundation, Inc. @@ -30,7 +30,7 @@ */ #include <sys/cdefs.h> -__KERNEL_RCSID(0, "$NetBSD: vfs_trans.c,v 1.48 2017/06/18 14:00:17 hannken Exp $"); +__KERNEL_RCSID(0, "$NetBSD: vfs_trans.c,v 1.49 2018/09/27 01:03:40 manu Exp $"); /* * File system transaction operations. @@ -42,6 +42,7 @@ __KERNEL_RCSID(0, "$NetBSD: vfs_trans.c, #include <sys/param.h> #include <sys/systm.h> +#include <sys/kernel.h> #include <sys/atomic.h> #include <sys/buf.h> #include <sys/kmem.h> @@ -532,10 +533,14 @@ fstrans_setstate(struct mount *mp, enum /* * All threads see the new state now. * Wait for transactions invalid at this state to leave. + * We cannot wait forever because many processes would + * get stuck waiting for fstcnt in fstrans_start(). This + * is acute when suspending the root filesystem. */ error = 0; while (! state_change_done(mp)) { - error = cv_wait_sig(&fstrans_count_cv, &fstrans_lock); + error = cv_timedwait_sig(&fstrans_count_cv, + &fstrans_lock, hz / 4); if (error) { new_state = fmi->fmi_state = FSTRANS_NORMAL; break;