On Tue, Mar 09, 2010 at 01:57:07PM +0100, Borja Marcos wrote: > > On Mar 9, 2010, at 1:29 PM, Pawel Jakub Dawidek wrote: > > > On Tue, Mar 09, 2010 at 10:15:53AM +0100, Stefan Bethke wrote: > >> Over the past couple of months, I've more or less regularly observed > >> machines having more and more processes stuck in the zfs wchan. The > >> processes never recover from that, and trying to reboot only gets the > >> entire system stuck, without any console messages. I can enter the > >> debugger, and I have saved a couple of dumps. > >> > >> The situation seems to be triggered by zfs receive'ing snapshots from the > >> sister machine (both synchronize their active ZFS filesystems to each > >> other, using zfs send and zfs receive). It appears it's the receiving > >> causing trouble. > >> > >> Both machines run 8-stable from mid-February, with a single-disk ZFS pool, > >> with ARC limited to 512M, prefetch and ZIL disabled via loader.conf. > >> > >> What should I be looking at to further diagnose? > > > > What kind of hardware do you have there? There is 3-way deadlock I've a > > fix for which would be hard to trigger on single or dual core machines. > > > > Feel free to try the fix: > > > > http://people.freebsd.org/~pjd/patches/zfs_3way_deadlock.patch > > Maybe related to the deadlock I reported when I was receiving an incremental > snapshot while the target dataset was being read?
Could be. This deadlock is in general related to zfs recv functionality. -- Pawel Jakub Dawidek http://www.wheelsystems.com [email protected] http://www.FreeBSD.org FreeBSD committer Am I Evil? Yes, I Am!
pgpWv81oJw3Zu.pgp
Description: PGP signature
