Re: ZFS stalls -- and maybe we should be talking about defaults?

2013-03-08 Thread Borja Marcos
On Mar 5, 2013, at 11:09 PM, Jeremy Chadwick wrote: - Disks are GPT and are *partitioned, and ZFS refers to the partitions not the raw disk -- this matters (honest, it really does; the ZFS code handles things differently with raw disks) Not on FreeBSD as far I can see. My statement

Re: ZFS stalls -- and maybe we should be talking about defaults?

2013-03-07 Thread Karl Denninger
On 3/7/2013 1:21 AM, Peter Jeremy wrote: On 2013-Mar-04 16:48:18 -0600, Karl Denninger k...@denninger.net wrote: The subject machine in question has 12GB of RAM and dual Xeon 5500-series processors. It also has an ARECA 1680ix in it with 2GB of local cache and the BBU for it. The ZFS

Re: ZFS stalls -- and maybe we should be talking about defaults?

2013-03-07 Thread Steven Hartland
- Original Message - From: Karl Denninger k...@denninger.net Where I am right now is this: 1. I *CANNOT* reproduce the spins on the test machine with Postgres stopped in any way. Even with multiple ZFS send/recv copies going on and the load average north of 20 (due to all the geli

Re: ZFS stalls -- and maybe we should be talking about defaults?

2013-03-07 Thread Karl Denninger
On 3/7/2013 12:57 PM, Steven Hartland wrote: - Original Message - From: Karl Denninger k...@denninger.net Where I am right now is this: 1. I *CANNOT* reproduce the spins on the test machine with Postgres stopped in any way. Even with multiple ZFS send/recv copies going on and the

Re: ZFS stalls -- and maybe we should be talking about defaults?

2013-03-07 Thread Steven Hartland
- Original Message - From: Karl Denninger k...@denninger.net To: freebsd-stable@freebsd.org Sent: Thursday, March 07, 2013 7:07 PM Subject: Re: ZFS stalls -- and maybe we should be talking about defaults? On 3/7/2013 12:57 PM, Steven Hartland wrote: - Original Message

Re: ZFS stalls -- and maybe we should be talking about defaults?

2013-03-07 Thread Karl Denninger
On 3/7/2013 1:27 PM, Steven Hartland wrote: - Original Message - From: Karl Denninger k...@denninger.net To: freebsd-stable@freebsd.org Sent: Thursday, March 07, 2013 7:07 PM Subject: Re: ZFS stalls -- and maybe we should be talking about defaults? On 3/7/2013 12:57 PM, Steven

Re: ZFS stalls -- and maybe we should be talking about defaults?

2013-03-07 Thread Steven Hartland
- Original Message - From: Karl Denninger k...@denninger.net I will test that but first I have to get the test machine to reliably stall so I know I'm not chasing my tail. Very sensible. Assuming you can reproduce it, one thing that might be interesting to try is to eliminate all

Re: ZFS stalls -- and maybe we should be talking about defaults?

2013-03-06 Thread Daniel Kalchev
On 06.03.13 02:42, Steven Hartland wrote: - Original Message - From: Daniel Kalchev On Mar 6, 2013, at 12:09 AM, Jeremy Chadwick j...@koitsu.org wrote: I say that knowing lots of people use ZFS-on-root, which is great -- I just wonder how many of them have tested all the crazy

Re: ZFS stalls -- and maybe we should be talking about defaults?

2013-03-06 Thread John-Mark Gurney
Karl Denninger wrote this message on Tue, Mar 05, 2013 at 06:56 -0600: When it happens on my system anything that is CPU-bound continues to execute. I can switch consoles and network I/O also works. If I have an iostat running at the time all I/O counters go to and remain at zero while the

Re: ZFS stalls -- and maybe we should be talking about defaults?

2013-03-06 Thread Peter Jeremy
On 2013-Mar-04 16:48:18 -0600, Karl Denninger k...@denninger.net wrote: The subject machine in question has 12GB of RAM and dual Xeon 5500-series processors. It also has an ARECA 1680ix in it with 2GB of local cache and the BBU for it. The ZFS spindles are all exported as JBOD drives. I set up

Re: ZFS stalls -- and maybe we should be talking about defaults?

2013-03-05 Thread Steven Hartland
- Original Message - From: Jeremy Chadwick j...@koitsu.org To: Ben Morrow b...@morrow.me.uk Cc: freebsd-stable@freebsd.org Sent: Tuesday, March 05, 2013 5:32 AM Subject: Re: ZFS stalls -- and maybe we should be talking about defaults? On Tue, Mar 05, 2013 at 05:05:47AM +, Ben

Re: ZFS stalls -- and maybe we should be talking about defaults?

2013-03-05 Thread Jeremy Chadwick
On Tue, Mar 05, 2013 at 09:12:47AM -, Steven Hartland wrote: - Original Message - From: Jeremy Chadwick j...@koitsu.org To: Ben Morrow b...@morrow.me.uk Cc: freebsd-stable@freebsd.org Sent: Tuesday, March 05, 2013 5:32 AM Subject: Re: ZFS stalls -- and maybe we should

Re: ZFS stalls -- and maybe we should be talking about defaults?

2013-03-05 Thread Andriy Gapon
Completely unrelated to the main thread: on 05/03/2013 07:32 Jeremy Chadwick said the following: That said, I still do not recommend ZFS for a root filesystem Why? (this biting people still happens even today) What exactly? - Disks are GPT and are *partitioned, and ZFS refers to the

Re: ZFS stalls -- and maybe we should be talking about defaults?

2013-03-05 Thread Karl Denninger
: ZFS stalls -- and maybe we should be talking about defaults? On Tue, Mar 05, 2013 at 05:05:47AM +, Ben Morrow wrote: Quoth Karl Denninger k...@denninger.net: Note that the machine is not booting from ZFS -- it is booting from and has its swap on a UFS 2-drive mirror (handled by the disk

Re: ZFS stalls -- and maybe we should be talking about defaults?

2013-03-05 Thread Gary Palmer
On Tue, Mar 05, 2013 at 12:40:38AM -0500, Garrett Wollman wrote: In article 8c68812328e3483ba9786ef155911...@multiplay.co.uk, kill...@multiplay.co.uk writes: Now interesting you should say that I've seen a stall recently on ZFS only box running on 6 x SSD RAIDZ2. The stall was caused by

Re: ZFS stalls -- and maybe we should be talking about defaults?

2013-03-05 Thread Freddie Cash
On Tue, Mar 5, 2013 at 7:22 AM, Gary Palmer gpal...@freebsd.org wrote: Just as a note that there was a page I read in the past few months that pointed out that having a huge ARC may not always be in the best interests of the system. Some operation on the filesystem (I forget what, apologies)

Re: ZFS stalls -- and maybe we should be talking about defaults?

2013-03-05 Thread Jeremy Chadwick
On Tue, Mar 05, 2013 at 01:09:41PM +0200, Andriy Gapon wrote: Completely unrelated to the main thread: on 05/03/2013 07:32 Jeremy Chadwick said the following: That said, I still do not recommend ZFS for a root filesystem Why? Too long a history of problems with it and weird edge cases

Re: ZFS stalls -- and maybe we should be talking about defaults?

2013-03-05 Thread Freddie Cash
On Tue, Mar 5, 2013 at 2:09 PM, Jeremy Chadwick j...@koitsu.org wrote: On Tue, Mar 05, 2013 at 01:09:41PM +0200, Andriy Gapon wrote: - Disks are GPT and are *partitioned, and ZFS refers to the partitions not the raw disk -- this matters (honest, it really does; the ZFS code handles

Re: ZFS stalls -- and maybe we should be talking about defaults?

2013-03-05 Thread Jeremy Chadwick
On Tue, Mar 05, 2013 at 02:18:30PM -0800, Freddie Cash wrote: On Tue, Mar 5, 2013 at 2:09 PM, Jeremy Chadwick j...@koitsu.org wrote: On Tue, Mar 05, 2013 at 01:09:41PM +0200, Andriy Gapon wrote: - Disks are GPT and are *partitioned, and ZFS refers to the partitions not the raw

Re: ZFS stalls -- and maybe we should be talking about defaults?

2013-03-05 Thread Daniel Kalchev
On Mar 6, 2013, at 12:09 AM, Jeremy Chadwick j...@koitsu.org wrote: I say that knowing lots of people use ZFS-on-root, which is great -- I just wonder how many of them have tested all the crazy scenarios and then tried to boot from things. I have verified that ZFS-on-root works reliably in

Re: ZFS stalls -- and maybe we should be talking about defaults?

2013-03-05 Thread Daniel Kalchev
On Mar 5, 2013, at 8:17 PM, Freddie Cash fjwc...@gmail.com wrote: ZFS send/recv would eventually complete, but what used to take 15-20 minutes would take 6-8 hours to complete. I've reduced the ARC to only 32 GB, with arc_meta set to 28 GB, and things are running much smoother now

Re: ZFS stalls -- and maybe we should be talking about defaults?

2013-03-05 Thread Steven Hartland
- Original Message - From: Daniel Kalchev On Mar 6, 2013, at 12:09 AM, Jeremy Chadwick j...@koitsu.org wrote: I say that knowing lots of people use ZFS-on-root, which is great -- I just wonder how many of them have tested all the crazy scenarios and then tried to boot from things.

Re: ZFS stalls -- and maybe we should be talking about defaults?

2013-03-05 Thread Ben Morrow
Quoth Steven Hartland kill...@multiplay.co.uk: - Original Message - From: Daniel Kalchev On Mar 6, 2013, at 12:09 AM, Jeremy Chadwick j...@koitsu.org wrote: I say that knowing lots of people use ZFS-on-root, which is great -- I just wonder how many of them have tested all the

Re: ZFS stalls -- and maybe we should be talking about defaults?

2013-03-05 Thread Jeremy Chadwick
On Tue, Mar 05, 2013 at 06:56:02AM -0600, Karl Denninger wrote: { I've snipped lots of text. For those who are reading this follow-up } { and wish to read the snipped portions, please see this URL: } { http://lists.freebsd.org/pipermail/freebsd-stable/2013-March/072696.html } 1. Is

Re: ZFS stalls -- and maybe we should be talking about defaults?

2013-03-05 Thread Jeremy Chadwick
On Tue, Mar 05, 2013 at 09:08:09PM -0800, Jeremy Chadwick wrote: * How long the stall is in duration (ex. if there's some way to roughly calculate this using date in a shell script) They're variable. Some last fractions of a second and are not really all that noticeable unless you

ZFS stalls -- and maybe we should be talking about defaults?

2013-03-04 Thread Karl Denninger
Well now this is interesting. I have converted a significant number of filesystems to ZFS over the last week or so and have noted a few things. A couple of them aren't so good. The subject machine in question has 12GB of RAM and dual Xeon 5500-series processors. It also has an ARECA 1680ix in

Re: ZFS stalls -- and maybe we should be talking about defaults?

2013-03-04 Thread Steven Hartland
? Regards Steve - Original Message - From: Karl Denninger k...@denninger.net To: freebsd-stable@freebsd.org Sent: Monday, March 04, 2013 10:48 PM Subject: ZFS stalls -- and maybe we should be talking about defaults? Well now this is interesting. I have converted a significant number

Re: ZFS stalls -- and maybe we should be talking about defaults?

2013-03-04 Thread Dennis Glatting
I get stalls with 256GB of RAM with arc_max=64G (my limit is usually 25% ) on a 64 core system with 20 new 3TB Seagate disks under LSI2008 chips without much load. Interestingly pbzip2 consistently created a problem on a volume whereas gzip does not. Here, stalls happen across several systems

Re: ZFS stalls -- and maybe we should be talking about defaults?

2013-03-04 Thread Karl Denninger
changed fixed it or does it require both? Regards Steve - Original Message - From: Karl Denninger k...@denninger.net To: freebsd-stable@freebsd.org Sent: Monday, March 04, 2013 10:48 PM Subject: ZFS stalls -- and maybe we should be talking about defaults? Well now

Re: ZFS stalls -- and maybe we should be talking about defaults?

2013-03-04 Thread Karl Denninger
Stick this in /boot/loader.conf and see if your lockups goes away: vfs.zfs.write_limit_override=102400 I've got a sentinal running that watches for zero-bandwidth zpool iostat 5s that has been running for close to 12 hours now and with the two tunables I changed it doesn't appear to be

Re: ZFS stalls -- and maybe we should be talking about defaults?

2013-03-04 Thread Steven Hartland
- Original Message - From: Karl Denninger k...@denninger.net Stick this in /boot/loader.conf and see if your lockups goes away: vfs.zfs.write_limit_override=102400 ... If it turns out that the write_limit_override tunable is the one responsible for stopping the hangs I can drop

Re: ZFS stalls -- and maybe we should be talking about defaults?

2013-03-04 Thread Karl Denninger
On 3/4/2013 9:25 PM, Steven Hartland wrote: - Original Message - From: Karl Denninger k...@denninger.net Stick this in /boot/loader.conf and see if your lockups goes away: vfs.zfs.write_limit_override=102400 ... If it turns out that the write_limit_override tunable is the one

Re: ZFS stalls -- and maybe we should be talking about defaults?

2013-03-04 Thread Dennis Glatting
On Mon, 2013-03-04 at 20:58 -0600, Karl Denninger wrote: Stick this in /boot/loader.conf and see if your lockups goes away: vfs.zfs.write_limit_override=102400 K. I've got a sentinal running that watches for zero-bandwidth zpool iostat 5s that has been running for close to 12 hours

Re: ZFS stalls -- and maybe we should be talking about defaults?

2013-03-04 Thread Dennis Glatting
On Tue, 2013-03-05 at 03:25 +, Steven Hartland wrote: - Original Message - From: Karl Denninger k...@denninger.net Stick this in /boot/loader.conf and see if your lockups goes away: vfs.zfs.write_limit_override=102400 ... If it turns out that the

Re: ZFS stalls -- and maybe we should be talking about defaults?

2013-03-04 Thread Steven Hartland
- Original Message - From: Karl Denninger k...@denninger.net Then it happened I thought the machine had wedged, but minutes (not seconds) later, everything sprung into action again. That's exactly what I can reproduce here; the stalls are anywhere from a few seconds to well north of a

Re: ZFS stalls -- and maybe we should be talking about defaults?

2013-03-04 Thread Karl Denninger
On 3/4/2013 10:01 PM, Steven Hartland wrote: - Original Message - From: Karl Denninger k...@denninger.net Then it happened I thought the machine had wedged, but minutes (not seconds) later, everything sprung into action again. That's exactly what I can reproduce here; the stalls are

Re: ZFS stalls -- and maybe we should be talking about defaults?

2013-03-04 Thread Ben Morrow
Quoth Karl Denninger k...@denninger.net: Note that the machine is not booting from ZFS -- it is booting from and has its swap on a UFS 2-drive mirror (handled by the disk adapter; looks like a single da0 drive to the OS) and that drive stalls as well when it freezes. It's definitely a

Re: ZFS stalls -- and maybe we should be talking about defaults?

2013-03-04 Thread Jeremy Chadwick
On Tue, Mar 05, 2013 at 05:05:47AM +, Ben Morrow wrote: Quoth Karl Denninger k...@denninger.net: Note that the machine is not booting from ZFS -- it is booting from and has its swap on a UFS 2-drive mirror (handled by the disk adapter; looks like a single da0 drive to the OS) and

Re: ZFS stalls -- and maybe we should be talking about defaults?

2013-03-04 Thread Garrett Wollman
In article 8c68812328e3483ba9786ef155911...@multiplay.co.uk, kill...@multiplay.co.uk writes: Now interesting you should say that I've seen a stall recently on ZFS only box running on 6 x SSD RAIDZ2. The stall was caused by fairly large mysql import, with nothing else running. Then it happened I