On Mon, Apr 27, 2009 at 04:47:27PM -0500, Gary Mills wrote:
> On Sat, Apr 18, 2009 at 04:27:55PM -0500, Gary Mills wrote:
> > We have an IMAP server with ZFS for mailbox storage that has recently
> > become extremely slow on most weekday mornings and afternoons.  When
> > one of these incidents happens, the number of processes increases, the
> > load average increases, but ZFS I/O bandwidth decreases.  Users notice
> > very slow response to IMAP requests.  On the server, even `ps' becomes
> > slow.
> 
> The cause turned out to be this ZFS bug:
> 
>     6596237: Stop looking and start ganging
> 
> Apparently, the ZFS code was searching the free list looking for the
> perfect fit for each write.  With a fragmented pool, this search took
> a very long time, delaying the write.  Eventually, the requests arrived
> faster than writes could be sent to the devices, causing the server
> to be unresponsive.

We also had another problem, due to this ZFS bug:

    6591646: Hang while trying to enter a txg while holding a txg open

This was a deadlock, with one thread blocking hundreds of other
threads.  Our symptom was that all zpool I/O would stop and the `ps'
command would hang.  A reboot was the only way out.

If you have a support contract, Sun will supply an IDR that fixes
both problems.

-- 
-Gary Mills-    -Unix Support-    -U of M Academic Computing and Networking-
_______________________________________________
zfs-discuss mailing list
zfs-discuss@opensolaris.org
http://mail.opensolaris.org/mailman/listinfo/zfs-discuss

Reply via email to