Re: [zfs-discuss] CR# 6574286, remove slog device

Dave Tue, 19 May 2009 21:46:54 -0700


Eric Schrock wrote:

On May 19, 2009, at 12:57 PM, Dave wrote:
If you don't have mirrored slogs and the slog fails, you may lose any data that was in a txg group waiting to be committed to the main pool vdevs - you will never know if you lost any data or not.
None of the above is correct. First off, you only lose data if the slog fails *and* the machine panics/reboots before the transaction group is synced (5-30s by default depending on load, though there is a CR filed to immediately sync on slog failure). You will not lose any data once the txg is synced - syncing the transaction group does not require reading from the slog, so failure of the log device does not impact normal operation.

Thanks for correcting my statement. There is still a potential window of approximately 60 seconds for data loss if there are 2 transaction groups waiting to sync with a 30 second txg commit timer, correct?
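
To make the arithmetic behind that question explicit, here is a minimal sketch. It only reproduces the estimate being asked about and does not describe how ZFS actually schedules txg syncs. The function name is hypothetical, and the assumption that every pending txg waits a full timer interval before syncing is mine, not something confirmed in this thread.

```python
def worst_case_exposure_seconds(pending_txgs: int, txg_timeout_s: float) -> float:
    """Rough upper bound on how long logged writes might exist only on the slog.

    Assumption (not confirmed in this thread): each pending transaction
    group waits one full timer interval before it is synced to the pool.
    """
    if pending_txgs < 0 or txg_timeout_s < 0:
        raise ValueError("pending_txgs and txg_timeout_s must be non-negative")
    return pending_txgs * txg_timeout_s


if __name__ == "__main__":
    # The 5-30s range is the default sync interval quoted above.
    for timeout in (5, 30):
        window = worst_case_exposure_seconds(2, timeout)
        print(f"2 pending txgs, {timeout}s timer: up to {window}s")
```

Under that assumption, data is only at risk if the slog fails and the machine panics or reboots inside this window.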

The latter half of the above statement is also incorrect. Should you find yourself in the double-failure described above, you will get an FMA fault that describes the nature of the problem and the implications. If the slog is truly dead, you can 'zpool clear' (or 'fmadm repair') the fault and use whatever data you still have in the pool. If the slog is just missing, you can insert it and continue without losing data. In no cases will ZFS silently continue without committed data.

How will it know that data was actually lost? Or does it just alert you that it's possible data was lost?

There's also the worry that the pool is not importable if you did have the double failure scenario and the log really is gone. Re: bug ID 6733267. E.g. if you had done a 'zpool import -o cachefile=none mypool'.


--
Dave
_______________________________________________
zfs-discuss mailing list
zfs-discuss@opensolaris.org
http://mail.opensolaris.org/mailman/listinfo/zfs-discuss
