Re: [zfs-discuss] ZFS disk failure question

2009-10-22 Thread Richard Elling
On Oct 22, 2009, at 12:29 PM, Jason Frank wrote: Thank you for your follow-up. The doc looks great. Having good examples goes a long way to helping others that have my problem. Ideally, the replacement would all happen magically, and I would have had everything marked as good, with one failed

Re: [zfs-discuss] ZFS disk failure question

2009-10-22 Thread Jason Frank
Thank you for your follow-up. The doc looks great. Having good examples goes a long way to helping others that have my problem. Ideally, the replacement would all happen magically, and I would have had everything marked as good, with one failed disk (like a certain other storage vendor that has

Re: [zfs-discuss] ZFS disk failure question

2009-10-22 Thread Cindy Swearingen
Hi Jason, Since spare replacement is an important process, I've rewritten this section to provide 3 main examples, here: http://docs.sun.com/app/docs/doc/817-2271/gcvcw?a=view Scroll down the section: Activating and Deactivating Hot Spares in Your Storage Pool Example 4–7 Manually Replacing a

Re: [zfs-discuss] ZFS disk failure question

2009-10-15 Thread Jason Frank
See, I get overly literal when working on failed production storage (and yes, I do have backups...) I wasn't wanting to cancel the in-progress spare replacement. I had a completed spare replacement, and I wanted to make it "official". So, that didn't really fit my scenario either. I'm glad you

Re: [zfs-discuss] ZFS disk failure question

2009-10-15 Thread Jason Frank
Thank you, that did the trick. That's not terribly obvious from the man page though. The man page says it detaches the devices from a mirror, and I had a raidz2. Since I'm messing with production data, I decided I wasn't going to chance it when I was reading the man page. You might consider chan

Re: [zfs-discuss] ZFS disk failure question

2009-10-14 Thread Trevor Pretty
Cindy How does the SS7000 do it? Today I demoed pulling a disk and the spare just automatically became part of the pool. After it was re-silvered I then pulled three more (latest Q3 version with triple RAID-Z). I then plugged all the drives back in (different slots) and everything was back t

Re: [zfs-discuss] ZFS disk failure question

2009-10-14 Thread Cindy Swearingen
> You should just be able to detach 'c0t6d0' in the config below. The > spare (c0t7d0) will assume its place and be removed from the idle spare > list, becoming a "normal" vdev in the process. Yes, that's what I thought too. This is build 124 bfu'd. See the output below when I just detach the s

Re: [zfs-discuss] ZFS disk failure question

2009-10-14 Thread Eric Schrock
On 10/14/09 14:33, Cindy Swearingen wrote: Hi Eric, I tried that and found that I needed to detach and remove the spare before replacing the failed disk with the spare disk. You should just be able to detach 'c0t6d0' in the config below. The spare (c0t7d0) will assume its place and be remove

Re: [zfs-discuss] ZFS disk failure question

2009-10-14 Thread Miles Nordin
> "cs" == Cindy Swearingen writes: cs> # zpool detach test c0t7d0 cs> # zpool remove test c0t7d0 cs> # zpool replace test c0t6d0 c0t7d0 This is less than ideal because it unnecessarily leaves the pool's redundancy reduced while the replacement resilver is happening. During this

Re: [zfs-discuss] ZFS disk failure question

2009-10-14 Thread Cindy Swearingen
I think it is difficult to cover all the possible ways to replace a disk with a spare. This example in the ZFS Admin Guide didn't work for me: http://docs.sun.com/app/docs/doc/819-5461/gcvcw?a=view See the manual replacement example. After the zpool detach and zpool replace operations, the spar

Re: [zfs-discuss] ZFS disk failure question

2009-10-14 Thread Cindy Swearingen
Hi Eric, I tried that and found that I needed to detach and remove the spare before replacing the failed disk with the spare disk. What actually worked is below. Thanks, Cindy # zpool status test pool: test state: DEGRADED status: One or more devices could not be opened. Sufficient replic

Re: [zfs-discuss] ZFS disk failure question

2009-10-14 Thread Eric Schrock
On 10/14/09 14:26, Jason Frank wrote: Thank you, that did the trick. That's not terribly obvious from the man page though. The man page says it detaches the devices from a mirror, and I had a raidz2. Since I'm messing with production data, I decided I wasn't going to chance it when I was readi

Re: [zfs-discuss] ZFS disk failure question

2009-10-14 Thread Eric Schrock
On 10/14/09 14:17, Cindy Swearingen wrote: Hi Jason, I think you are asking how do you tell ZFS that you want to replace the failed disk c8t7d0 with the spare, c8t11d0? I just tried do this on my Nevada build 124 lab system, simulating a disk failure and using zpool replace to replace the faile

Re: [zfs-discuss] ZFS disk failure question

2009-10-14 Thread Cindy Swearingen
Hi Jason, I think you are asking how do you tell ZFS that you want to replace the failed disk c8t7d0 with the spare, c8t11d0? I just tried do this on my Nevada build 124 lab system, simulating a disk failure and using zpool replace to replace the failed disk with the spare. The spare is now busy

[zfs-discuss] ZFS disk failure question

2009-10-14 Thread Jason Frank
So, my Areca controller has been complaining via email of read errors for a couple days on SATA channel 8. The disk finally gave up last night at 17:40. I got to say I really appreciate the Areca controller taking such good care of me. For some reason, I wasn't able to log into the server las