I have a somewhat similar problem. My pool was created under FreeBSD. pool: mass id: 17237791880291344463 state: DEGRADED action: The pool can be imported despite missing or damaged devices. The fault tolerance of the pool may be compromised if imported. config: mass DEGRADED raidz1 DEGRADED ad8 ONLINE replacing DEGRADED ad10 ONLINE 7166408809693659020 FAULTED corrupted data ad14 ONLINE ad12 ONLINE
One of hdds started producing DMA errors. I tried to replace it but failed because zpool commands just hung. Soon the drive was back to normal and the pool worked again though the status 'replacing' remained and I couldn't remove 7166408809693659020. Then one day I couldn't import it anymore. zpool import was getting stuck. I tried running Solaris in the VirtualBox. After invoking `zpool import -f mass` a lot of data is read from the disks then a little is written and then it sits there and waits. It looks like every zpool and zfs command hangs too. sol-nv-b98-x86-v1, osol-0906-x86, osol-1002-123-x86 -- no difference. Removing c9t1d0p0 symlink doesn't help so it's probably not connected to 'replacing' state. Currently there is no evidence of any problems with the hdds, the unfortunate one has 5 reallocated sectors. jack at opensolaris:/dev/rdsk# zpool import pool: mass id: 17237791880291344463 state: DEGRADED status: The pool was last accessed by another system. action: The pool can be imported despite missing or damaged devices. The fault tolerance of the pool may be compromised if imported. see: http://www.sun.com/msg/ZFS-8000-EY config: mass DEGRADED raidz1 DEGRADED c9t0d0p0 ONLINE replacing DEGRADED c9t1d0p0 ONLINE 7166408809693659020 UNAVAIL cannot open c9t3d0p0 ONLINE c9t2d0p0 ONLINE jack at opensolaris:~# zdb -l /dev/rdsk/c9t0d0p0 -------------------------------------------- LABEL 0 -------------------------------------------- version=13 name='mass' state=0 txg=56673850 pool_guid=17237791880291344463 hostid=3705723203 hostname='hostname' top_guid=17208459754718949456 guid=15572876870803484326 vdev_tree type='raidz' id=0 guid=17208459754718949456 nparity=1 metaslab_array=14 metaslab_shift=34 ashift=9 asize=2000412475392 is_log=0 children[0] type='disk' id=0 guid=15572876870803484326 path='/dev/ad8' whole_disk=0 DTL=55 children[1] type='replacing' id=1 guid=16446985611132454904 whole_disk=0 children[0] type='disk' id=0 guid=10380122112867903132 path='/dev/ad10' whole_disk=0 DTL=54 children[1] type='disk' id=1 guid=7166408809693659020 path='/dev/ad10' whole_disk=0 not_present=1 DTL=209 children[2] type='disk' id=2 guid=5186164191909664446 path='/dev/ad14' whole_disk=0 DTL=53 children[3] type='disk' id=3 guid=16038735897748748777 path='/dev/ad12' whole_disk=0 DTL=52 -------------------------------------------- LABEL 1 -------------------------------------------- version=13 name='mass' state=0 txg=56673850 pool_guid=17237791880291344463 hostid=3705723203 hostname='hostname' top_guid=17208459754718949456 guid=15572876870803484326 vdev_tree type='raidz' id=0 guid=17208459754718949456 nparity=1 metaslab_array=14 metaslab_shift=34 ashift=9 asize=2000412475392 is_log=0 children[0] type='disk' id=0 guid=15572876870803484326 path='/dev/ad8' whole_disk=0 DTL=55 children[1] type='replacing' id=1 guid=16446985611132454904 whole_disk=0 children[0] type='disk' id=0 guid=10380122112867903132 path='/dev/ad10' whole_disk=0 DTL=54 children[1] type='disk' id=1 guid=7166408809693659020 path='/dev/ad10' whole_disk=0 not_present=1 DTL=209 children[2] type='disk' id=2 guid=5186164191909664446 path='/dev/ad14' whole_disk=0 DTL=53 children[3] type='disk' id=3 guid=16038735897748748777 path='/dev/ad12' whole_disk=0 DTL=52 -------------------------------------------- LABEL 2 -------------------------------------------- version=13 name='mass' state=0 txg=56673850 pool_guid=17237791880291344463 hostid=3705723203 hostname='hostname' top_guid=17208459754718949456 guid=15572876870803484326 vdev_tree type='raidz' id=0 guid=17208459754718949456 nparity=1 metaslab_array=14 metaslab_shift=34 ashift=9 asize=2000412475392 is_log=0 children[0] type='disk' id=0 guid=15572876870803484326 path='/dev/ad8' whole_disk=0 DTL=55 children[1] type='replacing' id=1 guid=16446985611132454904 whole_disk=0 children[0] type='disk' id=0 guid=10380122112867903132 path='/dev/ad10' whole_disk=0 DTL=54 children[1] type='disk' id=1 guid=7166408809693659020 path='/dev/ad10' whole_disk=0 not_present=1 DTL=209 children[2] type='disk' id=2 guid=5186164191909664446 path='/dev/ad14' whole_disk=0 DTL=53 children[3] type='disk' id=3 guid=16038735897748748777 path='/dev/ad12' whole_disk=0 DTL=52 -------------------------------------------- LABEL 3 -------------------------------------------- version=13 name='mass' state=0 txg=56673850 pool_guid=17237791880291344463 hostid=3705723203 hostname='hostname' top_guid=17208459754718949456 guid=15572876870803484326 vdev_tree type='raidz' id=0 guid=17208459754718949456 nparity=1 metaslab_array=14 metaslab_shift=34 ashift=9 asize=2000412475392 is_log=0 children[0] type='disk' id=0 guid=15572876870803484326 path='/dev/ad8' whole_disk=0 DTL=55 children[1] type='replacing' id=1 guid=16446985611132454904 whole_disk=0 children[0] type='disk' id=0 guid=10380122112867903132 path='/dev/ad10' whole_disk=0 DTL=54 children[1] type='disk' id=1 guid=7166408809693659020 path='/dev/ad10' whole_disk=0 not_present=1 DTL=209 children[2] type='disk' id=2 guid=5186164191909664446 path='/dev/ad14' whole_disk=0 DTL=53 children[3] type='disk' id=3 guid=16038735897748748777 path='/dev/ad12' whole_disk=0 DTL=52 jack at opensolaris:/dev/rdsk# zpool import -f mass jack at opensolaris:~# echo "0t959::pid2proc|::walk thread|::findstack -v" | mdb -k stack pointer for thread d44a8340: d38dcc24 d38dcc64 swtch+0x188() d38dcc74 cv_wait+0x53(d8c8b9a6, d8c8b968, d38dccb4, f95c4741) d38dccb4 txg_wait_synced+0x90(d8c8b800, 0, 0, 2) d38dcd34 spa_load+0xe51(d4e76380, d46954d0, 2, 1) d38dcd84 spa_import_common+0xc3() d38dcdb4 spa_import+0x18(d4b0a000, d46954d0, 0, f95ed844) d38dcde4 zfs_ioc_pool_import+0xcd(d4b0a000, 0, 0) d38dce14 zfsdev_ioctl+0xe0() d38dce44 cdev_ioctl+0x31(2d80000, 5a02, 80420c0, 100003, d8837f28, d38dcf00) d38dce74 spec_ioctl+0x6b(d39b0e80, 5a02, 80420c0, 100003, d8837f28, d38dcf00) d38dcec4 fop_ioctl+0x49(d39b0e80, 5a02, 80420c0, 100003, d8837f28, d38dcf00) d38dcf84 ioctl+0x171() d38dcfac sys_call+0x10c() jack at opensolaris:~# echo "d4e76380::print struct spa spa_dsl_pool->dp_tx.tx_sync_thread|::findstack -v" | mdb -k stack pointer for thread d8e1fdc0: d8e1fbd8 d8e1fc18 swtch+0x188() d8e1fc28 cv_wait+0x53(dc0eca68, dc0eca60, 0, 0) d8e1fc58 zio_wait+0x55(dc0ec820, dc0ec820) d8e1fcb8 dsl_pool_sync+0x24f(d8c8b800, 360c63b, 0) d8e1fd28 spa_sync+0x452(d4e76380, 360c63b, 0, 0) d8e1fda8 txg_sync_thread+0x308(d8c8b800, 0) d8e1fdb8 thread_start+8() jack at opensolaris:~# -- This message posted from opensolaris.org