I have a somewhat similar problem. 
My pool was created under FreeBSD.

  pool: mass
    id: 17237791880291344463
 state: DEGRADED
action: The pool can be imported despite missing or damaged devices.  The
        fault tolerance of the pool may be compromised if imported.
config:
        mass                       DEGRADED
          raidz1                   DEGRADED
            ad8                    ONLINE
            replacing              DEGRADED
              ad10                 ONLINE
              7166408809693659020  FAULTED  corrupted data
            ad14                   ONLINE
            ad12                   ONLINE

One of the HDDs started producing DMA errors. I tried to replace it but failed 
because the zpool commands just hung.
Soon the drive was back to normal and the pool worked again, though the 
'replacing' status remained and I couldn't remove 7166408809693659020.
Then one day I couldn't import the pool anymore; zpool import got stuck.

I tried running Solaris in VirtualBox.
After invoking `zpool import -f mass`, a lot of data is read from the disks, then 
a little is written, and then it just sits there and waits.
It looks like every other zpool and zfs command hangs too.
sol-nv-b98-x86-v1, osol-0906-x86, osol-1002-123-x86 -- no difference.
Removing the c9t1d0p0 symlink doesn't help, so the hang is probably not connected 
to the 'replacing' state.
Currently there is no evidence of any problems with the HDDs; the unfortunate 
one has 5 reallocated sectors.

jack@opensolaris:/dev/rdsk# zpool import
  pool: mass
    id: 17237791880291344463
 state: DEGRADED
status: The pool was last accessed by another system.
action: The pool can be imported despite missing or damaged devices.  The
        fault tolerance of the pool may be compromised if imported.
   see: http://www.sun.com/msg/ZFS-8000-EY
config:
        mass                       DEGRADED
          raidz1                   DEGRADED
            c9t0d0p0               ONLINE
            replacing              DEGRADED
              c9t1d0p0             ONLINE
              7166408809693659020  UNAVAIL  cannot open
            c9t3d0p0               ONLINE
            c9t2d0p0               ONLINE
jack@opensolaris:~# zdb -l /dev/rdsk/c9t0d0p0
--------------------------------------------
LABEL 0
--------------------------------------------
    version=13
    name='mass'
    state=0
    txg=56673850
    pool_guid=17237791880291344463
    hostid=3705723203
    hostname='hostname'
    top_guid=17208459754718949456
    guid=15572876870803484326
    vdev_tree
        type='raidz'
        id=0
        guid=17208459754718949456
        nparity=1
        metaslab_array=14
        metaslab_shift=34
        ashift=9
        asize=2000412475392
        is_log=0
        children[0]
                type='disk'
                id=0
                guid=15572876870803484326
                path='/dev/ad8'
                whole_disk=0
                DTL=55
        children[1]
                type='replacing'
                id=1
                guid=16446985611132454904
                whole_disk=0
                children[0]
                        type='disk'
                        id=0
                        guid=10380122112867903132
                        path='/dev/ad10'
                        whole_disk=0
                        DTL=54
                children[1]
                        type='disk'
                        id=1
                        guid=7166408809693659020
                        path='/dev/ad10'
                        whole_disk=0
                        not_present=1
                        DTL=209
        children[2]
                type='disk'
                id=2
                guid=5186164191909664446
                path='/dev/ad14'
                whole_disk=0
                DTL=53
        children[3]
                type='disk'
                id=3
                guid=16038735897748748777
                path='/dev/ad12'
                whole_disk=0
                DTL=52
--------------------------------------------
LABEL 1
--------------------------------------------
    version=13
    name='mass'
    state=0
    txg=56673850
    pool_guid=17237791880291344463
    hostid=3705723203
    hostname='hostname'
    top_guid=17208459754718949456
    guid=15572876870803484326
    vdev_tree
        type='raidz'
        id=0
        guid=17208459754718949456
        nparity=1
        metaslab_array=14
        metaslab_shift=34
        ashift=9
        asize=2000412475392
        is_log=0
        children[0]
                type='disk'
                id=0
                guid=15572876870803484326
                path='/dev/ad8'
                whole_disk=0
                DTL=55
        children[1]
                type='replacing'
                id=1
                guid=16446985611132454904
                whole_disk=0
                children[0]
                        type='disk'
                        id=0
                        guid=10380122112867903132
                        path='/dev/ad10'
                        whole_disk=0
                        DTL=54
                children[1]
                        type='disk'
                        id=1
                        guid=7166408809693659020
                        path='/dev/ad10'
                        whole_disk=0
                        not_present=1
                        DTL=209
        children[2]
                type='disk'
                id=2
                guid=5186164191909664446
                path='/dev/ad14'
                whole_disk=0
                DTL=53
        children[3]
                type='disk'
                id=3
                guid=16038735897748748777
                path='/dev/ad12'
                whole_disk=0
                DTL=52
--------------------------------------------
LABEL 2
--------------------------------------------
    version=13
    name='mass'
    state=0
    txg=56673850
    pool_guid=17237791880291344463
    hostid=3705723203
    hostname='hostname'
    top_guid=17208459754718949456
    guid=15572876870803484326
    vdev_tree
        type='raidz'
        id=0
        guid=17208459754718949456
        nparity=1
        metaslab_array=14
        metaslab_shift=34
        ashift=9
        asize=2000412475392
        is_log=0
        children[0]
                type='disk'
                id=0
                guid=15572876870803484326
                path='/dev/ad8'
                whole_disk=0
                DTL=55
        children[1]
                type='replacing'
                id=1
                guid=16446985611132454904
                whole_disk=0
                children[0]
                        type='disk'
                        id=0
                        guid=10380122112867903132
                        path='/dev/ad10'
                        whole_disk=0
                        DTL=54
                children[1]
                        type='disk'
                        id=1
                        guid=7166408809693659020
                        path='/dev/ad10'
                        whole_disk=0
                        not_present=1
                        DTL=209
        children[2]
                type='disk'
                id=2
                guid=5186164191909664446
                path='/dev/ad14'
                whole_disk=0
                DTL=53
        children[3]
                type='disk'
                id=3
                guid=16038735897748748777
                path='/dev/ad12'
                whole_disk=0
                DTL=52
--------------------------------------------
LABEL 3
--------------------------------------------
    version=13
    name='mass'
    state=0
    txg=56673850
    pool_guid=17237791880291344463
    hostid=3705723203
    hostname='hostname'
    top_guid=17208459754718949456
    guid=15572876870803484326
    vdev_tree
        type='raidz'
        id=0
        guid=17208459754718949456
        nparity=1
        metaslab_array=14
        metaslab_shift=34
        ashift=9
        asize=2000412475392
        is_log=0
        children[0]
                type='disk'
                id=0
                guid=15572876870803484326
                path='/dev/ad8'
                whole_disk=0
                DTL=55
        children[1]
                type='replacing'
                id=1
                guid=16446985611132454904
                whole_disk=0
                children[0]
                        type='disk'
                        id=0
                        guid=10380122112867903132
                        path='/dev/ad10'
                        whole_disk=0
                        DTL=54
                children[1]
                        type='disk'
                        id=1
                        guid=7166408809693659020
                        path='/dev/ad10'
                        whole_disk=0
                        not_present=1
                        DTL=209
        children[2]
                type='disk'
                id=2
                guid=5186164191909664446
                path='/dev/ad14'
                whole_disk=0
                DTL=53
        children[3]
                type='disk'
                id=3
                guid=16038735897748748777
                path='/dev/ad12'
                whole_disk=0
                DTL=52
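
All four labels above report the same txg (56673850). A minimal sketch for checking that mechanically on saved `zdb -l` output follows; the helper names `split_labels` and `labels_agree` are made up for illustration, and the parser assumes top-level fields are indented by exactly four spaces, as they are in the output above.

```python
import re

def split_labels(zdb_output):
    """Split `zdb -l` text into {label_number: {key: value}}, top-level keys only."""
    labels = {}
    current = None
    for line in zdb_output.splitlines():
        header = re.fullmatch(r"LABEL (\d+)", line.strip())
        if header:
            current = int(header.group(1))
            labels[current] = {}
            continue
        # Assumption: top-level fields use exactly four spaces of indent;
        # deeper vdev_tree entries are indented further and are skipped.
        field = re.fullmatch(r" {4}(\w+)=(.*)", line)
        if field and current is not None:
            labels[current][field.group(1)] = field.group(2).strip("'")
    return labels

def labels_agree(labels, keys=("txg", "pool_guid", "guid")):
    """True when every label reports the same value for each key."""
    return all(len({lab.get(k) for lab in labels.values()}) == 1 for k in keys)

# Shortened sample copied from the output above (two labels, a few fields).
sample = """\
--------------------------------------------
LABEL 0
--------------------------------------------
    version=13
    name='mass'
    txg=56673850
    pool_guid=17237791880291344463
    guid=15572876870803484326
    vdev_tree
        type='raidz'
--------------------------------------------
LABEL 1
--------------------------------------------
    version=13
    name='mass'
    txg=56673850
    pool_guid=17237791880291344463
    guid=15572876870803484326
    vdev_tree
        type='raidz'
"""

parsed = split_labels(sample)
print(labels_agree(parsed))
```

Feeding it the full output of each disk in turn would show whether any device carries a label with a different txg.
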
jack@opensolaris:/dev/rdsk# zpool import -f mass


jack@opensolaris:~# echo "0t959::pid2proc|::walk thread|::findstack -v" | mdb -k
stack pointer for thread d44a8340: d38dcc24
  d38dcc64 swtch+0x188()
  d38dcc74 cv_wait+0x53(d8c8b9a6, d8c8b968, d38dccb4, f95c4741)
  d38dccb4 txg_wait_synced+0x90(d8c8b800, 0, 0, 2)
  d38dcd34 spa_load+0xe51(d4e76380, d46954d0, 2, 1)
  d38dcd84 spa_import_common+0xc3()
  d38dcdb4 spa_import+0x18(d4b0a000, d46954d0, 0, f95ed844)
  d38dcde4 zfs_ioc_pool_import+0xcd(d4b0a000, 0, 0)
  d38dce14 zfsdev_ioctl+0xe0()
  d38dce44 cdev_ioctl+0x31(2d80000, 5a02, 80420c0, 100003, d8837f28, d38dcf00)
  d38dce74 spec_ioctl+0x6b(d39b0e80, 5a02, 80420c0, 100003, d8837f28, d38dcf00)
  d38dcec4 fop_ioctl+0x49(d39b0e80, 5a02, 80420c0, 100003, d8837f28, d38dcf00)
  d38dcf84 ioctl+0x171()
  d38dcfac sys_call+0x10c()
jack@opensolaris:~# echo "d4e76380::print struct spa spa_dsl_pool->dp_tx.tx_sync_thread|::findstack -v" | mdb -k
stack pointer for thread d8e1fdc0: d8e1fbd8
  d8e1fc18 swtch+0x188()
  d8e1fc28 cv_wait+0x53(dc0eca68, dc0eca60, 0, 0)
  d8e1fc58 zio_wait+0x55(dc0ec820, dc0ec820)
  d8e1fcb8 dsl_pool_sync+0x24f(d8c8b800, 360c63b, 0)
  d8e1fd28 spa_sync+0x452(d4e76380, 360c63b, 0, 0)
  d8e1fda8 txg_sync_thread+0x308(d8c8b800, 0)
  d8e1fdb8 thread_start+8()
jack@opensolaris:~#
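
For what it's worth, the hex value 360c63b passed to spa_sync and dsl_pool_sync in the second stack can be compared with the txg stored in the labels. This is only a sketch of the arithmetic, and it assumes that the second argument of spa_sync is the txg being synced:

```python
# txg recorded in all four labels of c9t0d0p0 (from the zdb -l output)
label_txg = 56673850

# Second argument of spa_sync()/dsl_pool_sync() in the stack trace,
# assumed here to be the txg currently being synced.
sync_txg = int("360c63b", 16)

print(sync_txg, sync_txg - label_txg)
```

If that assumption holds, the sync thread is waiting in zio_wait on the txg immediately after the one recorded in the labels.
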