2011-12-02 18:25, Steve Gonczi пишет:
Hi Jim,
Try to run a zdb -b poolname ..
This should report any leaked or double allocated blocks.
(It may or may not run, it tends to run out of memory and crash on large
datasets)
I would be curious what zdb reports, and whether you are able to run it w/o
crashing with out of memory.
Ok, when/if it completes scrubbing the pool, I'll try that.
But it is likely to fail, unless there are some new failsafe
workarounds for such failures in oi_151a.
In the meanwhile, here are copies of zdb walks which I did
a couple of weeks ago while repairing (finally replacing)
the rpool on this box. At that time it was booted with
oi_148a LiveUSB. Some of the walks (those WITH leak-checks
not disabled) never completed:
root@openindiana:~# time zdb -bb -e 1601233584937321596
Traversing all blocks to verify nothing leaked ...
(box hung: LAN Disconnected; RAM/SWAP used up according to vmstat 1)
root@openindiana:~# time zdb -bsvc -e 1601233584937321596
Traversing all blocks to verify checksums and verify nothing leaked ...
Assertion failed: zio_wait(zio_claim(0L, zcb-zcb_spa, refcnt ? 0 :
spa_first_txg(zcb-zcb_spa), bp, 0L, 0L, ZIO_FLAG_CANFAIL)) == 0 (0x2 ==
0x0), file ../zdb.c, line 1950
Abort
real7197m41.288s
user291m39.256s
sys 25m48.133s
This took most of the week just to fail.
And a walk without leak checks took half a day to find
some discrepancies and unreachable blocks:
root@openindiana:~# time zdb -bsvL -e 1601233584937321596
Traversing all blocks ...
block traversal size 9044729487360 != alloc 9044729499648
(unreachable 12288)
bp count:85245222
bp logical:8891466103808 avg: 104304
bp physical: 7985508591104 avg: 93676 compression:
1.11
bp allocated: 12429007810560 avg: 145802 compression:
0.72
bp deduped:3384278323200ref1: 13909855
deduplication: 1.27
SPA allocated: 9044729499648 used: 75.64%
Blocks LSIZE PSIZE ASIZE avgcomp %Total Type
- - - - - -- unallocated
232K 4K 72.0K 36.0K8.00 0.00 object directory
3 1.50K 1.50K108K 36.0K1.00 0.00 object array
232K 2.50K 72.0K 36.0K 12.80 0.00 packed nvlist
- - - - - -- packed nvlist size
7.80K 988M208M 1.12G147K4.75 0.01 bpobj
- - - - - -- bpobj header
- - - - - -- SPA space map
header
183K 753M517M 6.49G 36.3K1.46 0.06 SPA space map
22 1020K 1020K 1.58M 73.6K1.00 0.00 ZIL intent log
933K 14.6G 3.11G 25.2G 27.6K4.69 0.22 DMU dnode
1.75K 3.50M896K 42.0M 24.0K4.00 0.00 DMU objset
- - - - - -- DSL directory
390 243K200K 13.7M 36.0K1.21 0.00 DSL directory
child map
388 298K208K 13.6M 36.0K1.43 0.00 DSL dataset
snap map
715 10.2M 1.14M 25.1M 36.0K8.92 0.00 DSL props
- - - - - -- DSL dataset
- - - - - -- ZFS znode
- - - - - -- ZFS V0 ACL
76.1M 8.06T 7.25T 11.2T150K1.1198.67 ZFS plain file
2.17M 2.76G 1.33G 52.7G 24.3K2.08 0.46 ZFS directory
341 314K171K 7.99M 24.0K1.84 0.00 ZFS master node
857 25.5M 1.16M 20.1M 24.1K 21.94 0.00 ZFS delete queue
- - - - - -- zvol object
- - - - - -- zvol prop
- - - - - -- other uint8[]
- - - - - -- other uint64[]
- - - - - -- other ZAP
- - - - - -- persistent
error log
33 4.02M763K 4.46M139K5.39 0.00 SPA history
- - - - - -- SPA history offsets
1512 512 36.0K 36.0K1.00 0.00 Pool properties
- - - - - -- DSL permissions
17.1K 12.7M 8.63M411M 24.0K1.48 0.00 ZFS ACL
- - - - - -- ZFS SYSACL
5 80.0K 5.00K120K 24.0K 16.00 0.00 FUID table
- - - - - -- FUID table size
1.37K 723K705K 49.3M 36.0K1.03 0.00 DSL dataset
next clones
- - - - - -- scan work queue
2.69K 2.57M 1.36M 64.6M 24.0K1.89 0.00 ZFS user/group used
- - - - - -- ZFS user/group
quota
- - - -