Hi,
I'm evaluating btrfs for a future deployment, and managed to
(repeatedly
) get btrfs to the state where the system can't mount, can't fsck and
can't
recover.
The test setup is pretty small, 6 devices of various size:
butter-1.5GA vg_dolt -wi-a
1.50g
butter-1.5GB vg_dolt -wi-a
1.50g
butter-2GA vg_dolt -wi-a
2.00g
butter-2GB vg_dolt -wi-a
2.00g
butter-3GA vg_dolt -wi-a
3.00g
butter-3GB vg_dolt -wi-a
3.00g
Created an btrfs volume:
mkfs.btrfs -d raid10 -m raid1 /dev/mapper/vg_dolt-butter--1.5GA
/dev/mapper/vg_dolt-butter--1.5GA /dev/mapper/vg_dolt-butter--2GA
/dev/mapper/vg_dolt-butter--2GB /dev/mapper/vg_dolt-butter--3GA
/dev/mapper/vg_dolt-butter--3GB
( Note how above it is mistyped, This is a 5 disk raid10. Where 1.5GA
was
listed twice. )
--
mount it and fill it with files ( I downloaded parts of the fedora
src.rpm
tree ).
unmount the partition
Zero one drive
dd if=/dev/zero of=/dev/vg_dolt/butter-3GB bs=1M skip=100
( It's sort of hard to fake a corrupt drive, this is a decent way of
doing
it )
trying to mount it gives the following setup:
Jun 28 23:58:34 dolt kernel: [2815554.803082] device fsid
379e495a-9ba7-4485-ae74-6c8939f7b22e devid 3 transid 27
/dev/mapper/vg_dolt-butter--2GB
Jun 28 23:58:34 dolt kernel: [2815554.850211] btrfs: disk space caching
is
enabled
Jun 28 23:58:34 dolt kernel: [2815554.850856] btrfs: failed to read
chunk
tree on dm-6
Jun 28 23:58:34 dolt kernel: [2815554.856453] btrfs: open_ctree failed
Jun 28 23:58:44 dolt kernel: [2815565.475519] device fsid
379e495a-9ba7-4485-ae74-6c8939f7b22e devid 3 transid 27
/dev/mapper/vg_dolt-butter--2GB
Jun 28 23:58:44 dolt kernel: [2815565.476939] btrfs: enabling auto
recovery
Jun 28 23:58:44 dolt kernel: [2815565.476944] btrfs: disk space caching
is
enabled
Jun 28 23:58:44 dolt kernel: [2815565.477648] btrfs: failed to read
chunk
tree on dm-6
Jun 28 23:58:44 dolt kernel: [2815565.486300] btrfs: open_ctree failed
Jun 28 23:58:52 dolt kernel: [2815573.522271] device fsid
379e495a-9ba7-4485-ae74-6c8939f7b22e devid 2 transid 27
/dev/mapper/vg_dolt-butter--2GA
Jun 28 23:58:52 dolt kernel: [2815573.536624] btrfs: enabling auto
recovery
Jun 28 23:58:52 dolt kernel: [2815573.536628] btrfs: disk space caching
is
enabled
Jun 28 23:58:52 dolt kernel: [2815573.537185] btrfs: failed to read
chunk
tree on dm-6
Jun 28 23:58:52 dolt kernel: [2815573.542938] btrfs: open_ctree failed
[root@dolt mnt]# btrfsck /dev/vg_dolt/butter-2GA
failed to read /dev/sr0
failed to read /dev/sr0
warning, device 5 is missing
warning devid 5 not found already
checking extents
checking fs roots
checking root refs
Segmentation fault
[root@dolt mnt]# mount -o recovery,ro /dev/mapper/vg_dolt-butter--2GA
/mnt/test/
mount: wrong fs type, bad option, bad superblock on
/dev/mapper/vg_dolt-butter--2GA,
missing codepage or helper program, or other error
In some cases useful info is found in syslog - try
dmesg | tail or so
[root@dolt mnt]#
debuginfo-install btrfs-progs-0.20.rc1.20121017git91d9eec-3.fc18.x86_64
[root@dolt mnt]# gdb btrfsck /dev/vg_dolt/butter-2GA
GNU gdb (GDB) Fedora (7.5.1-38.fc18)
Copyright (C) 2012 Free Software Foundation, Inc.
License GPLv3+: GNU GPL version 3 or later
http://gnu.org/licenses/gpl.html
This is free software: you are free to change and redistribute it.
There is NO WARRANTY, to the extent permitted by law. Type show
copying
and show warranty for details.
This GDB was configured as x86_64-redhat-linux-gnu.
For bug reporting instructions, please see:
http://www.gnu.org/software/gdb/bugs/...
Reading symbols from /usr/sbin/btrfsck...Reading symbols from
/usr/lib/debug/usr/sbin/btrfsck.debug...done.
done.
/dev/vg_dolt/butter-2GA is not a core dump: File format not recognized
(gdb) run /dev/vg_dolt/butter-2GA
Starting program: /usr/sbin/btrfsck /dev/vg_dolt/butter-2GA
failed to read /dev/sr0
failed to read /dev/sr0
warning, device 5 is missing
warning devid 5 not found already
checking extents
checking fs roots
checking root refs
Program received signal SIGSEGV, Segmentation fault.
__GI___libc_free (mem=0x80) at malloc.c:2907
2907 if (chunk_is_mmapped(p)) /* release mmapped
memory. */
(gdb) bt full
#0 __GI___libc_free (mem=0x80) at malloc.c:2907
ar_ptr = optimized out
p = optimized out
hook = 0x0
#1 0x0040d429 in close_all_devices (fs_info=0x6323e0) at
disk-io.c:1088
list = 0x631050
next = 0x6300b0
tmp = 0x630430
device = 0x6300b0
#2 0x0040e3df in close_ctree (root=root@entry=0x6426e0) at
disk-io.c:1135
ret = optimized out
fs_info = 0x6323e0
__PRETTY_FUNCTION__ = close_ctree
#3 0x00401d8d in main (ac=optimized out, av=optimized out)
at
btrfsck.c:3593
root_cache = {root = {rb_node = 0x0, rotate_notify =
0x423aad
__libc_csu_init+93}}
root = optimized out
info = optimized out
trans = optimized out
bytenr =