On Mon, Jun 27, 2016 at 3:36 PM, Saint Germain <saint...@gmail.com> wrote:
> Hello,
>
> I am on Debian Jessie with a kernel from backports:
> 4.6.0-0.bpo.1-amd64
>
> I am also using btrfs-tools 4.4.1-1.1~bpo8+1
>
> When trying to replace a RAID1 drive (with btrfs replace start
> -f /dev/sda1 /dev/sdd1), the operation is cancelled after completing
> only 5%.
>
> I got this error in the /var/log/syslog:
> ------------[ cut here ]------------
> WARNING: CPU: 2 PID: 2617 at 
> /build/linux-9LouV5/linux-4.6.1/fs/btrfs/dev-replace.c:430 
> btrfs_dev_replace_start+0x2be/0x400 [btrfs]
> Modules linked in: uas(E) usb_storage(E) bnep(E) ftdi_sio(E) usbserial(E) 
> snd_hda_codec_hdmi(E) nls_utf8(E) nls_cp437(E) vfat(E) fat(E) intel_rapl(E) 
> x86_pkg_temp_thermal(E) intel_powerclamp(E) coretemp(E) kvm_intel(E) kvm(E) 
> iTCO_wdt(E) irqbypass(E) iTCO_vendor_support(E) crct10dif_pclmul(E) 
> crc32_pclmul(E) ghash_clmulni_intel(E) hmac(E) drbg(E) ansi_cprng(E) 
> aesni_intel(E) aes_x86_64(E) lrw(E) gf128mul(E) glue_helper(E) ablk_helper(E) 
> cryptd(E) wl(POE) btusb(E) btrtl(E) btbcm(E) btintel(E) cfg80211(E) 
> bluetooth(E) efi_pstore(E) snd_hda_codec_realtek(E) evdev(E) crc16(E) 
> serio_raw(E) pcspkr(E) efivars(E) joydev(E) snd_hda_codec_generic(E) 
> rfkill(E) snd_hda_intel(E) nuvoton_cir(E) rc_core(E) snd_hda_codec(E) i915(E) 
> battery(E) snd_hda_core(E) snd_hwdep(E) soc_button_array(E) tpm_tis(E) 
> drm_kms_helper(E) intel_smartconnect(E) snd_pcm(E) tpm(E) video(E) 
> i2c_i801(E) snd_timer(E) drm(E) snd(E) lpc_ich(E) i2c_algo_bit(E) 
> soundcore(E) mfd_core(E) mei_me(E) processor(E) button(E) mei(E) shpchp(E) 
> fuse(E) autofs4(E) hid_logitech_hidpp(E) btrfs(E) hid_logitech_dj(E) 
> usbhid(E) hid(E) xor(E) raid6_pq(E) sg(E) sr_mod(E) cdrom(E) sd_mod(E) 
> crc32c_intel(E) ahci(E) libahci(E) libata(E) psmouse(E) scsi_mod(E) 
> xhci_pci(E) ehci_pci(E) xhci_hcd(E) ehci_hcd(E) e1000e(E) usbcore(E) ptp(E) 
> pps_core(E) usb_common(E) fjes(E)
> CPU: 2 PID: 2617 Comm: btrfs Tainted: P           OE   4.6.0-0.bpo.1-amd64 #1 
> Debian 4.6.1-1~bpo8+1
> Hardware name: To Be Filled By O.E.M. To Be Filled By O.E.M./Z87E-ITX, BIOS 
> P2.10 10/04/2013
>  0000000000000286 00000000f0ba7fe7 ffffffff813123c5 0000000000000000
>  0000000000000000 ffffffff8107af94 ffff880186caf000 00000000fffffffb
>  ffff8800c76b0800 ffff8800cae70000 ffff8800cae70ee0 00007ffdd5397d98
> Call Trace:
>  [<ffffffff813123c5>] ? dump_stack+0x5c/0x77
>  [<ffffffff8107af94>] ? __warn+0xc4/0xe0
>  [<ffffffffc030990e>] ? btrfs_dev_replace_start+0x2be/0x400 [btrfs]
>  [<ffffffffc02d24f2>] ? btrfs_ioctl+0x1d42/0x2190 [btrfs]
>  [<ffffffff811a8e2d>] ? handle_mm_fault+0x154d/0x1cb0
>  [<ffffffff81205379>] ? do_vfs_ioctl+0x99/0x5d0
>  [<ffffffff81205926>] ? SyS_ioctl+0x76/0x90
>  [<ffffffff815c83b6>] ? system_call_fast_compare_end+0xc/0x96
> ---[ end trace 9fbfaa137cc5a72a ]---
>
>
>
> What should I do to replace correctly my drive ?

I don't often see handle_mm_fault with btrfs problems, maybe the
entire dmesg from mounting the fs and including btrfs replace would
reveal a related problem that instigates the failure?

If the device being replaced is acting unreliably, then you'd want to
use -r with replace to ignore that device unless it's absolutely
necessary to read from it.

-- 
Chris Murphy
--
To unsubscribe from this list: send the line "unsubscribe linux-btrfs" in
the body of a message to majord...@vger.kernel.org
More majordomo info at  http://vger.kernel.org/majordomo-info.html

Reply via email to