Re: kernel BUG at fs/btrfs/extent-tree.c:8113! (4.1.3 kernel)

2015-08-12 Thread Josef Bacik
On 08/12/2015 10:47 AM, Marc MERLIN wrote: On Tue, Aug 11, 2015 at 11:40:45AM -0400, Josef Bacik wrote: From a48cf7a9ae44a17d927df5542c8b0be287aee9ed Mon Sep 17 00:00:00 2001 From: Josef Bacik jba...@fb.com Date: Tue, 11 Aug 2015 11:39:37 -0400 Subject: [PATCH] Btrfs: kill BUG_ON() in

Re: [PATCH v2] Btrfs: check if previous transaction aborted to avoid fs corruption

2015-08-12 Thread Liu Bo
On Wed, Aug 12, 2015 at 03:44:51PM +0100, fdman...@kernel.org wrote: From: Filipe Manana fdman...@suse.com While we are committing a transaction, it's possible the previous one is still finishing its commit and therefore we wait for it to finish first. However we were not checking if that

[PATCH] Btrfs: check if previous transaction aborted to avoid fs corruption

2015-08-12 Thread fdmanana
From: Filipe Manana fdman...@suse.com While we are committing a transaction, it's possible the previous one is still finishing its commit and therefore we wait for it to finish first. However we were not checking if that previous transaction ended up getting aborted after we waited for it to

Re: kernel BUG at fs/btrfs/extent-tree.c:8113! (4.1.3 kernel)

2015-08-12 Thread Marc MERLIN
On Tue, Aug 11, 2015 at 11:40:45AM -0400, Josef Bacik wrote: From a48cf7a9ae44a17d927df5542c8b0be287aee9ed Mon Sep 17 00:00:00 2001 From: Josef Bacik jba...@fb.com Date: Tue, 11 Aug 2015 11:39:37 -0400 Subject: [PATCH] Btrfs: kill BUG_ON() in btrfs_lookup_extent_info() Replace it with an

Re: [PATCH] Btrfs: check if previous transaction aborted to avoid fs corruption

2015-08-12 Thread Josef Bacik
On 08/12/2015 10:11 AM, fdman...@kernel.org wrote: From: Filipe Manana fdman...@suse.com While we are committing a transaction, it's possible the previous one is still finishing its commit and therefore we wait for it to finish first. However we were not checking if that previous transaction

[PATCH v2] Btrfs: check if previous transaction aborted to avoid fs corruption

2015-08-12 Thread fdmanana
From: Filipe Manana fdman...@suse.com While we are committing a transaction, it's possible the previous one is still finishing its commit and therefore we wait for it to finish first. However we were not checking if that previous transaction ended up getting aborted after we waited for it to

Re: Scaling to 100k+ snapshots/subvolumes

2015-08-12 Thread Tristan Zajonc
Thanks, this is helpful. We are primarily scaling the number of snapshots. Unfortunately these snapshots typically have very minor changes compared their parent, so this sounds potentially problematic. It sounds like I will need to do some testing of both snapshots and quotas to determine

Deleted files cause btrfs-send to fail

2015-08-12 Thread Marc Joliet
Hi all, Starting today I have an interesting problem: I deleted some files as part (old fcrontabs), which now persistently causes btrfs-send to fail. The error message I get is: Aug 12 23:32:24 thetick make_backups.sh[1059]: ERROR: send ioctl failed with -2: No such file or directory Aug 12

Re: Damaged filesystem, can read, can't repair, error says to contact devs

2015-08-12 Thread Timothy Normand Miller
Actually, it didn't resume. The btrfs delete missing was using 100% of the I/O bandwidth but wasn't actually doing any disk reads of writes. I tried to reboot, but the system wouldn't go down, so after waiting 10 minutes, I power-cycled. Now I can't mount at all and here's what dmesg says about

Re: Damaged filesystem, can read, can't repair, error says to contact devs

2015-08-12 Thread Timothy Normand Miller
On Wed, Aug 12, 2015 at 2:10 PM, Chris Murphy li...@colorremedies.com wrote: Anyway it looks like it's hardware related, but I don't know what device ata4.00 is, so maybe this helps: http://superuser.com/questions/617192/mapping-ata-device-number-to-logical-device-name # ata=4; ls -l

Re: Damaged filesystem, can read, can't repair, error says to contact devs

2015-08-12 Thread Timothy Normand Miller
Ok, here's what's happening. A few years ago, I took my old WD green drives and put them in a box as backups to a new array of Seagate drives. When one of those seagate drives failed (just out of warranty, of course), I replaced it with one of the WD's. That was cooking along just fine until

Re: Damaged filesystem, can read, can't repair, error says to contact devs

2015-08-12 Thread Timothy Normand Miller
It resumed on its own. Weird. On Wed, Aug 12, 2015 at 4:23 PM, Timothy Normand Miller theo...@gmail.com wrote: On Wed, Aug 12, 2015 at 2:10 PM, Chris Murphy li...@colorremedies.com wrote: Anyway it looks like it's hardware related, but I don't know what device ata4.00 is, so maybe this

Re: [PATCH] fstests: btrfs regression test for defrag tail extents

2015-08-12 Thread Liu Bo
On Tue, Aug 11, 2015 at 12:32:01PM +1000, Dave Chinner wrote: On Mon, Aug 10, 2015 at 04:12:59PM +0800, Liu Bo wrote: Regression test for btrfs defragment tool, it's aimed to verify that tail extents won't be skipped as a separate extent while the previous extents have been defrag'ed into a

Re: [PATCH] fstests: btrfs regression test for defrag tail extents

2015-08-12 Thread Liu Bo
On Mon, Aug 10, 2015 at 10:17:52AM +0100, Filipe David Manana wrote: On Mon, Aug 10, 2015 at 9:12 AM, Liu Bo bo.li@oracle.com wrote: Regression test for btrfs defragment tool, it's aimed to verify that tail extents won't be skipped as a separate extent while the previous extents have

Oddness with phantom device replacing real device.

2015-08-12 Thread David Seikel
I don't actually think that this is a BTRFS problem, but it's showing symptoms within BTRFS, and I have no other clues, so maybe the BTRFS experts can help me figure out what is actually going wrong. I'm a sysadmin working for a company that does scientific modelling. They have many TBs of data.

[PATCH v6 04/11] btrfs: remove bio splitting and merge_bvec_fn() calls

2015-08-12 Thread Ming Lin
From: Kent Overstreet kent.overstr...@gmail.com Btrfs has been doing bio splitting from btrfs_map_bio(), by checking device limits as well as calling -merge_bvec_fn() etc. That is not necessary any more, because generic_make_request() is now able to handle arbitrarily sized bios. So clean up

Re: Scaling to 100k+ snapshots/subvolumes

2015-08-12 Thread Hugo Mills
On Tue, Aug 11, 2015 at 11:33:45AM -0700, Tristan Zajonc wrote: In an early thread Duncan mentioned that btrfs does not scale well in the number of subvolumes (including snapshots). He recommended keeping the total number under 1000. I just wanted to understand this limitation further. Is

Re: [RFC 4/8] jbd, jbd2: Do not fail journal because of frozen_buffer allocation failure

2015-08-12 Thread Michal Hocko
On Wed 05-08-15 09:49:24, Greg Thelen wrote: mho...@kernel.org wrote: From: Michal Hocko mho...@suse.com Journal transaction might fail prematurely because the frozen_buffer is allocated by GFP_NOFS request: [ 72.440013] do_get_write_access: OOM for frozen_buffer [ 72.440014]

Re: kernel BUG at fs/btrfs/extent-tree.c:8113! (4.1.3 kernel)

2015-08-12 Thread Josef Bacik
On 08/12/2015 12:09 PM, Marc MERLIN wrote: On Wed, Aug 12, 2015 at 11:15:39AM -0400, Josef Bacik wrote: On 08/12/2015 10:47 AM, Marc MERLIN wrote: On Tue, Aug 11, 2015 at 11:40:45AM -0400, Josef Bacik wrote: From a48cf7a9ae44a17d927df5542c8b0be287aee9ed Mon Sep 17 00:00:00 2001 From: Josef

Re: kernel BUG at fs/btrfs/extent-tree.c:8113! (4.1.3 kernel)

2015-08-12 Thread Marc MERLIN
On Wed, Aug 12, 2015 at 11:15:39AM -0400, Josef Bacik wrote: On 08/12/2015 10:47 AM, Marc MERLIN wrote: On Tue, Aug 11, 2015 at 11:40:45AM -0400, Josef Bacik wrote: From a48cf7a9ae44a17d927df5542c8b0be287aee9ed Mon Sep 17 00:00:00 2001 From: Josef Bacik jba...@fb.com Date: Tue, 11 Aug 2015

Re: RAID0 wrong (raw) device?

2015-08-12 Thread Chris Murphy
On Wed, Aug 12, 2015 at 7:07 AM, Ulli Horlacher frams...@rus.uni-stuttgart.de wrote: /dev/sdb and /dev/sde are in reality the same physical disk! When does all of this confusion happen? Is it already confused before mkfs or only after mkfs or only after mount? I would find out what instigates

RAID0 wrong (raw) device?

2015-08-12 Thread Ulli Horlacher
I have 2 identical servers with 2 x 2 Hitachi (HGST) SATA disks (and some other disks) which are mirrored with drbd. On top of this drbd setup I have created a btrfs RAID0 filesystem. The problem now is, that btrfs shows the raw device instead of the drbd device. root@toy02:~# mkfs.btrfs

Re: Damaged filesystem, can read, can't repair, error says to contact devs

2015-08-12 Thread Timothy Normand Miller
I added a new device and then did a delete missing. I lost the terminal (should have used gnu screen), so I didn't see the stdout, but the operation aborted at some point. There's ton of output in dmesg related to this, along with some OOPSes, which I have attached as dmesg2 here:

Re: btrfs-progs: btrfs balance returns enospc error on a system with 80% free space

2015-08-12 Thread Catalin
On Tue, Aug 11, 2015 at 12:18 PM, Catalin wrote: I have a recently installed an Arch Linux x86_64 system on a 50GB btrfs partition and every time I try btrfs balance start it gives me an enospc error even though I have less than 20% of the available space full. I have tried the recommended

Re: bedup --defrag freezing

2015-08-12 Thread Konstantin Svist
On 08/06/2015 04:10 AM, Austin S Hemmelgarn wrote: On 2015-08-05 17:45, Konstantin Svist wrote: Hi, I've been running btrfs on Fedora for a while now, with bedup --defrag running in a night-time cronjob. Last few runs seem to have gotten stuck, without possibility of even killing the

Re: RAID0 wrong (raw) device?

2015-08-12 Thread Hugo Mills
[adding Ulli back into the cc list] On Wed, Aug 12, 2015 at 11:03:00AM -0600, Chris Murphy wrote: On Wed, Aug 12, 2015 at 7:07 AM, Ulli Horlacher frams...@rus.uni-stuttgart.de wrote: /dev/sdb and /dev/sde are in reality the same physical disk! When does all of this confusion happen? Is

Re: Damaged filesystem, can read, can't repair, error says to contact devs

2015-08-12 Thread Chris Murphy
There are hardware problems here... [112531.319224] ata4.00: exception Emask 0x0 SAct 0x0 SErr 0x0 action 0x6 frozen [112531.319231] ata4.00: failed command: WRITE DMA EXT [112531.319240] ata4.00: cmd 35/00:00:00:8d:46/00:04:08:00:00/e0 tag 0 dma 524288 out res

Re: kernel BUG at fs/btrfs/extent-tree.c:8113! (4.1.3 kernel)

2015-08-12 Thread Marc MERLIN
On Wed, Aug 12, 2015 at 12:18:45PM -0400, Josef Bacik wrote: Going to need more info to figure this one out Thanks for the patch, here's the output: enabling repair mode Checking filesystem on /dev/mapper/crypt_sdd1 UUID: 024ba4d0-dacb-438d-9f1b-eeb34083fe49 checking extents wtf, parent

Re: RAID0 wrong (raw) device?

2015-08-12 Thread Chris Murphy
On Wed, Aug 12, 2015 at 11:43 AM, Hugo Mills h...@carfax.org.uk wrote: [adding Ulli back into the cc list] On Wed, Aug 12, 2015 at 11:03:00AM -0600, Chris Murphy wrote: On Wed, Aug 12, 2015 at 7:07 AM, Ulli Horlacher frams...@rus.uni-stuttgart.de wrote: /dev/sdb and /dev/sde are in reality

Re: bedup --defrag freezing

2015-08-12 Thread Chris Murphy
On Wed, Aug 12, 2015 at 12:44 PM, Konstantin Svist fry@gmail.com wrote: On 08/06/2015 04:10 AM, Austin S Hemmelgarn wrote: On 2015-08-05 17:45, Konstantin Svist wrote: Hi, I've been running btrfs on Fedora for a while now, with bedup --defrag running in a night-time cronjob. Last few

Re: raid1 on uneven-sized disks

2015-08-12 Thread Jim MacBaine
On Mon, Aug 10, 2015 at 6:13 AM, Duncan 1i5t5.dun...@cox.net wrote: We do recommend that you stay relatively current on both kernel and userspace, however. So a current 4.1 series kernel and btrfs-progs 4.1.2 are excellent, but consider another filesystem if you're the type who was still on