Hot rb_next, setup_cluster_no_bitmap

2011-08-03 Thread Simon Kirby
Hi! Since upgrading from 2.6.35+bits to 2.6.38 and then more recently to 3.0, our big btrfs backup box with 20 * 3 TB AOE-attached btrfs volumes started showing more CPU usage and backups were no longer completing in a day. I tried Linus HEAD from yesterday merged with btrfs for-linus (same as

Re: Hot rb_next, setup_cluster_no_bitmap

2011-08-03 Thread Simon Kirby
On Wed, Aug 03, 2011 at 03:06:55PM -0700, Simon Kirby wrote: I see Josef's 86d4a77ba3dc4ace238a0556541a41df2bd71d49 introduced the bitmaps list. I could try temporarily reverting this (some fixups needed) if anybody thinks my cache bouncing idea might be slightly possible. I'll try

Re: Hot rb_next, setup_cluster_no_bitmap

2011-08-03 Thread Simon Kirby
On Wed, Aug 03, 2011 at 03:39:49PM -0700, Simon Kirby wrote: On Wed, Aug 03, 2011 at 03:06:55PM -0700, Simon Kirby wrote: I see Josef's 86d4a77ba3dc4ace238a0556541a41df2bd71d49 introduced the bitmaps list. I could try temporarily reverting this (some fixups needed) if anybody thinks my

Re: Hot rb_next, setup_cluster_no_bitmap

2011-08-03 Thread Simon Kirby
Perhaps as a further clue as to what is going on, on this same backup box after all of the rsyncs are finished/killed and a good amount of time has passed (no cleaner processes running in the background or anything), sync is still consistently takes ~4 minutes to run, and pushes out a lot to disk

Re: Hot rb_next, setup_cluster_no_bitmap

2011-08-04 Thread Simon Kirby
On Thu, Aug 04, 2011 at 10:04:29AM -0400, Chris Mason wrote: Excerpts from Simon Kirby's message of 2011-08-03 21:32:10 -0400: Perhaps as a further clue as to what is going on, on this same backup box after all of the rsyncs are finished/killed and a good amount of time has passed (no

Re: [PATCH] Btrfs: fix orphan cleanup regression

2011-09-21 Thread Simon Kirby
on. Thanks, Reported-by: Simon Kirby s...@hostway.ca Signed-off-by: Josef Bacik jo...@redhat.com Seems to fix the problem, and looks good. Thanks! Simon- Tested-by: Simon Kirby s...@hostway.ca --- fs/btrfs/inode.c | 36 +--- 1 files changed, 17 insertions

Re: [PATCH] Btrfs: fix orphan cleanup regression

2011-10-03 Thread Simon Kirby
On Wed, Sep 21, 2011 at 02:40:06PM -0700, Simon Kirby wrote: On Wed, Sep 21, 2011 at 04:57:56PM -0400, Josef Bacik wrote: In fixing how we deal with bad inodes, we had a regression in the orphan cleanup code, since it expects to get a bad inode back. So fix it to deal with getting

Re: warning and bug on 3.2-rc4 + for-linus from yesterday

2011-12-09 Thread Simon Kirby
On Fri, Dec 09, 2011 at 12:39:48PM -0800, Simon Kirby wrote: Hello! We recently upgraded our backup server kernel (rsync with snapshots and compression) to Linus git master from yesterday (3.2-rc4+ 09d9673d53005) that contains the btrfs for-linus as of yesterday. We've been seeing a few

Re: warning and bug on 3.2-rc4 + for-linus from yesterday

2011-12-12 Thread Simon Kirby
Hello! On Mon, Dec 12, 2011 at 01:30:31AM +0100, David Sterba wrote: On Fri, Dec 09, 2011 at 12:39:48PM -0800, Simon Kirby wrote: [ cut here ] WARNING: at mm/page-writeback.c:1763 __set_page_dirty_nobuffers+0x17b/0x190() Hardware name: PowerEdge 1950 Modules

Re: remote mirroring in the works?

2010-08-31 Thread Simon Kirby
On Mon, Aug 30, 2010 at 11:14:51AM -0700, K. Richard Pixley wrote: On 20100830 10:59, Roy Sigurd Karlsbakk wrote: I think drbd does precisely what you want. It's not useful for fault tolerance, nor for load balancing, but it will produce a remote block copy that can be used as a sort of

Re: remote mirroring in the works?

2010-08-31 Thread Simon Kirby
On Tue, Aug 31, 2010 at 07:07:29AM +0200, Fred van Zwieten wrote: Hmmm, maybe, but rsync would take a lot of time to find the changes. the actual blocks of a snap _are_ the changes, that's why SnapMirror is very efficient. And, I don't see how rsync will retain the snap's between both sites.

2.6.35.4 fumble-spolision

2010-09-09 Thread Simon Kirby
Hello! We seem to have done something to our backup volumes while switching between 2.6.33.2 and 2.6.35.4 which is now causing this crash while booting either kernel and mounting one of the volumes. Crash from 3.6.35.4 plus Li Zefan's free-space-cache.c patches. extent-tree.c is untouched, but

Re: 2.6.35.4 fumble-spolision

2010-09-10 Thread Simon Kirby
Sorry, I meant to change that subject. :) On Thu, Sep 09, 2010 at 08:49:09PM -0400, Chris Mason wrote: What does btrfsck say? Does the problem go away in 2.6.33? Will try 2.6.33 again tomorrow. btrfsck runs for about 44 seconds and then outputs as copied below. Simon- # btrfsck

Re: 2.6.35.4 fumble-spolision

2010-09-10 Thread Simon Kirby
On Fri, Sep 10, 2010 at 04:21:16PM +0800, Yan, Zheng wrote: ? ? ? ? ? ? ? ?tree block key (5643880 60 602) level 0 ? ? ? ? ? ? ? ?tree block backref root 853 failed to find block number 1923814297600 Abort How large is the FS ? Is it possible to run btrfs-image and send the output

btrfs-debug-image crash

2010-09-10 Thread Simon Kirby
I was running this while the fs was mounted, which is probably not what was expected, but here's a crash report anyway. :) Simon- [/usr/src/btrfs-tools/btrfs-tools-0.19+20100601]# gdb --args ./btrfs-debug-tree /dev/etherd/e14.0 GNU gdb 6.8-debian Copyright (C) 2008 Free Software Foundation,

ring btrfs

2010-09-29 Thread Simon Kirby
Just curious... I've been wondering if it would possible/useful to make a btrfs mode where it _always_ writes in a ring on the entire disk. Since it can write anywhere already, it could just write sequentially always. The write process would have to be changed to read a blob, throw away the

btrfsck failures on old backup volumes

2012-05-16 Thread Simon Kirby
Hi! We have some btrfs rsync-snapshot backup servers which have been running since about mid-2009, with a pretty good record so far. We've been following the development kernels and hitting some bugs here an there, but we still haven't managed to lose anything yet. The main problems we have ran

[PATCH] Btrfs: Include the device in most error printk()s

2013-02-13 Thread Simon Kirby
or fs_info is not passed to the function emitting the error. Signed-off-by: Simon Kirby s...@hostway.ca --- fs/btrfs/extent-tree.c | 49 +++ fs/btrfs/free-space-cache.c | 28 +++-- fs/btrfs/inode.c| 49

[3.8-rc7] Locking issues on snapshot cleanup?

2013-02-15 Thread Simon Kirby
We were having a problem with I/O to one btrfs volume on a 3.8-rc6 box hanging completely with no CPU usage or I/O (according to iostat, iotop). Here is after a reboot to 3.8-rc7 with lockdep enabled: [ 317.024264] btrfs: disk space caching is enabled [ 526.036161] INFO: task btrfs-cleaner:5281

[PATCH] Btrfs: Include the device in most error printk()s

2013-02-20 Thread Simon Kirby
() message rather than at each caller. As suggested by David Sterba, unsplit strings to improve searchability. Signed-off-by: Simon Kirby s...@hostway.ca --- fs/btrfs/ctree.h| 21 +++-- fs/btrfs/extent-tree.c | 64 --- fs/btrfs/free

[PATCH v2] Btrfs: Include the device in most error printk()s

2013-03-19 Thread Simon Kirby
() message rather than at each caller. Signed-off-by: Simon Kirby s...@hostway.ca Reviewed-by: David Sterba dste...@suse.cz --- v2: - Ported to btrfs-next - unsplit strings to improve searchability - Change auto deleting messages to btrfs_debug - Fix a typo fs/btrfs/ctree.h| 21