Re: fresh btrfs filesystem, out of disk space, hundreds of gigs free

2014-03-22 Thread Jon Nelson
Duncan 1i5t5.duncan at cox.net writes: Jon Nelson posted on Fri, 21 Mar 2014 19:00:51 -0500 as excerpted: Using openSUSE 13.1 on x86_64 which - as of this writing - is 3.11.10, Would a more recent kernel than 3.11 have done me any good? [Reordered the kernel question from below to here

fresh btrfs filesystem, out of disk space, hundreds of gigs free

2014-03-21 Thread Jon Nelson
Using openSUSE 13.1 on x86_64 which - as of this writing - is 3.11.10, I tried to copy a bunch of files over to a btrfs filesystem (which was mounted as /, in fact). After some time, things ground to a halt and I got out of disk space errors. btrfs fi df / showed about 1TB of *data* free, and

report: my btrfs filesystem failed hard today

2013-07-17 Thread Jon Nelson
I had a btrfs filesystem under 3.9.8 that failed /hard/ today. So hard that the filesystem could not be mounted because there wasn't enough free space, unless it was mounted read only. This happened after I ran out of metadata space (is there a way to increase the amount of metadata storage)

Re: report: my btrfs filesystem failed hard today

2013-07-17 Thread Jon Nelson
On Wed, Jul 17, 2013 at 6:04 PM, Josef Bacik jba...@fusionio.com wrote: On Wed, Jul 17, 2013 at 05:44:23PM -0500, Jon Nelson wrote: I had a btrfs filesystem under 3.9.8 that failed /hard/ today. So hard that the filesystem could not be mounted because there wasn't enough free space, unless

Re: hang on 3.9, 3.10-rc5

2013-06-21 Thread Jon Nelson
On Fri, Jun 21, 2013 at 6:30 AM, Chris Mason chris.ma...@fusionio.com wrote: Quoting Jon Nelson (2013-06-20 21:46:46) Is this what you are looking for? After this, the CPU gets stuck and I have to reboot. [360491.932226] [ cut here ] [360491.932261] kernel BUG

Re: hang on 3.9, 3.10-rc5

2013-06-20 Thread Jon Nelson
] ---[ end trace 1475a0830dcadf9c ]--- [360491.946051] note: btrfs-endio-wri[22166] exited with preempt_count 1 On Thu, Jun 20, 2013 at 8:11 PM, Chris Mason chris.ma...@fusionio.com wrote: Quoting Jon Nelson (2013-06-18 13:19:04) Josef Bacik jbacik at fusionio.com writes: On Tue, Jun 11, 2013 at 11

Re: hang on 3.9, 3.10-rc5

2013-06-18 Thread Jon Nelson
Josef Bacik jbacik at fusionio.com writes: On Tue, Jun 11, 2013 at 11:43:30AM -0400, Sage Weil wrote: I'm also seeing this hang regularly with both 3.9 and 3.10-rc5. Is this is a known problem? In this case there is no powercycling; just a regular ceph-osd workload. .. I'm able to

Re: WARNING: at fs/btrfs/free-space-cache.c:921 __btrfs_write_out_cache+0x6b9/0x9a0 [btrfs]()

2013-05-01 Thread Jon Nelson
Josef Bacik jbacik at fusionio.com writes: .. Ok well that's not good, I'm not sure how you got a 156 gigabyte block group, but thats why that warning is going off. Can you pull btrfs-image down from here git://github.com/josefbacik/btrfs-progs.git What is the difference between this

Re: corrupted filesystem. mounts with -o recovery,ro but not -o recovery or -o ro

2013-04-23 Thread Jon Nelson
a reboot. I *can* mount this with -o recovery,ro but nothing else works. On Sat, Apr 20, 2013 at 8:20 AM, Jon Nelson jnel...@jamponi.net wrote: Using 3.8.8, I tried mounting with -o recovery and -o recovery,nospace_cache (which shouldn't be any different, if I'm understanding the kernel sources

Re: corrupted filesystem. mounts with -o recovery,ro but not -o recovery or -o ro

2013-04-23 Thread Jon Nelson
On Tue, Apr 23, 2013 at 10:03 AM, Liu Bo bo.li@oracle.com wrote: Can you please show us where it BUG_ON(or logs) when mounting with -o recovery? (the stack info below seems not to be a resulf of '-o recovery'?) I have this from 3.8.8: 2013-04-19T21:19:47.060937-05:00 turnip kernel: [

Re: minor patch to cmds-restore.c

2013-04-20 Thread Jon Nelson
On Fri, Apr 19, 2013 at 11:48 PM, Eric Sandeen sand...@redhat.com wrote: On 4/19/13 7:11 PM, Jon Nelson wrote: The following is a minor patch to cmds-restore.c Hi Jon - just a note - When sending a patch like this, it's best to follow the standard patch format, which closely mimics

Re: corrupted filesystem. mounts with -o recovery,ro but not -o recovery or -o ro

2013-04-20 Thread Jon Nelson
to tell the superblock to go ahead and use backup root #1 in this case? On Tue, Apr 16, 2013 at 11:44 AM, Jon Nelson jnel...@jamponi.net wrote: Tried to mount with -o recovery using 3.8.7. No change. Does anybody have any suggestions? On Sat, Apr 13, 2013 at 6:21 PM, Jon Nelson jnel

minor patch to cmds-restore.c

2013-04-19 Thread Jon Nelson
The following is a minor patch to cmds-restore.c diff --git a/cmds-restore.c b/cmds-restore.c index c75e187..273c813 100644 --- a/cmds-restore.c +++ b/cmds-restore.c @@ -917,14 +917,16 @@ out: } const char * const cmd_restore_usage[] = { - btrfs restore [options] device, + btrfs

Re: corrupted filesystem. mounts with -o recovery,ro but not -o recovery or -o ro

2013-04-16 Thread Jon Nelson
Tried to mount with -o recovery using 3.8.7. No change. Does anybody have any suggestions? On Sat, Apr 13, 2013 at 6:21 PM, Jon Nelson jnel...@jamponi.net wrote: I have a 4-disk btrfs filesystem in raid1 mode. I'm running openSUSE 12.3, 3.7.10, x86_64. A few days ago something went wrong

corrupted filesystem. mounts with -o recovery,ro but not -o recovery or -o ro

2013-04-13 Thread Jon Nelson
I have a 4-disk btrfs filesystem in raid1 mode. I'm running openSUSE 12.3, 3.7.10, x86_64. A few days ago something went wrong and the filesystem re-mounted itself RO. After reboot, it didn't come up. After a fair bit of work, I can get the filesystem to mount with -o recovery,ro. However, if I

device error stats resettable?

2013-04-01 Thread Jon Nelson
I have a device that is part of a 4-device btrfs raid1 setup. I had accidentally jiggled the cable for this device and it started racking up errors (about 90,000). After fixing the cable (and a scrub), all of the errors are fixed (woo!), but the device still shows lots of errors. Is there a way to

Re: device error stats resettable?

2013-04-01 Thread Jon Nelson
On Mon, Apr 1, 2013 at 5:39 PM, anand jain anand.j...@oracle.com wrote: Is there a way to reset the device error count? there is -z, is it not what you are looking for ? -- # btrfs dev stat --help usage: btrfs device stats [-z] path|device Show current device IO stats. -z to

Re: btrfs-show vs. btrfs different output

2013-03-22 Thread Jon Nelson
On Thu, Mar 21, 2013 at 11:25 AM, Eric Sandeen sand...@redhat.com wrote: On 3/21/13 10:29 AM, Jon Nelson wrote: On Thu, Mar 21, 2013 at 10:11 AM, Eric Sandeen sand...@redhat.com wrote: On 3/21/13 10:04 AM, Jon Nelson wrote: ... 2. the current git btrfs-show and btrfs fi show both output

crash with 3.7.10 and balance.

2013-03-22 Thread Jon Nelson
I'm running openSUSE 12.3 on x86_64. I was running a balance: btrfs balance -dusage=5 -v / using the latest btrfs tools code from git (as of this writing) and got a crash: [304158.496250] btrfs: found 75 extents [304159.309289] btrfs: relocating block group 2303295684608 flags 17 [304159.839886]

btrfs-show vs. btrfs different output

2013-03-21 Thread Jon Nelson
I'm running openSUSE 12.3 x86_64 which has an unknown git version, but reports v0.19. I'm also supplying the output from git which reports itself as: v0.20-rc1-253-g7854c8b The problem is that btrfs-show (git) and btrfs fi show (git) give /different/ output from each other which is also different

Re: btrfs-show vs. btrfs different output

2013-03-21 Thread Jon Nelson
On Thu, Mar 21, 2013 at 10:11 AM, Eric Sandeen sand...@redhat.com wrote: On 3/21/13 10:04 AM, Jon Nelson wrote: ... 2. the current git btrfs-show and btrfs fi show both output *different* devices for device with UUID b5dc52bd-21bf-4173-8049-d54d88c82240, and they're both wrong. does blkid

Re: hunt for 2.6.37 dm-crypt+ext4 corruption? (was: Re: dm-crypt barrier support is effective)

2010-12-13 Thread Jon Nelson
On Sun, Dec 12, 2010 at 8:06 PM, Ted Ts'o ty...@mit.edu wrote: On Sun, Dec 12, 2010 at 07:11:28AM -0600, Jon Nelson wrote: I'm glad you've been able to reproduce the problem! If you should need any further assistance, please do not hesitate to ask. This patch seems to fix the problem for me

Re: hunt for 2.6.37 dm-crypt+ext4 corruption? (was: Re: dm-crypt barrier support is effective)

2010-12-12 Thread Jon Nelson
On Sat, Dec 11, 2010 at 9:16 PM, Jon Nelson jnel...@jamponi.net wrote: On Sat, Dec 11, 2010 at 7:40 PM, Ted Ts'o ty...@mit.edu wrote: Yes, indeed.  Is this in the virtualized environment or on real hardware at this point?  And how many CPU's do you have configured in your virtualized

Re: hunt for 2.6.37 dm-crypt+ext4 corruption? (was: Re: dm-crypt barrier support is effective)

2010-12-12 Thread Jon Nelson
On Sun, Dec 12, 2010 at 6:43 AM, Ted Ts'o ty...@mit.edu wrote: On Sun, Dec 12, 2010 at 04:18:29AM -0600, Jon Nelson wrote: I have one CPU configured in the environment, 512MB of memory. I have not done any memory-constriction tests whatsoever. I've finally been able to reproduce it myself

Re: hunt for 2.6.37 dm-crypt+ext4 corruption? (was: Re: dm-crypt barrier support is effective)

2010-12-11 Thread Jon Nelson
On Sat, Dec 11, 2010 at 7:40 PM, Ted Ts'o ty...@mit.edu wrote: Yes, indeed. Is this in the virtualized environment or on real hardware at this point? And how many CPU's do you have configured in your virtualized environment, and how memory memory? Is having a certain number of CPU's

Re: hunt for 2.6.37 dm-crypt+ext4 corruption? (was: Re: dm-crypt barrier support is effective)

2010-12-10 Thread Jon Nelson
On Fri, Dec 10, 2010 at 12:52 AM, Jon Nelson jnel...@jamponi.net wrote: On Thu, Dec 9, 2010 at 8:38 PM, Ted Ts'o ty...@mit.edu wrote: On Fri, Dec 10, 2010 at 02:53:30AM +0100, Matt wrote: Try a kernel before 5a87b7a5da250c9be6d757758425dfeaf8ed3179 from the tests I've done that one showed

Re: hunt for 2.6.37 dm-crypt+ext4 corruption? (was: Re: dm-crypt barrier support is effective)

2010-12-10 Thread Jon Nelson
On Fri, Dec 10, 2010 at 8:58 AM, Jon Nelson jnel...@jamponi.net wrote: On Fri, Dec 10, 2010 at 12:52 AM, Jon Nelson jnel...@jamponi.net wrote: On Thu, Dec 9, 2010 at 8:38 PM, Ted Ts'o ty...@mit.edu wrote: On Fri, Dec 10, 2010 at 02:53:30AM +0100, Matt wrote: Try a kernel before

Re: hunt for 2.6.37 dm-crypt+ext4 corruption? (was: Re: dm-crypt barrier support is effective)

2010-12-10 Thread Jon Nelson
On Fri, Dec 10, 2010 at 10:54 AM, Jon Nelson jnel...@jamponi.net wrote: On Fri, Dec 10, 2010 at 8:58 AM, Jon Nelson jnel...@jamponi.net wrote: On Fri, Dec 10, 2010 at 12:52 AM, Jon Nelson jnel...@jamponi.net wrote: On Thu, Dec 9, 2010 at 8:38 PM, Ted Ts'o ty...@mit.edu wrote: On Fri, Dec 10

Re: hunt for 2.6.37 dm-crypt+ext4 corruption? (was: Re: dm-crypt barrier support is effective)

2010-12-09 Thread Jon Nelson
On Thu, Dec 9, 2010 at 2:13 PM, Ted Ts'o ty...@mit.edu wrote: On Thu, Dec 09, 2010 at 12:10:58PM -0600, Jon Nelson wrote: You should be OK, there. Are you using encryption or no? I had difficulty replicating the issue without encryption. Yes, I'm using encryption.  LUKS with aes-xts-plain

Re: hunt for 2.6.37 dm-crypt+ext4 corruption? (was: Re: dm-crypt barrier support is effective)

2010-12-09 Thread Jon Nelson
On Thu, Dec 9, 2010 at 8:38 PM, Ted Ts'o ty...@mit.edu wrote: On Fri, Dec 10, 2010 at 02:53:30AM +0100, Matt wrote: Try a kernel before 5a87b7a5da250c9be6d757758425dfeaf8ed3179 from the tests I've done that one showed the least or no corruption if you count the empty /etc/env.d/03opengl as

Re: hunt for 2.6.37 dm-crypt+ext4 corruption? (was: Re: dm-crypt barrier support is effective)

2010-12-08 Thread Jon Nelson
On Tue, Dec 7, 2010 at 9:37 PM, Jon Nelson jnel...@jamponi.net wrote: On Tue, Dec 7, 2010 at 1:35 PM, Ted Ts'o ty...@mit.edu wrote: On Tue, Dec 07, 2010 at 01:22:43PM -0500, Mike Snitzer wrote: 1. create a database (from bash): createdb test 2. place the following contents in a file (I

Re: hunt for 2.6.37 dm-crypt+ext4 corruption? (was: Re: dm-crypt barrier support is effective)

2010-12-07 Thread Jon Nelson
I finally found some time to test this out. With 2.6.37-rc4 (openSUSE KOTD kernel) I easily encounter the issue. Using a virtual machine, I created a stock, minimal openSUSE 11.3 x86_64 install, installed all updates, installed postgresql and the 'KOTD' (Kernel of the Day) kernel, and ran the

Re: hunt for 2.6.37 dm-crypt+ext4 corruption? (was: Re: dm-crypt barrier support is effective)

2010-12-07 Thread Jon Nelson
On Tue, Dec 7, 2010 at 12:22 PM, Mike Snitzer snit...@redhat.com wrote: On Tue, Dec 07 2010 at  1:10pm -0500, Jon Nelson jnel...@jamponi.net wrote: I finally found some time to test this out. With 2.6.37-rc4 (openSUSE KOTD kernel) I easily encounter the issue. Using a virtual machine, I

Re: hunt for 2.6.37 dm-crypt+ext4 corruption? (was: Re: dm-crypt barrier support is effective)

2010-12-07 Thread Jon Nelson
On Tue, Dec 7, 2010 at 12:52 PM, Chris Mason chris.ma...@oracle.com wrote: Excerpts from Jon Nelson's message of 2010-12-07 13:45:14 -0500: On Tue, Dec 7, 2010 at 12:22 PM, Mike Snitzer snit...@redhat.com wrote: On Tue, Dec 07 2010 at  1:10pm -0500, Jon Nelson jnel...@jamponi.net wrote

Re: hunt for 2.6.37 dm-crypt+ext4 corruption? (was: Re: dm-crypt barrier support is effective)

2010-12-07 Thread Jon Nelson
On Tue, Dec 7, 2010 at 2:02 PM, Chris Mason chris.ma...@oracle.com wrote: Excerpts from Jon Nelson's message of 2010-12-07 14:34:40 -0500: On Tue, Dec 7, 2010 at 12:52 PM, Chris Mason chris.ma...@oracle.com wrote: postgresql errors. Typically, header corruption but from the limited

Re: hunt for 2.6.37 dm-crypt+ext4 corruption? (was: Re: dm-crypt barrier support is effective)

2010-12-07 Thread Jon Nelson
On Tue, Dec 7, 2010 at 2:33 PM, Chris Mason chris.ma...@oracle.com wrote: Excerpts from Jon Nelson's message of 2010-12-07 15:25:47 -0500: On Tue, Dec 7, 2010 at 2:02 PM, Chris Mason chris.ma...@oracle.com wrote: Excerpts from Jon Nelson's message of 2010-12-07 14:34:40 -0500: On Tue, Dec 7,

Re: hunt for 2.6.37 dm-crypt+ext4 corruption? (was: Re: dm-crypt barrier support is effective)

2010-12-07 Thread Jon Nelson
On Tue, Dec 7, 2010 at 2:41 PM, Chris Mason chris.ma...@oracle.com wrote: Excerpts from Jon Nelson's message of 2010-12-07 15:25:47 -0500: On Tue, Dec 7, 2010 at 2:02 PM, Chris Mason chris.ma...@oracle.com wrote: Excerpts from Jon Nelson's message of 2010-12-07 14:34:40 -0500: On Tue, Dec 7,

Re: hunt for 2.6.37 dm-crypt+ext4 corruption? (was: Re: dm-crypt barrier support is effective)

2010-12-07 Thread Jon Nelson
On Tue, Dec 7, 2010 at 1:35 PM, Ted Ts'o ty...@mit.edu wrote: On Tue, Dec 07, 2010 at 01:22:43PM -0500, Mike Snitzer wrote: 1. create a database (from bash): createdb test 2. place the following contents in a file (I used 't.sql'): begin; create temporary table foo as select x as

Re: hunt for 2.6.37 dm-crypt+ext4 corruption? (was: Re: dm-crypt barrier support is effective)

2010-12-07 Thread Jon Nelson
On Tue, Dec 7, 2010 at 3:02 PM, Chris Mason chris.ma...@oracle.com wrote: Excerpts from Jon Nelson's message of 2010-12-07 15:48:58 -0500: On Tue, Dec 7, 2010 at 2:41 PM, Chris Mason chris.ma...@oracle.com wrote: Excerpts from Jon Nelson's message of 2010-12-07 15:25:47 -0500: On Tue, Dec 7,

Re: hunt for 2.6.37 dm-crypt+ext4 corruption? (was: Re: dm-crypt barrier support is effective)

2010-12-07 Thread Jon Nelson
On Tue, Dec 7, 2010 at 1:35 PM, Ted Ts'o ty...@mit.edu wrote: On Tue, Dec 07, 2010 at 01:22:43PM -0500, Mike Snitzer wrote: 1. create a database (from bash): createdb test 2. place the following contents in a file (I used 't.sql'): begin; create temporary table foo as select x as

Re: hunt for 2.6.37 dm-crypt+ext4 corruption? (was: Re: dm-crypt barrier support is effective)

2010-12-07 Thread Jon Nelson
On Tue, Dec 7, 2010 at 2:41 PM, Chris Mason chris.ma...@oracle.com wrote: Excerpts from Jon Nelson's message of 2010-12-07 15:25:47 -0500: On Tue, Dec 7, 2010 at 2:02 PM, Chris Mason chris.ma...@oracle.com wrote: Excerpts from Jon Nelson's message of 2010-12-07 14:34:40 -0500: On Tue, Dec 7,

Re: dm-crypt barrier support is effective

2010-12-01 Thread Jon Nelson
On Wed, Dec 1, 2010 at 12:24 PM, Milan Broz mb...@redhat.com wrote: On 12/01/2010 06:35 PM, Matt wrote: Thanks for pointing to v6 ! I hadn't noticed that there was a new one :) Well, so I'll restore my box to a working/productive state and will try out v6 (I'm pretty confident that it'll work

weird ENOSPC with defragment directory

2010-08-16 Thread Jon Nelson
Most other directories on /var/cache, *except* those created by squid, can be defragmented. The filesystem was converted from ext3/4. turnip:~ # uname -a Linux turnip 2.6.34-12-default #1 SMP 2010-06-29 02:39:08 +0200 x86_64 x86_64 x86_64 GNU/Linux (stock openSUSE 11.3 kernel) turnip:~ btrfsctl

Re: weird ENOSPC with defragment directory

2010-08-16 Thread Jon Nelson
On Mon, Aug 16, 2010 at 10:15 PM, Jon Nelson jnel...@jamponi.net wrote: Most other directories on /var/cache, *except* those created by squid, can be defragmented. The filesystem was converted from ext3/4. turnip:~ # uname -a Linux turnip 2.6.34-12-default #1 SMP 2010-06-29 02:39:08 +0200