Re: btrfs issues in 3.14

2014-05-09 Thread Liu Bo
On Thu, May 08, 2014 at 10:51:03AM -0300, Kenny MacDermid wrote:
 On Wed, May 7, 2014 at 11:48 PM, Liu Bo bo.li@oracle.com wrote:
 
  On Wed, May 07, 2014 at 09:35:06AM -0300, Kenny MacDermid wrote:
   On Tue, May 6, 2014 at 11:22 PM, Liu Bo bo.li@oracle.com wrote:
   
What does sysrq+w say when the hang happens?
  
   The whole system isn't hung, I may have explained that wrong. The
   system will hang if I try to shutdown, and the process will hang if I
   try to kill -9 it.
  
   It looks like the browser is in this state currently so I did an 'echo
   w /proc/sysrq-trigger' and have attached the full dmesg with the
   browser issues and the output.
 
  Those stacks show the blocked tasks are waiting for a page's writeback, but
  they don't show what blocks the endio process of that page.
 
  I'd recommand you to try the lastest 3.15.0-rc4 or btrfs-next, as many fixes
  are merged during this period.
 
 
 Thank you, I upgraded to the Arch package for 3.15.0-1-mainline (it's
 rc4) and I'll let you know if the errors reoccur.

FYI, this patch seems to address your problem.

Btrfs: fix hang on error (such as ENOSPC) when writing extent pages
https://patchwork.kernel.org/patch/4139971/

-liubo

 
 Should the filesystem be rebuilt again?
 
 A 'btrfs check' of it returned:
 
 checking extents
 checking free space cache
 checking fs roots
 checking csums
 checking root refs
 Checking filesystem on /dev/mapper/home
 UUID: 9a60a25f-eeb4-494c-b1af-ebd8e4f79b6b
 free space inode generation (0) did not match free space cache generation 
 (6409)
 free space inode generation (0) did not match free space cache generation 
 (6397)
 found 41686685877 bytes used err is 0
 total csum bytes: 74074632
 total tree bytes: 907673600
 total fs tree bytes: 807567360
 total extent tree bytes: 18251776
 btree space waste bytes: 116552179
 file data blocks allocated: 112191107072
  referenced 75535110144
 Btrfs v3.14.1
--
To unsubscribe from this list: send the line unsubscribe linux-btrfs in
the body of a message to majord...@vger.kernel.org
More majordomo info at  http://vger.kernel.org/majordomo-info.html


Re: btrfs issues in 3.14

2014-05-08 Thread Kenny MacDermid
On Wed, May 7, 2014 at 11:48 PM, Liu Bo bo.li@oracle.com wrote:

 On Wed, May 07, 2014 at 09:35:06AM -0300, Kenny MacDermid wrote:
  On Tue, May 6, 2014 at 11:22 PM, Liu Bo bo.li@oracle.com wrote:
  
   What does sysrq+w say when the hang happens?
 
  The whole system isn't hung, I may have explained that wrong. The
  system will hang if I try to shutdown, and the process will hang if I
  try to kill -9 it.
 
  It looks like the browser is in this state currently so I did an 'echo
  w /proc/sysrq-trigger' and have attached the full dmesg with the
  browser issues and the output.

 Those stacks show the blocked tasks are waiting for a page's writeback, but
 they don't show what blocks the endio process of that page.

 I'd recommand you to try the lastest 3.15.0-rc4 or btrfs-next, as many fixes
 are merged during this period.


Thank you, I upgraded to the Arch package for 3.15.0-1-mainline (it's
rc4) and I'll let you know if the errors reoccur.

Should the filesystem be rebuilt again?

A 'btrfs check' of it returned:

checking extents
checking free space cache
checking fs roots
checking csums
checking root refs
Checking filesystem on /dev/mapper/home
UUID: 9a60a25f-eeb4-494c-b1af-ebd8e4f79b6b
free space inode generation (0) did not match free space cache generation (6409)
free space inode generation (0) did not match free space cache generation (6397)
found 41686685877 bytes used err is 0
total csum bytes: 74074632
total tree bytes: 907673600
total fs tree bytes: 807567360
total extent tree bytes: 18251776
btree space waste bytes: 116552179
file data blocks allocated: 112191107072
 referenced 75535110144
Btrfs v3.14.1
--
To unsubscribe from this list: send the line unsubscribe linux-btrfs in
the body of a message to majord...@vger.kernel.org
More majordomo info at  http://vger.kernel.org/majordomo-info.html


Re: btrfs issues in 3.14

2014-05-07 Thread Kenny MacDermid
On Wed, May 7, 2014 at 9:35 AM, Kenny MacDermid
kenny.macder...@gmail.com wrote:
 On Tue, May 6, 2014 at 11:22 PM, Liu Bo bo.li@oracle.com wrote:

 What does sysrq+w say when the hang happens?

 The whole system isn't hung, I may have explained that wrong. The
 system will hang if I try to shutdown, and the process will hang if I
 try to kill -9 it.

 It looks like the browser is in this state currently so I did an 'echo
 w /proc/sysrq-trigger' and have attached the full dmesg with the
 browser issues and the output.

I had to hard reboot to clear that issue, and I decided to do another
'btrfs check' while /home was unmounted. It generated the following
output:

checking extents
checking free space cache
Wanted bytes 45056, found 32768 for off 63805808640
Wanted bytes 90016, found 32768 for off 63805808640
cache appears valid but isnt 62843256832
Checking filesystem on //dev/mapper/home
UUID: 9a60a25f-eeb4-494c-b1af-ebd8e4f79b6b
found 13672418478 bytes used err is -22
total csum bytes: 72089212
total tree bytes: 906100736
total fs tree bytes: 808370176
total extent tree bytes: 18153472
btree space waste bytes: 116247440
file data blocks allocated: 101046853632
 referenced 73680674816
Btrfs v3.14.1

This is on the new filesystem. I redid the dmcrypt and the lvm lv when
I recreated the filesystem as well, so it's less than a week old.
Before rebuilding the old was was telling me:

Checking filesystem on /dev/mapper/home
UUID: 4f5d7a10-d003-48a7-a901-bf22d534888f
free space inode generation (0) did not match free space cache
generation (115200)
found 29963117667 bytes used err is 1
total csum bytes: 63740440
total tree bytes: 745504768
total fs tree bytes: 624951296
total extent tree bytes: 36749312
btree space waste bytes: 119018687
file data blocks allocated: 181026942976
 referenced 73759866880
Btrfs v0.20-rc1-358-g194aa4a-dirty

and

checking extents
checking free space cache
checking fs roots
root 257 inode 29647 errors 200, dir isize wrong
root 257 inode 391917 errors 200, dir isize wrong
root 257 inode 497392 errors 410, odd dir item, nbytes wrong
Checking filesystem on /dev/mapper/home
UUID: 4f5d7a10-d003-48a7-a901-bf22d534888f
free space inode generation (0) did not match free space cache
generation (115200)
found 31310902624 bytes used err is 1
total csum bytes: 63579480
total tree bytes: 743342080
total fs tree bytes: 623198208
total extent tree bytes: 36601856
btree space waste bytes: 118906643
file data blocks allocated: 180831965184
 referenced 73631731712
Btrfs v3.14
--
To unsubscribe from this list: send the line unsubscribe linux-btrfs in
the body of a message to majord...@vger.kernel.org
More majordomo info at  http://vger.kernel.org/majordomo-info.html


Re: btrfs issues in 3.14

2014-05-07 Thread Liu Bo
On Wed, May 07, 2014 at 09:35:06AM -0300, Kenny MacDermid wrote:
 On Tue, May 6, 2014 at 11:22 PM, Liu Bo bo.li@oracle.com wrote:
 
  What does sysrq+w say when the hang happens?
 
 The whole system isn't hung, I may have explained that wrong. The
 system will hang if I try to shutdown, and the process will hang if I
 try to kill -9 it.
 
 It looks like the browser is in this state currently so I did an 'echo
 w /proc/sysrq-trigger' and have attached the full dmesg with the
 browser issues and the output.

Those stacks show the blocked tasks are waiting for a page's writeback, but
they don't show what blocks the endio process of that page.

I'd recommand you to try the lastest 3.15.0-rc4 or btrfs-next, as many fixes
are merged during this period.

thanks,
-liubo
--
To unsubscribe from this list: send the line unsubscribe linux-btrfs in
the body of a message to majord...@vger.kernel.org
More majordomo info at  http://vger.kernel.org/majordomo-info.html


Re: btrfs issues in 3.14

2014-05-06 Thread Liu Bo
On Tue, May 06, 2014 at 08:49:04PM -0300, Kenny MacDermid wrote:
 Hello,
 
 I've been having a number of issues with processes hanging due to
 btrfs using 3.14 kernels. This seems pretty new as it has been working
 fine before. I also rebuilt the filesystem and am still receiving
 hangs.
 
 The filesystem is running on dmcrypt which is running on lvm2 which is
 running on an SSD (SAMSUNG MZMTD256HAGM-000L1).
 
 When the issue occurs the process is unable to be killed and the
 system will not fully shutdown.
 
 $ uname -a
 Linux orange 3.14.2-1-ARCH #1 SMP PREEMPT Sun Apr 27 11:28:44 CEST
 2014 x86_64 GNU/Linux
 
 $ btrfs --version
 Btrfs v3.14.1
 
 $ btrfs fi show
 Btrfs v3.14.1
 
 $ btrfs fi df /home
 Data, single: total=71.01GiB, used=68.72GiB
 System, DUP: total=8.00MiB, used=16.00KiB
 System, single: total=4.00MiB, used=0.00
 Metadata, DUP: total=1.50GiB, used=863.33MiB
 Metadata, single: total=8.00MiB, used=0.00
 
 I opened bugs 75181 and 75191 and I'll include the relevant journalctl
 entries. The kernel was upgraded from 3.14.1-1 to 3.14.2-1 during this
 time, and the filesystem was rebuilt after the orphan issue.
 
 I'm not on this list so please CC me on replies.

What does sysrq+w say when the hang happens?

-liubo
--
To unsubscribe from this list: send the line unsubscribe linux-btrfs in
the body of a message to majord...@vger.kernel.org
More majordomo info at  http://vger.kernel.org/majordomo-info.html