Re: [Ocfs2-devel] RFC: OCFS2 heartbeat improvements

2012-08-23 Thread Sunil Mushran
On Wed, Aug 22, 2012 at 9:01 PM, Jie Liu jeff@oracle.com wrote: BTW, Sunil mentioned there already has an IO priority patch set but not yet merged. However, I only searched an old posts back to 2006 at: http://www.digipedia.pl/usenet/thread/11947/7120/ Am I missing something? No, I

Re: [Ocfs2-devel] RFC: OCFS2 heartbeat improvements

2012-08-23 Thread Sunil Mushran
On Wed, Aug 22, 2012 at 8:44 PM, Tao Ma t...@tao.ma wrote: I guess the final solution will be WRITE_FUA, and I see btrfs uses it to write out the superblock. It will be handled differently by the underlying block layer so that it will not be in the elevator queue. It should work but I am not

Re: [Ocfs2-devel] RFC: OCFS2 heartbeat improvements

2012-08-23 Thread Tao Ma
On 08/24/2012 01:33 AM, Sunil Mushran wrote: On Wed, Aug 22, 2012 at 8:44 PM, Tao Ma t...@tao.ma wrote: I guess the final solution will be WRITE_FUA, and I see btrfs uses it to write out the superblock. It will be handled differently by the underlying block

Re: [Ocfs2-devel] RFC: OCFS2 heartbeat improvements

2012-08-23 Thread Jie Liu
On 08/24/12 01:25, Sunil Mushran wrote: On Wed, Aug 22, 2012 at 9:01 PM, Jie Liu jeff@oracle.com wrote: BTW, Sunil mentioned there already has an IO priority patch set but not yet merged. However, I only searched an old posts back to 2006 at:

Re: [Ocfs2-devel] RFC: OCFS2 heartbeat improvements

2012-08-22 Thread srinivas eeda
On 8/22/2012 7:17 AM, Jie Liu wrote: Hi All, These days, I am investigating an issue regarding OCFS2 unexpected reboot in some real world use cases. This problem occurred when the network status goes south, when the disk IO load is too high, etc... I suspect it might caused by ocfs2 fencing

Re: [Ocfs2-devel] RFC: OCFS2 heartbeat improvements

2012-08-22 Thread Sunil Mushran
Yes. WRITE_SYNC should be good. Not FUA. Also, you may want to look into using io priorities. The code is all there. Just needs activation. On Wed, Aug 22, 2012 at 10:13 AM, srinivas eeda srinivas.e...@oracle.com wrote: On 8/22/2012 7:17 AM, Jie Liu wrote: Hi All, These days, I am

Re: [Ocfs2-devel] RFC: OCFS2 heartbeat improvements

2012-08-22 Thread Jie Liu
On 08/23/12 01:13, srinivas eeda wrote: On 8/22/2012 7:17 AM, Jie Liu wrote: Hi All, These days, I am investigating an issue regarding OCFS2 unexpected reboot in some real world use cases. This problem occurred when the network status goes south, when the disk IO load is too high, etc...

Re: [Ocfs2-devel] RFC: OCFS2 heartbeat improvements

2012-08-22 Thread Jie Liu
On 08/23/12 01:18, Sunil Mushran wrote: Yes. WRITE_SYNC should be good. Not FUA. Also, you may want to look into using io priorities. The code is all there. Just needs activation. Yes, I'll search the list to find them out. Thanks, -Jeff On Wed, Aug 22, 2012 at 10:13 AM, srinivas eeda

Re: [Ocfs2-devel] RFC: OCFS2 heartbeat improvements

2012-08-22 Thread Tao Ma
Hi Jeff, On 08/22/2012 10:17 PM, Jie Liu wrote: Hi All, These days, I am investigating an issue regarding OCFS2 unexpected reboot in some real world use cases. This problem occurred when the network status goes south, when the disk IO load is too high, etc... I suspect it might caused by

Re: [Ocfs2-devel] RFC: OCFS2 heartbeat improvements

2012-08-22 Thread Jie Liu
Hi Tao, On 08/23/12 11:44, Tao Ma wrote: Hi Jeff, On 08/22/2012 10:17 PM, Jie Liu wrote: Hi All, These days, I am investigating an issue regarding OCFS2 unexpected reboot in some real world use cases. This problem occurred when the network status goes south, when the disk IO load is too