Re: [Ocfs2-users] Mysterious server reboot

2011-04-04 Thread Nikola Savic
Hello, If anyone is interested, here is an update on the problem we had with OCFS2 and the DLM drop ref bug. After installing kernel-2.6.32-100.0.19.el5.x86_64.rpm and the OCFS2 1.6 packages from Oracle's Centos5.5 public yum, we no longer experienced the DLM drop ref bug, and the kernel didn't oops or panic. Maybe

Re: [Ocfs2-users] Mysterious server reboot

2011-03-26 Thread Nikola Savic
Hi all, Just to keep you informed :) After 7 days of normal operation, we again had a server failure because of the OCFS2/DLM drop reference bug. I have added the log at the end of the message. We're running Centos5 with the latest available RedHat kernel 2.6.18-238.5.1.el5 and OCFS2 1.4.7 installed from

Re: [Ocfs2-users] Mysterious server reboot

2011-03-18 Thread Sunil Mushran
This specific bug (associated with the message) has been fixed here. http://oss.oracle.com/git/?p=ocfs2-1.4.git;a=commit;h=1f667766cb67ed05b4d706aa82e8ad0b12eaae8b This should result in an oops and thus panic. But just on this node. If other nodes are rebooting then I suspect some sysctl values

Re: [Ocfs2-users] Mysterious server reboot

2011-03-18 Thread Nikola Savic
Sunil Mushran wrote: This specific bug (associated with the message) has been fixed here. http://oss.oracle.com/git/?p=ocfs2-1.4.git;a=commit;h=1f667766cb67ed05b4d706aa82e8ad0b12eaae8b This should result in an oops and thus panic. But just on this node. What is the solution? Are there RPM

Re: [Ocfs2-users] Mysterious server reboot

2011-03-18 Thread Sunil Mushran
On 03/18/2011 04:56 PM, Nikola Savic wrote: Sunil Mushran wrote: This specific bug (associated with the message) has been fixed here. http://oss.oracle.com/git/?p=ocfs2-1.4.git;a=commit;h=1f667766cb67ed05b4d706aa82e8ad0b12eaae8b This should result in an oops and thus panic. But just on this

Re: [Ocfs2-users] Mysterious server reboot

2011-03-18 Thread Nikola Savic
Hi again, I have installed the newest kernel available for RHEL5 (2.6.18-238.5.1.el5) and OCFS2 packages to match. After only a few hours of running, while an rsync backup was in progress, I got the following error on one of the nodes (after which it hung and required a reset): Mar 19 03:42:52 server3 kernel: