Hi,

I experienced the same problems. The elevator=deadline parameter didn't help. 
But increasing the threshold to 60 did it. I think you could decrease the 
threshold, but didn't test it. In another posting, it is said to take a timeout 
between 60 and 90 seconds. This would mean a threshold between 31 and 46.

I'll test this later.

Best regards,
Christian


-----Ursprüngliche Nachricht-----
Von: [EMAIL PROTECTED] [mailto:[EMAIL PROTECTED] Im Auftrag von Weller, Michael
Gesendet: Sonntag, 2. April 2006 14:18
An: Silviu Marin-Caea; [email protected]
Betreff: Re: [Ocfs2-users] heartbeat write timeout

Thx for the hints, I'll try that.

With regards to the updates, while I generally agree, I can't update the kernel 
here, because we'll loose vendor warranty in that case. I know this is an odd 
concept, but that's how it works. We'll even loose Oracle support because the 
kernel update would void HP SAN-support.

I mentioned SAN Failover, which for example does not work with current kernel 
and current (even the not so current HP checked variant) Qlogic driver.

Anyway, I'll try your suggestions on monday and drop the list a note if it 
worked.

Thanks,
Michael.

 ---

Dr. Michael Weller

ITZ Informationstechnologie GmbH
Consulting/Systemengineering
Bismarckstrasse 57
D-45128 Essen

Phone Office    +49 201 24714 28
FAX   Office    +49 201 24714 33
Phone Mobile    +49 172 2178078
E-Mail          mailto:[EMAIL PROTECTED]
> -----Ursprüngliche Nachricht-----
> Von: [EMAIL PROTECTED] [mailto:ocfs2-users-
> [EMAIL PROTECTED] Im Auftrag von Silviu Marin-Caea
> Gesendet: Sonntag, 2. April 2006 08:26
> An: [email protected]
> Betreff: Re: [Ocfs2-users] heartbeat write timeout
> 
> On Saturday 01 April 2006 22:36, Weller, Michael wrote:
> 
> > we are bound to SLES9SP3 (and EXACTLY that, nothing less, not a patch
> > more)
> 
> Having latest updates does not hurt, on the contrary, it helps.  For
> example,
> the latest kernel has OCFS2 1.1.8, while the kernel from SP3 has 1.1.7.
> There are a number of bugfixes.
> 
> SLES updates do really have a purpose.  Apply them after testing in a
> non-production system.
> 
> > It locks up immediately. Definitely nothing like a 12s timeout expires.
> 
> It just looks like it's immediate, actually, the 12s do expire.
> 
> > You mention a FAQ regarding some config option which I didn't come
> > across up to now, where can I find it?
> 
> /boot/grub/menu.lst
> 
> change elevator=cfq to elevator=deadline
> 
> http://oss.oracle.com/projects/ocfs2/
> scroll down, look at the red text
> 
> > Which options would you recommend to fix the problem or at least make
> > locks much less likely.
> 
> You could also increase the timeout:
> 
> /etc/sysconfig/o2cb
> 
> # O2CB_HEARTBEAT_THRESHOLD: Iterations before a node is considered dead.
> O2CB_HEARTBEAT_THRESHOLD=16
> 
> 
> _______________________________________________
> Ocfs2-users mailing list
> [email protected]
> http://oss.oracle.com/mailman/listinfo/ocfs2-users



_______________________________________________
Ocfs2-users mailing list
[email protected]
http://oss.oracle.com/mailman/listinfo/ocfs2-users

Reply via email to