We’ve just setup ocfs2 v1.2 on a 6-way Redhat Linux (Linux rac99003 2.6.9-34.ELsmp #1 SMP Tue Mar 7 15:16:40 CST 2006 x86_64 x86_64 x86_64 GNU/Linux) cluster.  In trying to copy over a 5gb file from a server outside the cluster to a node on the cluster, the copy always hangs the cluster being copied to.  We’ve set the elevator=deadline in the /boot/grub/grub.conf file with no luck.  Any ideas?  Here’s the /var/log/messages output when the hand occurs:

 

Apr 10 15:20:22 rac99003 kernel: (5518,0):__dlm_print_nodes:384 Nodes in my domain ("AD8CB11991A54F7B87050F9336E43B77"):

Apr 10 15:20:22 rac99003 kernel: (5518,0):__dlm_print_nodes:388  node 0

Apr 10 15:20:22 rac99003 kernel: (5518,0):__dlm_print_nodes:388  node 1

Apr 10 15:20:22 rac99003 kernel: (5518,0):__dlm_print_nodes:388  node 2

Apr 10 15:20:22 rac99003 kernel: (5518,0):__dlm_print_nodes:388  node 3

Apr 10 15:20:22 rac99003 kernel: (5518,0):__dlm_print_nodes:388  node 4

Apr 10 15:20:22 rac99003 kernel: (5518,0):__dlm_print_nodes:388  node 5

Apr 10 15:20:44 rac99003 sshd(pam_unix)[7778]: session opened for user oracle by (uid=0)

Apr 10 15:21:06 rac99003 su(pam_unix)[6075]: session opened for user oracle by (uid=0)

Apr 10 15:21:06 rac99003 logger: Running CRSD with TZ =

Apr 10 15:21:08 rac99003 su(pam_unix)[7991]: session opened for user oracle by (uid=0)

Apr 10 15:26:26 rac99003 kernel: lpfc 0000:05:03.0: 1:0748 abort handler timed out waiting for abort to complete. Data: x0 x0 x3 x2c1cd

Apr 10 15:26:26 rac99003 kernel: lpfc 0000:05:01.0: 0:0748 abort handler timed out waiting for abort to complete. Data: x0 x0 x3 x2c1ca

Apr 10 15:27:17 rac99003 sshd(pam_unix)[17175]: authentication failure; logname= uid=0 euid=0 tty=ssh ruser= rhost=172.19.198.57  user=root

Apr 10 15:27:23 rac99003 sshd(pam_unix)[17452]: session opened for user root by root(uid=0)

Apr 10 15:27:28 rac99003 kernel: lpfc 0000:05:03.0: 1:0748 abort handler timed out waiting for abort to complete. Data: x0 x0 x3 x2c1ce

Apr 10 15:27:28 rac99003 kernel: lpfc 0000:05:01.0: 0:0748 abort handler timed out waiting for abort to complete. Data: x0 x0 x3 x2c1cf

 

John H. Thompson

Infrastructure and Database Services

H-E-B

646 S. Main Ave.

San Antonio, TX 78204

Office: 210-938-8528

 

_______________________________________________
Ocfs2-users mailing list
Ocfs2-users@oss.oracle.com
http://oss.oracle.com/mailman/listinfo/ocfs2-users

Reply via email to