Hello,

We believe we have seen this issue a few times now, and I wonder whether anyone out there has any insight into what is going on.

We have a large (40+ node) cluster accessing an OCFS2 v1.2.1 filesystem. All nodes run Fedora Core 4 (kernel 2.6.13-1.1532_FC4smp). One node is x86_64; the rest are i686.

On the affected node (the x86_64 host), we tried to start the Sun Grid Engine dbwriter service. All of the related processes entered a permanent "D" (uninterruptible sleep) state. All root login sessions received a syslog message saying:

kernel: Kernel BUG at "/root/dnld/ocfs2/ocfs2-1.2.1/fs/ocfs2/file.c":787
kernel: invalid operand: 0000 [1] SMP

I've attached a syslog fragment with the full stack trace.
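
As a sanity check on the first two ERROR lines in that fragment, the logged bug expression can be evaluated by hand from the logged values. The sketch below is only an illustration: the helper names are made up for this example, and it assumes that "i_size" in the message is the in-memory inode size and "dinode i_size" is the on-disk value, which is how I read the expression.

```python
import re

# Values copied from the ocfs2_extend_file ERROR line in the attached syslog fragment.
LOG_LINE = (
    "ERROR: Inode 139458055 i_size = 8920728, dinode i_size = 8922501, "
    "bytes_extended = 0, new_i_size = 8920817"
)


def parse_fields(line):
    """Return the numeric fields of the ERROR line as a dict (hypothetical helper)."""
    pattern = (
        r"Inode (?P<inode>\d+) i_size = (?P<i_size>\d+), "
        r"dinode i_size = (?P<dinode_i_size>\d+), "
        r"bytes_extended = (?P<bytes_extended>\d+), "
        r"new_i_size = (?P<new_i_size>\d+)"
    )
    match = re.search(pattern, line)
    if match is None:
        raise ValueError("line does not look like the ocfs2_extend_file ERROR message")
    return {name: int(value) for name, value in match.groupdict().items()}


def bug_expression(fields):
    """Evaluate the logged expression: i_size != dinode i_size - bytes_extended."""
    return fields["i_size"] != fields["dinode_i_size"] - fields["bytes_extended"]


fields = parse_fields(LOG_LINE)
# How far the (assumed) on-disk size is ahead of the (assumed) in-memory size.
mismatch = fields["dinode_i_size"] - fields["bytes_extended"] - fields["i_size"]
print("bug expression true:", bug_expression(fields))
print("size mismatch in bytes:", mismatch)
```

With the logged values the expression is true, so the BUG fires: the two sizes disagree by 1773 bytes while bytes_extended is 0.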

We were able to log into the box and reboot it. The dbwriter then started normally.
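
For anyone trying to confirm the same symptom before rebooting, the processes stuck in "D" state can be listed from /proc. This is a rough sketch rather than anything we ran at the time; the function names are invented for the example, and it relies only on the standard Linux /proc/<pid>/stat layout (pid, command name in parentheses, then the state letter).

```python
import os


def parse_stat(stat_text):
    """Split the contents of /proc/<pid>/stat into (pid, comm, state)."""
    # comm is wrapped in parentheses and may itself contain spaces or ")",
    # so split on the first "(" and the last ")".
    head, _, rest = stat_text.partition("(")
    comm, _, tail = rest.rpartition(")")
    return int(head), comm, tail.split()[0]


def uninterruptible_processes(proc_root="/proc"):
    """Return (pid, comm) for every process currently in state D."""
    if not os.path.isdir(proc_root):
        return []  # not a Linux-style /proc; nothing to report
    found = []
    for entry in os.listdir(proc_root):
        if not entry.isdigit():
            continue  # skip non-process entries such as /proc/meminfo
        try:
            with open(os.path.join(proc_root, entry, "stat")) as handle:
                pid, comm, state = parse_stat(handle.read())
        except (OSError, ValueError, IndexError):
            continue  # the process exited while we were reading it
        if state == "D":
            found.append((pid, comm))
    return found


if __name__ == "__main__":
    for pid, comm in uninterruptible_processes():
        print(pid, comm)
```

A process that shows up here on every run over several minutes is probably hung rather than just waiting on slow I/O.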

Does anyone have suggestions on how to avoid this problem going forward?

--
Eric Jones                  ejones at jimmy dot harvard dot edu
System Administrator        Department of Biostatistics & Computational Biology
617-632-2447                Dana-Farber Cancer Institute

Jul 19 15:44:32 tools kernel: (1331,3):ocfs2_extend_file:787 ERROR: bug expression: i_size_read(inode) != (le64_to_cpu(fe->i_size) - *bytes_extended)
Jul 19 15:44:32 tools kernel: (1331,3):ocfs2_extend_file:787 ERROR: Inode 139458055 i_size = 8920728, dinode i_size = 8922501, bytes_extended = 0, new_i_size = 8920817
Jul 19 15:44:36 tools kernel: ----------- [cut here ] --------- [please bite here ] ---------
Jul 19 15:44:36 tools kernel: Kernel BUG at "/root/dnld/ocfs2/ocfs2-1.2.1/fs/ocfs2/file.c":787
Jul 19 15:44:36 tools kernel: invalid operand: 0000 [1] SMP
Jul 19 15:44:36 tools kernel: CPU 3
Jul 19 15:44:36 tools kernel: Modules linked in: nls_utf8 loop nfsd exportfs parport_pc lp parport ipv6 ocfs2(U) ocfs2_dlmfs(U) ocfs2_dlm(U) ocfs2_nodemanager(U) configfs(U) autofs4 nfs lockd nfs_acl sunrpc usb_storage dm_mod video button battery ac ohci_hcd i2c_amd8111 i2c_amd756 i2c_core shpchp e1000(U) bonding qla2300 qla2xxx scsi_transport_fc ext3 jbd mptsas(U) mptscsih(U) mptbase(U) sd_mod scsi_mod
Jul 19 15:44:36 tools kernel: Pid: 1331, comm: java Not tainted 2.6.13-1.1532_FC4smp
Jul 19 15:44:36 tools kernel: RIP: 0010:[<ffffffff882a4499>] <ffffffff882a4499>{:ocfs2:ocfs2_extend_file+779}
Jul 19 15:44:36 tools kernel: RSP: 0018:ffff8100888c7cd8  EFLAGS: 00010292
Jul 19 15:44:36 tools kernel: RAX: 0000000000000090 RBX: 0000000000000000 RCX: ffffffff8041bea8
Jul 19 15:44:36 tools kernel: RDX: ffffffff8041bea8 RSI: 0000000000000296 RDI: ffffffff8041bea0
Jul 19 15:44:36 tools kernel: RBP: ffff8100d3289000 R08: 0000000000881e98 R09: 0000000000000001
Jul 19 15:44:36 tools kernel: R10: 0000000008828bfa R11: ffffffff8011a954 R12: ffff810073f49a20
Jul 19 15:44:36 tools kernel: R13: ffff810073f49a20 R14: ffff81015d2f3500 R15: 0000000000881ef1
Jul 19 15:44:36 tools kernel: FS:  00002aaaaaacffc0(0000) GS:ffffffff80502980(0000) knlGS:00000000f7fd8a40
Jul 19 15:44:36 tools kernel: CS:  0010 DS: 0000 ES: 0000 CR0: 000000008005003b
Jul 19 15:44:36 tools kernel: CR2: 00002aaaefdb9000 CR3: 00000000dd6f8000 CR4: 00000000000006e0
Jul 19 15:44:36 tools kernel: Process java (pid: 1331, threadinfo ffff8100888c6000, task ffff81009c637740)
Jul 19 15:44:36 tools kernel: Stack: 0000000000881e98 0000000000882585 0000000000000000 0000000000881ef1
Jul 19 15:44:36 tools kernel:        ffff8100888c7e18 ffff8100f7dfb000 0000000073f49a20 ffff810073f49740
Jul 19 15:44:36 tools kernel:        ffff8100888c7e88 0000000000000000
Jul 19 15:44:36 tools kernel: Call Trace:<ffffffff882b2be4>{:ocfs2:ocfs2_write_lock_maybe_extend+2590}
Jul 19 15:44:36 tools kernel:        <ffffffff8016e5b8>{do_no_page+909} <ffffffff882a30cd>{:ocfs2:ocfs2_file_write+370}
Jul 19 15:44:36 tools kernel:        <ffffffff801809af>{vfs_write+202} <ffffffff80180ee3>{sys_write+69}
Jul 19 15:44:36 tools kernel:        <ffffffff8010db02>{tracesys+209}
Jul 19 15:44:36 tools kernel:
Jul 19 15:44:36 tools kernel: Code: 0f 0b a3 48 be 2c 88 ff ff ff ff c2 13 03 4d 39 c7 0f 83 cf
Jul 19 15:44:36 tools kernel: RIP <ffffffff882a4499>{:ocfs2:ocfs2_extend_file+779} RSP <ffff8100888c7cd8>
Jul 19 15:44:36 tools kernel:  <3>Debug: sleeping function called from invalid context at include/linux/rwsem.h:43
Jul 19 15:44:36 tools kernel: in_atomic():0, irqs_disabled():1
Jul 19 15:44:36 tools kernel:
Jul 19 15:44:36 tools kernel: Call Trace:<ffffffff80136667>{profile_task_exit+21} <ffffffff80137862>{do_exit+34}
Jul 19 15:44:36 tools kernel:        <ffffffff8020ff99>{vgacon_cursor+216} <ffffffff8010f4a5>{default_do_nmi+0}
Jul 19 15:44:36 tools kernel:        <ffffffff8010fee6>{do_invalid_op+163} <ffffffff882a4499>{:ocfs2:ocfs2_extend_file+779}
Jul 19 15:44:36 tools kernel:        <ffffffff8033dd02>{__wait_on_bit+96} <ffffffff882c4968>{:ocfs2:ocfs2_buffer_cached+190}
Jul 19 15:44:36 tools kernel:        <ffffffff8010e4bd>{error_exit+0} <ffffffff8011a954>{flat_send_IPI_mask+0}
Jul 19 15:44:36 tools kernel:        <ffffffff882a4499>{:ocfs2:ocfs2_extend_file+779} <ffffffff882a4495>{:ocfs2:ocfs2_extend_file+775}
Jul 19 15:44:36 tools kernel:        <ffffffff882b2be4>{:ocfs2:ocfs2_write_lock_maybe_extend+2590}
Jul 19 15:44:37 tools kernel:        <ffffffff8016e5b8>{do_no_page+909} <ffffffff882a30cd>{:ocfs2:ocfs2_file_write+370}
Jul 19 15:44:37 tools kernel:        <ffffffff801809af>{vfs_write+202} <ffffffff80180ee3>{sys_write+69}
Jul 19 15:44:37 tools kernel:        <ffffffff8010db02>{tracesys+209}
Jul 19 15:54:08 tools sshd(pam_unix)[6119]: session opened for user root by root(uid=0)
Jul 19 15:54:31 tools sshd(pam_unix)[6158]: session opened for user root by root(uid=0)
Jul 19 15:59:36 tools sshd(pam_unix)[29612]: session closed for user fliu
Jul 19 16:04:45 tools shutdown: shutting down for system halt
Jul 19 16:04:46 tools init: Switching to runlevel: 0
Jul 19 16:04:47 tools nagios: Caught SIGTERM, shutting down... 
Jul 19 16:04:47 tools nagios: Successfully shutdown... (PID=3389)
Jul 19 16:04:48 tools xfs[3285]: terminating 
Jul 19 16:07:57 tools syslogd 1.4.1: restart.
Jul 19 16:07:57 tools kernel: klogd 1.4.1, log source = /proc/kmsg started.
Jul 19 16:07:57 tools kernel: Bootdata ok (command line is ro root=LABEL=/ console=tty0 console=ttyS0,9600)

[...boot messages here...]
_______________________________________________
Ocfs2-users mailing list
[email protected]
http://oss.oracle.com/mailman/listinfo/ocfs2-users
