Hello,
We think we've seen this issue a few times now and I wonder if anyone
out there has any insight as to what's going on.
We have a large (40+) cluster accessing an OCFS2 v 1.2.1 filesystem.
All nodes are Fedora Core 4 (kernel 2.6.13-1.1532_FC4smp). One node is
x86_64, the rest are i686.
On the affected node (the x86_64 host), we tried to start the Sun grid
Engine dbwriter service. All the related processes entered a permanent
"D" state. All root logins received a syslog message saying:
kernel: Kernel BUG at "/root/dnld/ocfs2/ocfs2-1.2.1/fs/ocfs2/file.c":787
kernel: invalid operand: 0000 [1] SMP
I've attached a syslog fragment with the full stack trace.
We were able to log into the box and reboot it. The dbwriter then
started normally.
Does anyone have suggestions on how to avoid this problem going forward?
--
Eric Jones ejones at jimmy dot harvard dot edu
System Adminstrator Department of Biostatistics & Computational Biology
617-632-2447 Dana-Farber Cancer Institute
Jul 19 15:44:32 tools kernel: (1331,3):ocfs2_extend_file:787 ERROR: bug
expression: i_size_read(inode) != (le64_to_cpu(fe->i_size) - *bytes_extended)
Jul 19 15:44:32 tools kernel: (1331,3):ocfs2_extend_file:787 ERROR: Inode
139458055 i_size = 8920728, dinode i_size = 8922501, bytes_extended = 0,
new_i_size = 8920817
Jul 19 15:44:36 tools kernel: ----------- [cut here ] --------- [please bite
here ] ---------
Jul 19 15:44:36 tools kernel: Kernel BUG at
"/root/dnld/ocfs2/ocfs2-1.2.1/fs/ocfs2/file.c":787
Jul 19 15:44:36 tools kernel: invalid operand: 0000 [1] SMP
Jul 19 15:44:36 tools kernel: CPU 3
Jul 19 15:44:36 tools kernel: Modules linked in: nls_utf8 loop nfsd exportfs
parport_pc lp parport ipv6 ocfs2(U) ocfs2_dlmfs(U) ocfs2_dlm(U)
ocfs2_nodemanager(U) configfs(U) autofs4 nfs lockd nfs_acl sunrpc usb_storage
dm_mod video button battery ac ohci_hcd i2c_amd8111 i2c_amd756 i2c_core shpchp
e1000(U) bonding qla2300 qla2xxx scsi_transport_fc ext3 jbd mptsas(U)
mptscsih(U) mptbase(U) sd_mod scsi_mod
Jul 19 15:44:36 tools kernel: Pid: 1331, comm: java Not tainted
2.6.13-1.1532_FC4smp
Jul 19 15:44:36 tools kernel: RIP: 0010:[<ffffffff882a4499>]
<ffffffff882a4499>{:ocfs2:ocfs2_extend_file+779}
Jul 19 15:44:36 tools kernel: RSP: 0018:ffff8100888c7cd8 EFLAGS: 00010292
Jul 19 15:44:36 tools kernel: RAX: 0000000000000090 RBX: 0000000000000000 RCX:
ffffffff8041bea8
Jul 19 15:44:36 tools kernel: RDX: ffffffff8041bea8 RSI: 0000000000000296 RDI:
ffffffff8041bea0
Jul 19 15:44:36 tools kernel: RBP: ffff8100d3289000 R08: 0000000000881e98 R09:
0000000000000001
Jul 19 15:44:36 tools kernel: R10: 0000000008828bfa R11: ffffffff8011a954 R12:
ffff810073f49a20
Jul 19 15:44:36 tools kernel: R13: ffff810073f49a20 R14: ffff81015d2f3500 R15:
0000000000881ef1
Jul 19 15:44:36 tools kernel: FS: 00002aaaaaacffc0(0000)
GS:ffffffff80502980(0000) knlGS:00000000f7fd8a40
Jul 19 15:44:36 tools kernel: CS: 0010 DS: 0000 ES: 0000 CR0: 000000008005003b
Jul 19 15:44:36 tools kernel: CR2: 00002aaaefdb9000 CR3: 00000000dd6f8000 CR4:
00000000000006e0
Jul 19 15:44:36 tools kernel: Process java (pid: 1331, threadinfo
ffff8100888c6000, task ffff81009c637740)
Jul 19 15:44:36 tools kernel: Stack: 0000000000881e98 0000000000882585
0000000000000000 0000000000881ef1
Jul 19 15:44:36 tools kernel: ffff8100888c7e18 ffff8100f7dfb000
0000000073f49a20 ffff810073f49740
Jul 19 15:44:36 tools kernel: ffff8100888c7e88 0000000000000000
Jul 19 15:44:36 tools kernel: Call
Trace:<ffffffff882b2be4>{:ocfs2:ocfs2_write_lock_maybe_extend+2590}
Jul 19 15:44:36 tools kernel: <ffffffff8016e5b8>{do_no_page+909}
<ffffffff882a30cd>{:ocfs2:ocfs2_file_write+370}
Jul 19 15:44:36 tools kernel: <ffffffff801809af>{vfs_write+202}
<ffffffff80180ee3>{sys_write+69}
Jul 19 15:44:36 tools kernel: <ffffffff8010db02>{tracesys+209}
Jul 19 15:44:36 tools kernel:
Jul 19 15:44:36 tools kernel: Code: 0f 0b a3 48 be 2c 88 ff ff ff ff c2 13 03
4d 39 c7 0f 83 cf
Jul 19 15:44:36 tools kernel: RIP
<ffffffff882a4499>{:ocfs2:ocfs2_extend_file+779} RSP <ffff8100888c7cd8>
Jul 19 15:44:36 tools kernel: <3>Debug: sleeping function called from invalid
context at include/linux/rwsem.h:43
Jul 19 15:44:36 tools kernel: in_atomic():0, irqs_disabled():1
Jul 19 15:44:36 tools kernel:
Jul 19 15:44:36 tools kernel: Call
Trace:<ffffffff80136667>{profile_task_exit+21} <ffffffff80137862>{do_exit+34}
Jul 19 15:44:36 tools kernel: <ffffffff8020ff99>{vgacon_cursor+216}
<ffffffff8010f4a5>{default_do_nmi+0}
Jul 19 15:44:36 tools kernel: <ffffffff8010fee6>{do_invalid_op+163}
<ffffffff882a4499>{:ocfs2:ocfs2_extend_file+779}
Jul 19 15:44:36 tools kernel: <ffffffff8033dd02>{__wait_on_bit+96}
<ffffffff882c4968>{:ocfs2:ocfs2_buffer_cached+190}
Jul 19 15:44:36 tools kernel: <ffffffff8010e4bd>{error_exit+0}
<ffffffff8011a954>{flat_send_IPI_mask+0}
Jul 19 15:44:36 tools kernel:
<ffffffff882a4499>{:ocfs2:ocfs2_extend_file+779}
<ffffffff882a4495>{:ocfs2:ocfs2_extend_file+775}
Jul 19 15:44:36 tools kernel:
<ffffffff882b2be4>{:ocfs2:ocfs2_write_lock_maybe_extend+2590}
Jul 19 15:44:37 tools kernel: <ffffffff8016e5b8>{do_no_page+909}
<ffffffff882a30cd>{:ocfs2:ocfs2_file_write+370}
Jul 19 15:44:37 tools kernel: <ffffffff801809af>{vfs_write+202}
<ffffffff80180ee3>{sys_write+69}
Jul 19 15:44:37 tools kernel: <ffffffff8010db02>{tracesys+209}
Jul 19 15:54:08 tools sshd(pam_unix)[6119]: session opened for user root by
root(uid=0)
Jul 19 15:54:31 tools sshd(pam_unix)[6158]: session opened for user root by
root(uid=0)
Jul 19 15:59:36 tools sshd(pam_unix)[29612]: session closed for user fliu
Jul 19 16:04:45 tools shutdown: shutting down for system halt
Jul 19 16:04:46 tools init: Switching to runlevel: 0
Jul 19 16:04:47 tools nagios: Caught SIGTERM, shutting down...
Jul 19 16:04:47 tools nagios: Successfully shutdown... (PID=3389)
Jul 19 16:04:48 tools xfs[3285]: terminating
Jul 19 16:07:57 tools syslogd 1.4.1: restart.
Jul 19 16:07:57 tools kernel: klogd 1.4.1, log source = /proc/kmsg started.
Jul 19 16:07:57 tools kernel: Bootdata ok (command line is ro root=LABEL=/
console=tty0 console=ttyS0,9600)
[...boot messages here...]
_______________________________________________
Ocfs2-users mailing list
[email protected]
http://oss.oracle.com/mailman/listinfo/ocfs2-users