Re: Reproducible reiser4 bug with 2.6.16.2 patch on tail_conversion.c:80
Hello please check whether the attached patch helps. On Wednesday 10 May 2006 14:56, Timo Kokkonen wrote: Hello, I was able to produce this bug on two different computers running the same kernel version and same patches and same task. Each time I run dispatch-conf (I'm running on gentoo) to update the config files on /etc, it gives the following output: RCS file: /etc/config-archive/usr/lib/X11/xkb/xkbcomp,v 1.1 locked ci: RCS file /etc/config-archive/usr/lib/X11/xkb/xkbcomp,v is in use /etc/config-archive/usr/lib/X11/xkb/xkbcomp,v -- /etc/config-archive/usr/lib/X11/xkb/xkbcomp revision 1.1.1.1 writable /etc/config-archive/usr/lib/X11/xkb/xkbcomp exists; remove it? [ny](n): And right after that I can see this message in kernel log: May 10 11:26:32 kaatokone2 [4296234.613000] [ cut here ] May 10 11:26:32 kaatokone2 [4296234.613000] kernel BUG at fs/reiser4/plugin/file/tail_conversion.c:80! May 10 11:26:32 kaatokone2 [4296234.613000] invalid opcode: [#1] May 10 11:26:32 kaatokone2 [4296234.613000] PREEMPT May 10 11:26:32 kaatokone2 [4296234.613000] Modules linked in: w83627hf w83781d hwmon_vid eeprom i2c_isa May 10 11:26:32 kaatokone2 [4296234.613000] CPU:0 May 10 11:26:32 kaatokone2 [4296234.613000] EIP:0060:[b019ec1f] Not tainted VLI May 10 11:26:32 kaatokone2 [4296234.613000] EFLAGS: 00210286 (2.6.16-gentoo-r7 #1) May 10 11:26:32 kaatokone2 [4296234.613000] EIP is at get_nonexclusive_access+0x23/0x36 May 10 11:26:32 kaatokone2 [4296234.613000] eax: ef7644c4 ebx: ecx: a7e8b000 edx: e5d0c414 May 10 11:26:32 kaatokone2 [4296234.613000] esi: e5d0c414 edi: e5d0c46c ebp: e5d59e9c esp: e5d59dc4 May 10 11:26:32 kaatokone2 [4296234.613000] ds: 007b es: 007b ss: 0068 May 10 11:26:32 kaatokone2 [4296234.613000] Process rcs (pid: 8422, threadinfo=e5d58000 task=e5cf2ab0) May 10 11:26:32 kaatokone2 [4296234.613000] Stack: 0b01b18e9 e5d0c414 3400 e5d58000 b16ba260 May 10 11:26:32 kaatokone2 [4296234.613000]e5eb8780 b01b1ccd e5d0c46c e5d59e9c eed8e8c0 eed8e8d0 eed8e8d0 0002 May 10 11:26:32 kaatokone2 [4296234.613000]0001 e5d59e9c 1000 0001 efeea800 efded414 00391f78 May 10 11:26:32 kaatokone2 [4296234.613000] Call Trace: May 10 11:26:32 kaatokone2 [4296234.613000] [b01b18e9] extent_balance_dirty_pages+0x13a/0x145 May 10 11:26:32 kaatokone2 [4296234.613000] [b01b1ccd] extent_write_flow+0x3d9/0x4e7 May 10 11:26:32 kaatokone2 [4296234.613000] [b01b1f0a] write_extent+0x0/0x36 May 10 11:26:32 kaatokone2 [4296234.613000] [b019dc17] append_and_or_overwrite+0x24f/0x2fd May 10 11:26:32 kaatokone2 [4296234.613000] [b019dd0c] write_flow+0x47/0x50 May 10 11:26:32 kaatokone2 [4296234.613000] [b019de92] write_file+0x5c/0x7a May 10 11:26:32 kaatokone2 [4296234.613000] [b019e177] write_unix_file+0x2c7/0x444 May 10 11:26:32 kaatokone2 [4296234.613000] [b014bd9e] vfs_write+0x87/0x11b May 10 11:26:32 kaatokone2 [4296234.613000] [b014bed0] sys_write+0x3b/0x63 May 10 11:26:32 kaatokone2 [4296234.613000] [b0102a93] sysenter_past_esp+0x54/0x75 May 10 11:26:32 kaatokone2 [4296234.613000] Code: 0f 85 b5 0d 00 00 c3 c3 83 7c 24 08 00 8b 54 24 04 75 20 b8 00 e0 ff ff 21 e0 8b 00 8b 80 b4 04 00 00 8b 40 40 83 78 08 00 74 08 0f 0b 50 00 ba cf 43 b0 89 d0 ff 00 0f 88 92 0d 00 00 c3 8b 54 May 10 11:26:32 kaatokone2 [4296234.613000] 44reiser4[rcs(8422)]: release_unix_file (fs/reiser4/plugin/file/file.c:2670)[vs-44]: May 10 11:26:32 kaatokone2 [4296234.614000] WARNING: out of memory? May 10 11:26:32 kaatokone2 [4296234.614000] 4reiser4[rcs(8422)]: release_unix_file (fs/reiser4/plugin/file/file.c:2670)[vs-44]: May 10 11:26:32 kaatokone2 [4296234.614000] WARNING: out of memory? May 10 11:26:32 kaatokone2 [4296234.614000] 4reiser4[rcs(8422)]: release_unix_file (fs/reiser4/plugin/file/file.c:2670)[vs-44]: May 10 11:26:32 kaatokone2 [4296234.614000] WARNING: out of memory? May 10 11:26:32 kaatokone2 [4296234.614000] 4reiser4[rcs(8422)]: release_unix_file (fs/reiser4/plugin/file/file.c:2670)[vs-44]: May 10 11:26:32 kaatokone2 [4296234.614000] WARNING: out of memory? May 10 11:26:32 kaatokone2 [4296234.614000] 4reiser4[rcs(8422)]: release_unix_file (fs/reiser4/plugin/file/file.c:2670)[vs-44]: May 10 11:26:32 kaatokone2 [4296234.614000] WARNING: out of memory? After a reboot and running dispatch-conf again I get the exact same error message again. I got this same bug on the another computer aswell, during exact same conditions (running dispatch-conf, it even complained about the same filename). Running reiser4.fsck didn't reveal any problems at all on that computer. Booting with 2.6.15.1 (with gentoo-sources-2.6.15-r1 patches) and reiser4-2.6.15.1 didn't crash and dispatch-conf finished updating the files normally. So, now I have this computer that I can make to crash every time by running dispatch-conf. I can keep
Reproducible reiser4 bug with 2.6.16.2 patch on tail_conversion.c:80
Hello, I was able to produce this bug on two different computers running the same kernel version and same patches and same task. Each time I run dispatch-conf (I'm running on gentoo) to update the config files on /etc, it gives the following output: RCS file: /etc/config-archive/usr/lib/X11/xkb/xkbcomp,v 1.1 locked ci: RCS file /etc/config-archive/usr/lib/X11/xkb/xkbcomp,v is in use /etc/config-archive/usr/lib/X11/xkb/xkbcomp,v -- /etc/config-archive/usr/lib/X11/xkb/xkbcomp revision 1.1.1.1 writable /etc/config-archive/usr/lib/X11/xkb/xkbcomp exists; remove it? [ny](n): And right after that I can see this message in kernel log: May 10 11:26:32 kaatokone2 [4296234.613000] [ cut here ] May 10 11:26:32 kaatokone2 [4296234.613000] kernel BUG at fs/reiser4/plugin/file/tail_conversion.c:80! May 10 11:26:32 kaatokone2 [4296234.613000] invalid opcode: [#1] May 10 11:26:32 kaatokone2 [4296234.613000] PREEMPT May 10 11:26:32 kaatokone2 [4296234.613000] Modules linked in: w83627hf w83781d hwmon_vid eeprom i2c_isa May 10 11:26:32 kaatokone2 [4296234.613000] CPU:0 May 10 11:26:32 kaatokone2 [4296234.613000] EIP:0060:[b019ec1f] Not tainted VLI May 10 11:26:32 kaatokone2 [4296234.613000] EFLAGS: 00210286 (2.6.16-gentoo-r7 #1) May 10 11:26:32 kaatokone2 [4296234.613000] EIP is at get_nonexclusive_access+0x23/0x36 May 10 11:26:32 kaatokone2 [4296234.613000] eax: ef7644c4 ebx: ecx: a7e8b000 edx: e5d0c414 May 10 11:26:32 kaatokone2 [4296234.613000] esi: e5d0c414 edi: e5d0c46c ebp: e5d59e9c esp: e5d59dc4 May 10 11:26:32 kaatokone2 [4296234.613000] ds: 007b es: 007b ss: 0068 May 10 11:26:32 kaatokone2 [4296234.613000] Process rcs (pid: 8422, threadinfo=e5d58000 task=e5cf2ab0) May 10 11:26:32 kaatokone2 [4296234.613000] Stack: 0b01b18e9 e5d0c414 3400 e5d58000 b16ba260 May 10 11:26:32 kaatokone2 [4296234.613000]e5eb8780 b01b1ccd e5d0c46c e5d59e9c eed8e8c0 eed8e8d0 eed8e8d0 0002 May 10 11:26:32 kaatokone2 [4296234.613000]0001 e5d59e9c 1000 0001 efeea800 efded414 00391f78 May 10 11:26:32 kaatokone2 [4296234.613000] Call Trace: May 10 11:26:32 kaatokone2 [4296234.613000] [b01b18e9] extent_balance_dirty_pages+0x13a/0x145 May 10 11:26:32 kaatokone2 [4296234.613000] [b01b1ccd] extent_write_flow+0x3d9/0x4e7 May 10 11:26:32 kaatokone2 [4296234.613000] [b01b1f0a] write_extent+0x0/0x36 May 10 11:26:32 kaatokone2 [4296234.613000] [b019dc17] append_and_or_overwrite+0x24f/0x2fd May 10 11:26:32 kaatokone2 [4296234.613000] [b019dd0c] write_flow+0x47/0x50 May 10 11:26:32 kaatokone2 [4296234.613000] [b019de92] write_file+0x5c/0x7a May 10 11:26:32 kaatokone2 [4296234.613000] [b019e177] write_unix_file+0x2c7/0x444 May 10 11:26:32 kaatokone2 [4296234.613000] [b014bd9e] vfs_write+0x87/0x11b May 10 11:26:32 kaatokone2 [4296234.613000] [b014bed0] sys_write+0x3b/0x63 May 10 11:26:32 kaatokone2 [4296234.613000] [b0102a93] sysenter_past_esp+0x54/0x75 May 10 11:26:32 kaatokone2 [4296234.613000] Code: 0f 85 b5 0d 00 00 c3 c3 83 7c 24 08 00 8b 54 24 04 75 20 b8 00 e0 ff ff 21 e0 8b 00 8b 80 b4 04 00 00 8b 40 40 83 78 08 00 74 08 0f 0b 50 00 ba cf 43 b0 89 d0 ff 00 0f 88 92 0d 00 00 c3 8b 54 May 10 11:26:32 kaatokone2 [4296234.613000] 44reiser4[rcs(8422)]: release_unix_file (fs/reiser4/plugin/file/file.c:2670)[vs-44]: May 10 11:26:32 kaatokone2 [4296234.614000] WARNING: out of memory? May 10 11:26:32 kaatokone2 [4296234.614000] 4reiser4[rcs(8422)]: release_unix_file (fs/reiser4/plugin/file/file.c:2670)[vs-44]: May 10 11:26:32 kaatokone2 [4296234.614000] WARNING: out of memory? May 10 11:26:32 kaatokone2 [4296234.614000] 4reiser4[rcs(8422)]: release_unix_file (fs/reiser4/plugin/file/file.c:2670)[vs-44]: May 10 11:26:32 kaatokone2 [4296234.614000] WARNING: out of memory? May 10 11:26:32 kaatokone2 [4296234.614000] 4reiser4[rcs(8422)]: release_unix_file (fs/reiser4/plugin/file/file.c:2670)[vs-44]: May 10 11:26:32 kaatokone2 [4296234.614000] WARNING: out of memory? May 10 11:26:32 kaatokone2 [4296234.614000] 4reiser4[rcs(8422)]: release_unix_file (fs/reiser4/plugin/file/file.c:2670)[vs-44]: May 10 11:26:32 kaatokone2 [4296234.614000] WARNING: out of memory? After a reboot and running dispatch-conf again I get the exact same error message again. I got this same bug on the another computer aswell, during exact same conditions (running dispatch-conf, it even complained about the same filename). Running reiser4.fsck didn't reveal any problems at all on that computer. Booting with 2.6.15.1 (with gentoo-sources-2.6.15-r1 patches) and reiser4-2.6.15.1 didn't crash and dispatch-conf finished updating the files normally. So, now I have this computer that I can make to crash every time by running dispatch-conf. I can keep the computer intact and run further diagnostics on it, if necessary. If there is any other info you need to know, let me know what you want and I try
Re: reiser4 bug [was Re: 2.6.17-rc3-mm1]
Nope, did not work... regards Alex Am Dienstag, 9. Mai 2006 01:21 schrieb Joe Feise: Try the patch from here: http://marc.theaimsgroup.com/?l=reiserfsm=114709188305181w=2 That helped me get past the bootup phase (currently 8 hours uptime). -Joe Alexander Gran writes: Hi all, 2.6.17-rc3-mm1 doesn't get up running here, it bugs around while init runs: I cannot login afterwards, and syslog did not get the bug too. So here are some poor screenshots from my Treo650 (digicam is broken, sorry..;) EIP is in clear_inode. Trace: reiser4_delete_inode+0x6c/0xd0 d_delete+0xf0/0x10f reiser4_delete_inode+0x0/0xd0 generic_delete_inode+0x6b/0xfb input+0x5c/0x68 do_unlikat+0xd7/0x12c sysenter_past_esp+0x54/0x75 __hidp_send_ctrl_message+0xb4/0xfa details: http://zodiac.dnsalias.org/images/1.jpg http://zodiac.dnsalias.org/images/2.jpg http://zodiac.dnsalias.org/images/3.jpg http://zodiac.dnsalias.org/images/4.jpg Kernel config: http://zodiac.dnsalias.org/images/config System is my T40p, as usual. running an up2date debian unstable. regards Alex -- Encrypted Mails welcome. PGP-Key at http://zodiac.dnsalias.org/misc/pgpkey.asc | Key-ID: 0x6D7DD291 pgp043lcmFacP.pgp Description: PGP signature
Re: reiser4 bug
Hi, it is known problem. Fixed in 2.6.17-rc1-mm2 (reiser4-have-get_exclusive_access-restart-transaction.patch). as i've been burned by this bug, too i would suggest making a new patch for 2.6.16 including reiser4-have-get_exclusive_access-restart-transaction.patch or at least put a warning there that the version is unstable. i'm sure most people go straight for the vanilla kernel patch and don't bother with mm-kernels. this puts reiser4 in a bad light imo. kind regards, Michael -- /\ | Michael Weissenbacher [EMAIL PROTECTED] | | http://www.dermichi.com/ | | Email users are divided into two classes;| | 1) Those who have effective spam-blocking| | 2) Those who wish they did | \/
Re: reiser4 bug
On 4/10/06, Alexander Zarochentsev [EMAIL PROTECTED] wrote: On Tuesday 11 April 2006 09:28, Matt Eaton wrote: I was running OpenOffice and tried saving a file. I'm using 2.6.16.1 + reiser4-for-2.6.16-1.patch.gz (Please help! I'm having to run openoffice on a different filesystem as this crash has occurred twice saving the same file - after a reboot) kernel BUG at fs/reiser4/plugin/file/tail_conversion.c:29! invalid opcode: [#1] it is known problem. Fixed in 2.6.17-rc1-mm2 (reiser4-have-get_exclusive_access-restart-transaction.patch). I am afraid the patch from mm would not apply cleanly to 2.6.16.1 + reiser4-for-2.6.16-1.patch, you may try to apply only the following hunk: @@ -20,13 +20,12 @@ void get_exclusive_access(unix_file_info assert(nikita-3047, LOCK_CNT_NIL(inode_sem_w)); assert(nikita-3048, LOCK_CNT_NIL(inode_sem_r)); /* -* deadlock detection: sometimes we commit a transaction under +* deadlock avoidance: sometimes we commit a transaction under * rw-semaphore on a file. Such commit can deadlock with another * thread that captured some block (hence preventing atom from being * committed) and waits on rw-semaphore. */ - assert(nikita-3361, get_current_context()-trans-atom == NULL); - BUG_ON(get_current_context()-trans-atom != NULL); + txn_restart_current(); LOCK_CNT_INC(inode_sem_w); down_write(uf_info-latch); uf_info-exclusive_use = 1; _ PREEMPT SMP Modules linked in: eeprom lm85 hwmon_vid i2c_i801 lp vmnet parport_pc parport vmmon nfs lockd sunrpc snd_seq_midi snd_emu10k1_synth snd_emux_synth snd_seq_virmidi snd_seq_midi_emul snd_pcm_oss snd_mixer_oss snd_seq_oss snd_seq_midi_event snd_seq snd_hda_intel snd_hda_codec snd_emu10k1 snd_rawmidi snd_ac97_codec snd_ac97_bus snd_pcm snd_seq_device snd_timer snd_page_alloc snd_util_mem snd_hwdep snd soundcore binfmt_misc ntfs usblp ide_cd cdrom nvidia e100 mii uhci_hcd ehci_hcdCPU:0 EIP:0060:[c01c623c]Tainted: P VLI EFLAGS: 00210282 (2.6.16.1 #1) EIP is at get_exclusive_access+0x1c/0x3d eax: df21b5dc ebx: 0001 ecx: df21b654 edx: c50e17d4 esi: acb2e000 edi: c50e1780 ebp: esp: e4481f30 ds: 007b es: 007b ss: 0068 Process soffice.bin (pid: 21440, threadinfo=e448 task=dc5bfa90) Stack: 0c01c56f4 df21b5dc c013ff95 c253a7ac dda673c0 0001 37c2 df21b5dc df21b654 5058c680 e32c4680 acb2e000 e4481fa4 37c2 c014c900 e32c4680 acb2e000 37c2 e4481fa4 e32c4680 fff7 bf823d43 e448 Call Trace: [c01c56f4] write_unix_file+0x2a6/0x45c [c013ff95] vma_link+0xbe/0xc5 [c014c900] vfs_write+0x87/0x11b [c014ca32] sys_write+0x3b/0x63 [c010274b] sysenter_past_esp+0x54/0x75 Code: 8d 43 14 e8 0b 38 13 00 e9 d1 fc ff ff 90 90 ba 00 e0 ff ff 21 e2 8b 12 8b 92 c4 04 00 00 8b 44 24 04 8b 52 50 83 7a 10 00 74 08 0f 0b 1d 00 60 b8 31 c0 ba 01 00 ff ff f0 0f c1 10 85 d2 0f 85 !DSPAM:443b3e85155598791294130! -- Alex. Is this the same bug? And if so can you please provide a patch against the current 'stable' patch at the ftp? [ cut here ] kernel BUG at /usr/src/linux-stable/fs/reiser4/plugin/file/tail_conversion.c:29! invalid opcode: [#1] PREEMPT Modules linked in: ipt_MASQUERADE ipt_LOG ipt_TOS ipt_TCPMSS xt_tcpudp ipt_tos iptable_mangle ip_nat_ftp xt_tcpmss iptable_nat ip_nat iptable_filter ipt_multiport xt_state xt_limit xt_conntrack ip_conntrack_ftp ip_conntrack ip_tables x_tables snd_seq_midi snd_emu10k1_synth snd_emux_synth snd_seq_virmidi snd_seq_midi_event snd_seq_midi_emul snd_seq snd_emu10k1 snd_rawmidi snd_ac97_codec snd_ac97_bus snd_pcm snd_seq_device snd_timer snd_page_alloc snd_util_mem snd_hwdep snd CPU:0 EIP:0060:[c01b591c]Not tainted VLI EFLAGS: 00010286 (2.6.16.2-reiser4 #5) EIP is at get_exclusive_access+0x1a/0x3c eax: c2c9c344 ebx: ecx: d43c9354 edx: b53b7000 esi: c01b46a2 edi: 2000 ebp: 2000 esp: d2f15f34 ds: 007b es: 007b ss: 0068 Process apache2 (pid: 24045, threadinfo=d2f15000 task=c65b30b0) Stack: 0c01b4964 b53b7000 ce37b280 d9108340 c2c9c300 d43c93ac d43c9354 2000 0001 0005 d43c9300 d43c9360 2000 473b7000 ce37b280 c01b46a2 b53b7000 2000 c014712c d2f15fa4 ce37b280 fff7 Call Trace: [c01b4964] write_unix_file+0x2c2/0x4cd [c01b46a2] write_unix_file+0x0/0x4cd [c014712c] vfs_write+0xa1/0x140 [c0147659] sys_write+0x3c/0x63 [c0102a93] sysenter_past_esp+0x54/0x75 Code: ff c7 04 9e 00 00 00 00 43 39 fb 75 e8 5b 5e 5f c3 89 c1 b8 00 f0 ff ff 21 e0 8b 00 8b 80 b4 04 00 00 8b 40 40 83 78 08 00 74 08 0f 0b 1d 00 00 f3 32 c0 ba 01 00 ff ff 89 c8 0f c1 10 85 d2 0f 44reiser4[apache2(24045)]: release_unix_file (/usr/src/linux-stable/fs/reiser4/plugin/file/file.c:2674)[vs-44]: WARNING: out of memory?
RE: reiser4 bug
The patch posted works for me. I would concur with Michael on this one. For some reason openoffice triggered it with my problem. This is a bad area to have data loss and I imagine I wouldn't be the first to lose a document from this bug. It doesn't leave an inconsistent filesystem, but it does lock the current filesystem down (and prevents your document from being saved). Would definitely recommend you guys release a reiser4-for-2.6.16-2 patch and zap -1. Thanks for the quick reply though with the fix -- saved me additional troubles. Regards, Matthew Original Message Subject: Re: reiser4 bug From: Michael Weissenbacher [EMAIL PROTECTED] Date: Tue, April 11, 2006 4:27 am To: reiserfs-list@namesys.com Cc: Alexander Zarochentsev [EMAIL PROTECTED] Hi, it is known problem. Fixed in 2.6.17-rc1-mm2 (reiser4-have-get_exclusive_access-restart-transaction.patch). as i've been burned by this bug, too i would suggest making a new patch for 2.6.16 including reiser4-have-get_exclusive_access-restart-transaction.patch or at least put a warning there that the version is unstable. i'm sure most people go straight for the vanilla kernel patch and don't bother with mm-kernels. this puts reiser4 in a bad light imo. kind regards, Michael -- /\ | Michael Weissenbacher [EMAIL PROTECTED] | | http://www.dermichi.com/ | | Email users are divided into two classes;| | 1) Those who have effective spam-blocking| | 2) Those who wish they did | \/
Re: reiser4 bug
On Tuesday 11 April 2006 12:54, Avuton Olrich wrote: On 4/10/06, Alexander Zarochentsev [EMAIL PROTECTED] wrote: On Tuesday 11 April 2006 09:28, Matt Eaton wrote: I was running OpenOffice and tried saving a file. I'm using 2.6.16.1 + reiser4-for-2.6.16-1.patch.gz (Please help! I'm having to run openoffice on a different filesystem as this crash has occurred twice saving the same file - after a reboot) kernel BUG at fs/reiser4/plugin/file/tail_conversion.c:29! invalid opcode: [#1] it is known problem. Fixed in 2.6.17-rc1-mm2 (reiser4-have-get_exclusive_access-restart-transaction.patch). I am afraid the patch from mm would not apply cleanly to 2.6.16.1 + reiser4-for-2.6.16-1.patch, you may try to apply only the following hunk: @@ -20,13 +20,12 @@ void get_exclusive_access(unix_file_info assert(nikita-3047, LOCK_CNT_NIL(inode_sem_w)); assert(nikita-3048, LOCK_CNT_NIL(inode_sem_r)); /* -* deadlock detection: sometimes we commit a transaction under +* deadlock avoidance: sometimes we commit a transaction under * rw-semaphore on a file. Such commit can deadlock with another * thread that captured some block (hence preventing atom from being * committed) and waits on rw-semaphore. */ - assert(nikita-3361, get_current_context()-trans-atom == NULL); - BUG_ON(get_current_context()-trans-atom != NULL); + txn_restart_current(); LOCK_CNT_INC(inode_sem_w); down_write(uf_info-latch); uf_info-exclusive_use = 1; _ PREEMPT SMP Modules linked in: eeprom lm85 hwmon_vid i2c_i801 lp vmnet parport_pc parport vmmon nfs lockd sunrpc snd_seq_midi snd_emu10k1_synth snd_emux_synth snd_seq_virmidi snd_seq_midi_emul snd_pcm_oss snd_mixer_oss snd_seq_oss snd_seq_midi_event snd_seq snd_hda_intel snd_hda_codec snd_emu10k1 snd_rawmidi snd_ac97_codec snd_ac97_bus snd_pcm snd_seq_device snd_timer snd_page_alloc snd_util_mem snd_hwdep snd soundcore binfmt_misc ntfs usblp ide_cd cdrom nvidia e100 mii uhci_hcd ehci_hcdCPU:0 EIP:0060:[c01c623c]Tainted: P VLI EFLAGS: 00210282 (2.6.16.1 #1) EIP is at get_exclusive_access+0x1c/0x3d eax: df21b5dc ebx: 0001 ecx: df21b654 edx: c50e17d4 esi: acb2e000 edi: c50e1780 ebp: esp: e4481f30 ds: 007b es: 007b ss: 0068 Process soffice.bin (pid: 21440, threadinfo=e448 task=dc5bfa90) Stack: 0c01c56f4 df21b5dc c013ff95 c253a7ac dda673c0 0001 37c2 df21b5dc df21b654 5058c680 e32c4680 acb2e000 e4481fa4 37c2 c014c900 e32c4680 acb2e000 37c2 e4481fa4 e32c4680 fff7 bf823d43 e448 Call Trace: [c01c56f4] write_unix_file+0x2a6/0x45c [c013ff95] vma_link+0xbe/0xc5 [c014c900] vfs_write+0x87/0x11b [c014ca32] sys_write+0x3b/0x63 [c010274b] sysenter_past_esp+0x54/0x75 Code: 8d 43 14 e8 0b 38 13 00 e9 d1 fc ff ff 90 90 ba 00 e0 ff ff 21 e2 8b 12 8b 92 c4 04 00 00 8b 44 24 04 8b 52 50 83 7a 10 00 74 08 0f 0b 1d 00 60 b8 31 c0 ba 01 00 ff ff f0 0f c1 10 85 d2 0f 85 -- Alex. Is this the same bug? And if so can you please provide a patch yes. against the current 'stable' patch at the ftp? ftp://ftp.namesys.com/pub/reiser4-for-2.6/2.6.16/reiser4-for-2.6.16-2.patch.gz [ cut here ] kernel BUG at /usr/src/linux-stable/fs/reiser4/plugin/file/tail_conversion.c:29! invalid opcode: [#1] PREEMPT Modules linked in: ipt_MASQUERADE ipt_LOG ipt_TOS ipt_TCPMSS xt_tcpudp ipt_tos iptable_mangle ip_nat_ftp xt_tcpmss iptable_nat ip_nat iptable_filter ipt_multiport xt_state xt_limit xt_conntrack ip_conntrack_ftp ip_conntrack ip_tables x_tables snd_seq_midi snd_emu10k1_synth snd_emux_synth snd_seq_virmidi snd_seq_midi_event snd_seq_midi_emul snd_seq snd_emu10k1 snd_rawmidi snd_ac97_codec snd_ac97_bus snd_pcm snd_seq_device snd_timer snd_page_alloc snd_util_mem snd_hwdep snd CPU:0 EIP:0060:[c01b591c]Not tainted VLI EFLAGS: 00010286 (2.6.16.2-reiser4 #5) EIP is at get_exclusive_access+0x1a/0x3c eax: c2c9c344 ebx: ecx: d43c9354 edx: b53b7000 esi: c01b46a2 edi: 2000 ebp: 2000 esp: d2f15f34 ds: 007b es: 007b ss: 0068 Process apache2 (pid: 24045, threadinfo=d2f15000 task=c65b30b0) Stack: 0c01b4964 b53b7000 ce37b280 d9108340 c2c9c300 d43c93ac d43c9354 2000 0001 0005 d43c9300 d43c9360 2000 473b7000 ce37b280 c01b46a2 b53b7000 2000 c014712c d2f15fa4 ce37b280 fff7 Call Trace: [c01b4964] write_unix_file+0x2c2/0x4cd [c01b46a2] write_unix_file+0x0/0x4cd [c014712c] vfs_write+0xa1/0x140 [c0147659] sys_write+0x3c/0x63 [c0102a93] sysenter_past_esp+0x54/0x75 Code: ff c7 04 9e 00 00 00 00 43 39 fb 75 e8 5b 5e 5f c3 89 c1 b8 00 f0 ff ff 21 e0 8b 00 8b 80 b4 04 00 00 8b 40 40 83 78 08 00 74 08 0f 0b 1d 00 00 f3
Re: reiser4 bug
On Mon, 2006-04-10 at 22:28 -0700, Matt Eaton wrote: I was running OpenOffice and tried saving a file. I'm using 2.6.16.1 + reiser4-for-2.6.16-1.patch.gz (Please help! I'm having to run openoffice on a different filesystem as this crash has occurred twice saving the same file - after a reboot) Did you fsck when you rebooted? kernel BUG at fs/reiser4/plugin/file/tail_conversion.c:29! invalid opcode: [#1] PREEMPT SMP Modules linked in: eeprom lm85 hwmon_vid i2c_i801 lp vmnet parport_pc parport vmmon nfs lockd sunrpc snd_seq_midi snd_emu10k1_synth snd_emux_synth snd_seq_virmidi snd_seq_midi_emul snd_pcm_oss snd_mixer_oss snd_seq_oss snd_seq_midi_event snd_seq snd_hda_intel snd_hda_codec snd_emu10k1 snd_rawmidi snd_ac97_codec snd_ac97_bus snd_pcm snd_seq_device snd_timer snd_page_alloc snd_util_mem snd_hwdep snd soundcore binfmt_misc ntfs usblp ide_cd cdrom nvidia e100 mii uhci_hcd ehci_hcdCPU:0 EIP:0060:[c01c623c]Tainted: P VLI EFLAGS: 00210282 (2.6.16.1 #1) EIP is at get_exclusive_access+0x1c/0x3d eax: df21b5dc ebx: 0001 ecx: df21b654 edx: c50e17d4 esi: acb2e000 edi: c50e1780 ebp: esp: e4481f30 ds: 007b es: 007b ss: 0068 Process soffice.bin (pid: 21440, threadinfo=e448 task=dc5bfa90) Stack: 0c01c56f4 df21b5dc c013ff95 c253a7ac dda673c0 0001 37c2 df21b5dc df21b654 5058c680 e32c4680 acb2e000 e4481fa4 37c2 c014c900 e32c4680 acb2e000 37c2 e4481fa4 e32c4680 fff7 bf823d43 e448 Call Trace: [c01c56f4] write_unix_file+0x2a6/0x45c [c013ff95] vma_link+0xbe/0xc5 [c014c900] vfs_write+0x87/0x11b [c014ca32] sys_write+0x3b/0x63 [c010274b] sysenter_past_esp+0x54/0x75 Code: 8d 43 14 e8 0b 38 13 00 e9 d1 fc ff ff 90 90 ba 00 e0 ff ff 21 e2 8b 12 8b 92 c4 04 00 00 8b 44 24 04 8b 52 50 83 7a 10 00 74 08 0f 0b 1d 00 60 b8 31 c0 ba 01 00 ff ff f0 0f c1 10 85 d2 0f 85 -- Jake Maciejewski [EMAIL PROTECTED]
Re: reiser4 bug
On Tuesday 11 April 2006 09:28, Matt Eaton wrote: I was running OpenOffice and tried saving a file. I'm using 2.6.16.1 + reiser4-for-2.6.16-1.patch.gz (Please help! I'm having to run openoffice on a different filesystem as this crash has occurred twice saving the same file - after a reboot) kernel BUG at fs/reiser4/plugin/file/tail_conversion.c:29! invalid opcode: [#1] it is known problem. Fixed in 2.6.17-rc1-mm2 (reiser4-have-get_exclusive_access-restart-transaction.patch). I am afraid the patch from mm would not apply cleanly to 2.6.16.1 + reiser4-for-2.6.16-1.patch, you may try to apply only the following hunk: @@ -20,13 +20,12 @@ void get_exclusive_access(unix_file_info assert(nikita-3047, LOCK_CNT_NIL(inode_sem_w)); assert(nikita-3048, LOCK_CNT_NIL(inode_sem_r)); /* -* deadlock detection: sometimes we commit a transaction under +* deadlock avoidance: sometimes we commit a transaction under * rw-semaphore on a file. Such commit can deadlock with another * thread that captured some block (hence preventing atom from being * committed) and waits on rw-semaphore. */ - assert(nikita-3361, get_current_context()-trans-atom == NULL); - BUG_ON(get_current_context()-trans-atom != NULL); + txn_restart_current(); LOCK_CNT_INC(inode_sem_w); down_write(uf_info-latch); uf_info-exclusive_use = 1; _ PREEMPT SMP Modules linked in: eeprom lm85 hwmon_vid i2c_i801 lp vmnet parport_pc parport vmmon nfs lockd sunrpc snd_seq_midi snd_emu10k1_synth snd_emux_synth snd_seq_virmidi snd_seq_midi_emul snd_pcm_oss snd_mixer_oss snd_seq_oss snd_seq_midi_event snd_seq snd_hda_intel snd_hda_codec snd_emu10k1 snd_rawmidi snd_ac97_codec snd_ac97_bus snd_pcm snd_seq_device snd_timer snd_page_alloc snd_util_mem snd_hwdep snd soundcore binfmt_misc ntfs usblp ide_cd cdrom nvidia e100 mii uhci_hcd ehci_hcdCPU:0 EIP:0060:[c01c623c]Tainted: P VLI EFLAGS: 00210282 (2.6.16.1 #1) EIP is at get_exclusive_access+0x1c/0x3d eax: df21b5dc ebx: 0001 ecx: df21b654 edx: c50e17d4 esi: acb2e000 edi: c50e1780 ebp: esp: e4481f30 ds: 007b es: 007b ss: 0068 Process soffice.bin (pid: 21440, threadinfo=e448 task=dc5bfa90) Stack: 0c01c56f4 df21b5dc c013ff95 c253a7ac dda673c0 0001 37c2 df21b5dc df21b654 5058c680 e32c4680 acb2e000 e4481fa4 37c2 c014c900 e32c4680 acb2e000 37c2 e4481fa4 e32c4680 fff7 bf823d43 e448 Call Trace: [c01c56f4] write_unix_file+0x2a6/0x45c [c013ff95] vma_link+0xbe/0xc5 [c014c900] vfs_write+0x87/0x11b [c014ca32] sys_write+0x3b/0x63 [c010274b] sysenter_past_esp+0x54/0x75 Code: 8d 43 14 e8 0b 38 13 00 e9 d1 fc ff ff 90 90 ba 00 e0 ff ff 21 e2 8b 12 8b 92 c4 04 00 00 8b 44 24 04 8b 52 50 83 7a 10 00 74 08 0f 0b 1d 00 60 b8 31 c0 ba 01 00 ff ff f0 0f c1 10 85 d2 0f 85 !DSPAM:443b3e85155598791294130! -- Alex.
Re: 2.6.17-rc1-mm1, nfsd/reiser4 BUG
On Tue, 2006-04-04 at 01:45 -0700, Andrew Morton wrote: ftp://ftp.kernel.org/pub/linux/kernel/people/akpm/patches/2.6/2.6.17-rc1/2.6.17-rc1-mm1/ While running the following command on a Reiser4 partition mounted on a MD RAID-0 device on top of two SCSI disks: find -type f -print0 | xargs -0 cat /dev/null I was running this on the local machine, not on the NFS mounts. In fact, the NFS mounts should have had none, or very little, activity. I started getting the following error messages. I had been running 2.6.16-rc5-mm3 and hadn't noticed anything like that, so when I get some time I may try to isolate it, if I can reproduce it. But, in case anyone else knows the problem, here it is BUG: unable to handle kernel NULL pointer dereference at virtual address printing eip: e0a9a3d2 *pde = Oops: 0002 [#1] PREEMPT last sysfs file: /devices/pci:00/:00:00.0/class Modules linked in: it87 hwmon_vid hwmon i2c_isa parport_pc lp parport nfsd exportfs lockd autofs4 sunrpc iptable_filter ip_tables x_tables binfmt_misc dm_mod ohci_hcd eepro100 uhci_hcd ehci_hcd sis_agp agpgart i2c_sis630 i2c_core snd_intel8x0 snd_ac97_codec snd_ac97_bus snd_pcm_oss snd_mixer_oss snd_pcm snd_timer snd soundcore snd_page_alloc e100 via_rhine sis900 mii crc32 ide_cd cdrom usbcore CPU:0 EIP:0060:[e0a9a3d2]Not tainted VLI EFLAGS: 00010246 (2.6.17-rc1-mm1 #1) EIP is at nfsd_cache_lookup+0x169/0x2e3 [nfsd] eax: ebx: dd05a120 ecx: ddc3da38 edx: esi: c815319e edi: ebp: 0007 esp: ddfe1f40 ds: 007b es: 007b ss: 0068 Process nfsd (pid: 2170, threadinfo=ddfe1000 task=c148a070) Stack: 0de05d000 ddc3d9e0 0011 0003 e0ac9d00 de05d000 e0aa80bc e0aa7fa0 e0aa80bc e0a93049 c51b9014 de05d064 e0aa80bc e0aa7fa0 de05d000 e0ab27d2 003d dd7eaba0 e0aa7fa0 de05d040 c51b9014 000186a3 0007 c51b9008 Call Trace: e0a93049 nfsd_dispatch+0x39/0x16f [nfsd] e0ab27d2 svc_process+0x38c/0x5c6 [sunrpc] e0a93509 nfsd+0x19a/0x315 [nfsd] e0a9336f nfsd+0x0/0x315 [nfsd] c0101005 kernel_thread_helper+0x5/0xb Code: 28 8b 54 24 0c 89 53 30 8b 53 24 a1 e8 ba 3b c0 89 43 34 89 d0 c1 e8 18 31 d0 8b 54 24 04 83 e0 3f 8d 0c 82 8b 03 8b 53 04 85 c0 89 02 74 03 89 50 04 8b 01 85 c0 89 03 74 03 89 58 04 89 19 80 EIP: [e0a9a3d2] nfsd_cache_lookup+0x169/0x2e3 [nfsd] SS:ESP 0068:ddfe1f40 6note: nfsd[2170] exited with preempt_count 1 BUG: unable to handle kernel NULL pointer dereference at virtual address printing eip: e0a9a3d2 *pde = Oops: 0002 [#2] PREEMPT last sysfs file: /devices/pci:00/:00:00.0/class Modules linked in: it87 hwmon_vid hwmon i2c_isa parport_pc lp parport nfsd exportfs lockd autofs4 sunrpc iptable_filter ip_tables x_tables binfmt_misc dm_mod ohci_hcd eepro100 uhci_hcd ehci_hcd sis_agp agpgart i2c_sis630 i2c_core snd_intel8x0 snd_ac97_codec snd_ac97_bus snd_pcm_oss snd_mixer_oss snd_pcm snd_timer snd soundcore snd_page_alloc e100 via_rhine sis900 mii crc32 ide_cd cdrom usbcore CPU:0 EIP:0060:[e0a9a3d2]Not tainted VLI EFLAGS: 00010246 (2.6.17-rc1-mm1 #1) EIP is at nfsd_cache_lookup+0x169/0x2e3 [nfsd] eax: ebx: dd05a160 ecx: ddc3da38 edx: esi: c815319e edi: ebp: 0007 esp: dff43f40 ds: 007b es: 007b ss: 0068 Process nfsd (pid: 2167, threadinfo=dff43000 task=df599550) Stack: 0de7f5000 ddc3d9e0 0011 0003 e0ac9d00 de7f5000 e0aa80bc e0aa7fa0 e0aa80bc e0a93049 d7196014 de7f5064 e0aa80bc e0aa7fa0 de7f5000 e0ab27d2 003d dd7eaba0 e0aa7fa0 de7f5040 d7196014 000186a3 0007 d7196008 Call Trace: e0a93049 nfsd_dispatch+0x39/0x16f [nfsd] e0ab27d2 svc_process+0x38c/0x5c6 [sunrpc] e0a93509 nfsd+0x19a/0x315 [nfsd] e0a9336f nfsd+0x0/0x315 [nfsd] c0101005 kernel_thread_helper+0x5/0xb Code: 28 8b 54 24 0c 89 53 30 8b 53 24 a1 e8 ba 3b c0 89 43 34 89 d0 c1 e8 18 31 d0 8b 54 24 04 83 e0 3f 8d 0c 82 8b 03 8b 53 04 85 c0 89 02 74 03 89 50 04 8b 01 85 c0 89 03 74 03 89 58 04 89 19 80 EIP: [e0a9a3d2] nfsd_cache_lookup+0x169/0x2e3 [nfsd] SS:ESP 0068:dff43f40 6note: nfsd[2167] exited with preempt_count 1 BUG: unable to handle kernel NULL pointer dereference at virtual address printing eip: e0a9a3d2 *pde = Oops: 0002 [#3] PREEMPT last sysfs file: /devices/pci:00/:00:00.0/class Modules linked in: it87 hwmon_vid hwmon i2c_isa parport_pc lp parport nfsd exportfs lockd autofs4 sunrpc iptable_filter ip_tables x_tables binfmt_misc dm_mod ohci_hcd eepro100 uhci_hcd ehci_hcd sis_agp agpgart i2c_sis630 i2c_core snd_intel8x0 snd_ac97_codec snd_ac97_bus snd_pcm_oss snd_mixer_oss snd_pcm snd_timer snd soundcore snd_page_alloc e100 via_rhine sis900 mii crc32 ide_cd cdrom usbcore CPU:0 EIP:0060:[e0a9a3d2]Not tainted VLI EFLAGS: 00010246 (2.6.17-rc1-mm1 #1) EIP is at nfsd_cache_lookup+0x169/0x2e3 [nfsd] eax: ebx: dd05a1a0 ecx: ddc3da38 edx: esi: c815319e edi: ebp:
Re: 2.6.16-rc6-mm2: reiser4 BUG when unmounting fs
Am Mittwoch, 22. März 2006 08:43 schrieb Vladimir V. Saveliev: The attached patch fixes the problem. confirmed, works in 2.6.16-mm1. regards Alex -- Encrypted Mails welcome. PGP-Key at http://zodiac.dnsalias.org/misc/pgpkey.asc | Key-ID: 0x6D7DD291 pgp8PhYOevnwY.pgp Description: PGP signature
Re: 2.6.16-rc6-mm2: reiser4 BUG when unmounting fs
Le 22.03.2006 08:43, Vladimir V. Saveliev a écrit : Hello On Tue, 2006-03-21 at 21:38 +0100, Laurent Riffard wrote: Le 21.03.2006 21:27, Laurent Riffard a écrit : Le 18.03.2006 13:40, Andrew Morton a écrit : ftp://ftp.kernel.org/pub/linux/kernel/people/akpm/patches/2.6/2.6.16-rc6/2.6.16-rc6-mm2/ Hello, This BUG is 100% reproducible. Simply boot to runlevel 1 and then unmount a reiser4 fs: The attached patch fixes the problem. Ok, it works fine now. Thanks. fs/reiser4/page_cache.c |4 1 files changed, 4 deletions(-) diff -puN fs/reiser4/page_cache.c~reiser4-fix-bd_inode fs/reiser4/page_cache.c --- linux-2.6.16-rc6-mm2/fs/reiser4/page_cache.c~reiser4-fix-bd_inode 2006-03-21 06:42:42.0 +0300 +++ linux-2.6.16-rc6-mm2-vs/fs/reiser4/page_cache.c 2006-03-21 07:21:54.0 +0300 @@ -198,10 +198,6 @@ init_fake_inode(struct super_block *supe { assert(nikita-2168, fake-i_state I_NEW); fake-i_mapping-a_ops = formatted_fake_as_ops; - fake-i_blkbits = super-s_blocksize_bits; - fake-i_size = ~0ull; - fake-i_rdev = super-s_bdev-bd_dev; - fake-i_bdev = super-s_bdev; *pfake = fake; /* NOTE-NIKITA something else? */ unlock_new_inode(fake); _ -- laurent
Re: 2.6.16-rc6-mm2: reiser4 BUG when unmounting fs
Le 21.03.2006 21:27, Laurent Riffard a écrit : Le 18.03.2006 13:40, Andrew Morton a écrit : ftp://ftp.kernel.org/pub/linux/kernel/people/akpm/patches/2.6/2.6.16-rc6/2.6.16-rc6-mm2/ Hello, This BUG is 100% reproducible. Simply boot to runlevel 1 and then unmount a reiser4 fs: Oops! Somebody already reported it: http://lkml.org/lkml/2006/3/21/88. Sorry for the noise... -- laurent
Re: possible reiser4 bug
Alexander Zarochentsev wrote: please use better version of the patch (attached). Also tried the patch and it seems problem with linux 2.6.16-rc2-mm1 is gone. Not much tested, but it always crashed when accessing a particular file, and that's fixed now. For me, this can be included in official sources. Thanks, Maarten Deprez
Re: possible reiser4 bug
On Sunday 26 February 2006 08:33, Alexander Zarochentsev wrote: please use better version of the patch (attached). Hello, after some days of testing the problem seems gone. The kernel survived many CPU and I/O intensive tasks (some compilations of gcc and firefox) without an hiccup, whereas without the patch it hanged quite soon. Thanks and regards, Francesco -- Dr. Francesco Biscani Dipartimento di Astronomia Università di Padova [EMAIL PROTECTED]
Re: reiser4 bug in 2.6.16-rc2-mm1
Sorry for the delay.On 2/22/06, Vladimir V. Saveliev [EMAIL PROTECTED] wrote: HelloOn Tue, 2006-02-21 at 19:58 -0600, John Cohorn wrote: First off, thank you for taking the time to respond. Basically any heavy activity on the partition seems to trigger it. Here's a short transcript: dually ~ # cat /proc/version Linux version 2.6.16-rc2-mm1 ([EMAIL PROTECTED]) (gcc version 3.3.6 (Gentoo 3.3.6, ssp-3.3.6-1.0, pie-8.7.8)) #7 SMP PREEMPT Thu Feb 16 21:19:53 Local time zone must be set-- dually ~ # mkfs.reiser4 --version mkfs.reiser4 1.0.5 Copyright (C) 2001, 2002, 2003, 2004 by Hans Reiser, licensing governed by reiser4progs/COPYING. dually ~ # mkfs.reiser4 /dev/sdb1 --yes dually ~ # mount /dev/sdb1 /data1 dually ~ # klogd -d -f ~/kernel.log dually ~ # cd /data1 dually data1 # cp -ra /usr/src/linux-2.6.16-rc2-mm1/ . dually data1 # tar cf linux.bz2 linux-2.6.16-rc2-mm1/ ...after about half a minute of activity I receive the message which I've attached as a small JPEG reiser4 panicked cowardly: reiser4[tar(4078)] commit_current_atom (fs/reiser4/txnmgr.c:1130) [zam-597]:.Would you please try your test on ide disk?At your suggestion I plugged in a spare PATA IDE disk, formatted it with R4, and was about to copy several copies of the Linux kernel sources to the partition, tar them up, and move them around with no lockups or apparent corruption. I can't see whether it was line 1130 or1138.I'll take a quick look myself, but I'm completely unfamiliar with reiser4 sources and the kernel sources in general. After this my system is completely locked up. When I mount the partition after restarting it no longer contains any files. I have few or no problems when creating, mounting, and working on reiser4 partitions when booted from the System recovery liveCD so it must be something in the latest sources or in my kernel configuration IIRC, there were changes (not in reiser4) which caused kernel to noticei/o errors on SCSI devices where older kernels did not see any problems.Reiser3 is completely stable on my SATA drives...is R4 simply doing more error checking? Why should even a severe I/O error completely freeze my machine? So it seems as though the problem is in the interaction between R4 and the SCSI code...are any workarounds to this apparent? (which is 100% stable for everything else under reiser3, but I haven't bothered to perfect yet). Apologies about the JPEG, I could not figure out how to get the kernel output logged into a file(maybe it freezes before the file can flush to disk, i dunno). If you know of a way, I welcome instruction.reiserfs-list@namesys.com does not accept binary attachments. Pleasesend the image to [EMAIL PROTECTED] and [EMAIL PROTECTED]done. You should setup either serial or netconsole to catch kernel output.Please look at instructions:linux/Documentation/serial-console.txtlinux/Documentation/networking/netconsole.txt Thanks for the suggestions. I'll see if I can set up logging kernel messages to my laptop.
Re: possible reiser4 bug
thanks for the patch, Alex, but yesterday i converted all of my harddisks to reiserFS in an act of frustration. Sorry that i can't help you test your patch. Alexander Zarochentsev wrote: hello, On Friday 24 February 2006 08:35, Alec Burney wrote: [...] would you please try the attached patch. fs/reiser4/plugin/file/file.c |9 + 1 files changed, 5 insertions(+), 4 deletions(-) Index: linux-2.6.16-rc3-mm1/fs/reiser4/plugin/file/file.c === --- linux-2.6.16-rc3-mm1.orig/fs/reiser4/plugin/file/file.c +++ linux-2.6.16-rc3-mm1/fs/reiser4/plugin/file/file.c @@ -2454,6 +2454,7 @@ ssize_t write_unix_file(struct file *fil if (inode_get_flag(inode, REISER4_PART_CONV)) { /* we can not currently write to a file which is partially converted */ + txn_restart(ctx); get_exclusive_access(uf_info); result = finish_conversion(inode); drop_exclusive_access(uf_info); @@ -2481,6 +2482,7 @@ ssize_t write_unix_file(struct file *fil 3) convert file to extents to not enter here on each write to mmaped file */ + txn_restart(ctx); get_exclusive_access(uf_info); result = check_pages_unix_file(file, inode); drop_exclusive_access(uf_info); @@ -2535,14 +2537,13 @@ ssize_t write_unix_file(struct file *fil while (left 0) { int excl = 0; - /* getting exclusive or not exclusive access requires no -transaction open */ - txn_restart_current(); - /* faultin user page */ fault_in_pages_readable(buf, left PAGE_CACHE_SIZE ? PAGE_CACHE_SIZE : left); + /* getting exclusive or not exclusive access requires no +transaction open */ + txn_restart(ctx); if (inode-i_size == 0) { get_exclusive_access(uf_info); excl = 1; -- cell: 808.332.5131
Re: possible reiser4 bug
hello, On Friday 24 February 2006 08:35, Alec Burney wrote: i have been experiencing hard locks (preceded by intense 5 load averages, disk i/o timeouts, and processes gone astray) that seem to be related to a possible bug in reiser4. here is the information that i think is important: a sample dmesg from right before a hardlock: [ cut here ] kernel BUG at fs/reiser4/plugin/file/tail_conversion.c:29! invalid opcode: [#1] SMP last sysfs file: /class/vc/vcsa7/dev Modules linked in: snd_rtctimer CPU:0 EIP:0060:[c019c3bb]Not tainted VLI EFLAGS: 00210282 (2.6.16-rc3-mm1 #1) EIP is at get_exclusive_access+0x1b/0x44 eax: eec1b1ac ebx: ecda70d8 ecx: ecda7140 edx: esi: addee000 edi: eec1b160 ebp: esp: dfd89f38 ds: 007b es: 007b ss: 0068 Process soffice.bin (pid: 11208, threadinfo=dfd89000 task=e1193a70) Stack: 1 c019b956 c013f613 eecbfb74 e8daa560 0001 5600 ecda70d8 ecda7140 addee000 ea8ea7c0 5600 d06d2740 5600 ea8ea7c0 c03b0e10 c014bbf3 dfd89fa4 addee000 ea8ea7c0 fff7 bfb4a870 Call Trace: c019b956 write_unix_file+0x2a4/0x44b c013f613 vma_link+0xa2/0xbc c014bbf3 vfs_write+0xab/0x14c c014bd32 sys_write+0x3c/0x62 c01025bf sysenter_past_esp+0x54/0x75 Code: 8d 43 10 e8 5c 54 1a 00 e9 f8 fc ff ff 90 90 90 53 89 c3 b8 00 f0 ff ff 21 e0 8b 00 8b 80 c8 04 00 00 8b 40 48 83 78 0c 00 74 08 0f 0b 1d 00 e0 58 36 c0 e8 28 61 1a 00 ba 01 00 ff ff 89 d8 f0 [...] my question: can this be fixed, has it been fixed, or can i convert this drive online to reiserfs. sorry for such a long message, but this is very important to me. thank you! would you please try the attached patch. -- Alex. fs/reiser4/plugin/file/file.c |9 + 1 files changed, 5 insertions(+), 4 deletions(-) Index: linux-2.6.16-rc3-mm1/fs/reiser4/plugin/file/file.c === --- linux-2.6.16-rc3-mm1.orig/fs/reiser4/plugin/file/file.c +++ linux-2.6.16-rc3-mm1/fs/reiser4/plugin/file/file.c @@ -2454,6 +2454,7 @@ ssize_t write_unix_file(struct file *fil if (inode_get_flag(inode, REISER4_PART_CONV)) { /* we can not currently write to a file which is partially converted */ + txn_restart(ctx); get_exclusive_access(uf_info); result = finish_conversion(inode); drop_exclusive_access(uf_info); @@ -2481,6 +2482,7 @@ ssize_t write_unix_file(struct file *fil 3) convert file to extents to not enter here on each write to mmaped file */ + txn_restart(ctx); get_exclusive_access(uf_info); result = check_pages_unix_file(file, inode); drop_exclusive_access(uf_info); @@ -2535,14 +2537,13 @@ ssize_t write_unix_file(struct file *fil while (left 0) { int excl = 0; - /* getting exclusive or not exclusive access requires no - transaction open */ - txn_restart_current(); - /* faultin user page */ fault_in_pages_readable(buf, left PAGE_CACHE_SIZE ? PAGE_CACHE_SIZE : left); + /* getting exclusive or not exclusive access requires no + transaction open */ + txn_restart(ctx); if (inode-i_size == 0) { get_exclusive_access(uf_info); excl = 1;
Re: possible reiser4 bug
On Saturday 25 February 2006 22:09, Alexander Zarochentsev wrote: would you please try the attached patch. I'm having a similar problem on 2.6.16-rc4 + reiser4 from -mm1. I will be testing the attached patch. Regards, Francesco -- Dr. Francesco Biscani Dipartimento di Astronomia Università di Padova [EMAIL PROTECTED]
Re: possible reiser4 bug
On Sunday 26 February 2006 05:33, Francesco Biscani wrote: On Saturday 25 February 2006 22:09, Alexander Zarochentsev wrote: would you please try the attached patch. I'm having a similar problem on 2.6.16-rc4 + reiser4 from -mm1. I will be testing the attached patch. please use better version of the patch (attached). Regards, Francesco -- Alex. fs/reiser4/plugin/file/file.c|9 + fs/reiser4/plugin/item/extent_file_ops.c |2 +- fs/reiser4/vfs_ops.c |5 - 3 files changed, 10 insertions(+), 6 deletions(-) Index: linux-2.6.16-rc3-mm1/fs/reiser4/plugin/file/file.c === --- linux-2.6.16-rc3-mm1.orig/fs/reiser4/plugin/file/file.c +++ linux-2.6.16-rc3-mm1/fs/reiser4/plugin/file/file.c @@ -2454,6 +2454,7 @@ ssize_t write_unix_file(struct file *fil if (inode_get_flag(inode, REISER4_PART_CONV)) { /* we can not currently write to a file which is partially converted */ + txn_restart(ctx); get_exclusive_access(uf_info); result = finish_conversion(inode); drop_exclusive_access(uf_info); @@ -2481,6 +2482,7 @@ ssize_t write_unix_file(struct file *fil 3) convert file to extents to not enter here on each write to mmaped file */ + txn_restart(ctx); get_exclusive_access(uf_info); result = check_pages_unix_file(file, inode); drop_exclusive_access(uf_info); @@ -2535,14 +2537,13 @@ ssize_t write_unix_file(struct file *fil while (left 0) { int excl = 0; - /* getting exclusive or not exclusive access requires no - transaction open */ - txn_restart_current(); - /* faultin user page */ fault_in_pages_readable(buf, left PAGE_CACHE_SIZE ? PAGE_CACHE_SIZE : left); + /* getting exclusive or not exclusive access requires no + transaction open */ + txn_restart(ctx); if (inode-i_size == 0) { get_exclusive_access(uf_info); excl = 1; Index: linux-2.6.16-rc3-mm1/fs/reiser4/vfs_ops.c === --- linux-2.6.16-rc3-mm1.orig/fs/reiser4/vfs_ops.c +++ linux-2.6.16-rc3-mm1/fs/reiser4/vfs_ops.c @@ -209,8 +209,11 @@ void writeout(struct super_block *sb, st void reiser4_throttle_write(struct inode *inode) { - txn_restart_current(); + reiser4_context *ctx = get_current_context(); + + txn_restart(ctx); balance_dirty_pages_ratelimited(inode-i_mapping); + txn_restart(ctx); } const char *REISER4_SUPER_MAGIC_STRING = ReIsEr4; Index: linux-2.6.16-rc3-mm1/fs/reiser4/plugin/item/extent_file_ops.c === --- linux-2.6.16-rc3-mm1.orig/fs/reiser4/plugin/item/extent_file_ops.c +++ linux-2.6.16-rc3-mm1/fs/reiser4/plugin/item/extent_file_ops.c @@ -803,7 +803,7 @@ extent_balance_dirty_pages(struct inode fault_in_pages_readable(f-data, f-length PAGE_CACHE_SIZE ? PAGE_CACHE_SIZE : f-length); - + txn_restart_current(); if (excl) get_exclusive_access(uf_info); else
possible reiser4 bug
i have been experiencing hard locks (preceded by intense 5 load averages, disk i/o timeouts, and processes gone astray) that seem to be related to a possible bug in reiser4. here is the information that i think is important: a sample dmesg from right before a hardlock: [ cut here ] kernel BUG at fs/reiser4/plugin/file/tail_conversion.c:29! invalid opcode: [#1] SMP last sysfs file: /class/vc/vcsa7/dev Modules linked in: snd_rtctimer CPU:0 EIP:0060:[c019c3bb]Not tainted VLI EFLAGS: 00210282 (2.6.16-rc3-mm1 #1) EIP is at get_exclusive_access+0x1b/0x44 eax: eec1b1ac ebx: ecda70d8 ecx: ecda7140 edx: esi: addee000 edi: eec1b160 ebp: esp: dfd89f38 ds: 007b es: 007b ss: 0068 Process soffice.bin (pid: 11208, threadinfo=dfd89000 task=e1193a70) Stack: 1 c019b956 c013f613 eecbfb74 e8daa560 0001 5600 ecda70d8 ecda7140 addee000 ea8ea7c0 5600 d06d2740 5600 ea8ea7c0 c03b0e10 c014bbf3 dfd89fa4 addee000 ea8ea7c0 fff7 bfb4a870 Call Trace: c019b956 write_unix_file+0x2a4/0x44b c013f613 vma_link+0xa2/0xbc c014bbf3 vfs_write+0xab/0x14c c014bd32 sys_write+0x3c/0x62 c01025bf sysenter_past_esp+0x54/0x75 Code: 8d 43 10 e8 5c 54 1a 00 e9 f8 fc ff ff 90 90 90 53 89 c3 b8 00 f0 ff ff 21 e0 8b 00 8b 80 c8 04 00 00 8b 40 48 83 78 0c 00 74 08 0f 0b 1d 00 e0 58 36 c0 e8 28 61 1a 00 ba 01 00 ff ff 89 d8 f0 my uname -a: Linux helix 2.6.16-rc3-mm1 #1 SMP Sun Feb 19 21:46:21 HST 2006 i686 AMD Athlon(tm) MP 2600+ AuthenticAMD GNU/Linux my fsck.reiser4 -V fsck.reiser4 1.0.5 Copyright (C) 2001, 2002, 2003, 2004 by Hans Reiser, licensing governed by reiser4progs/COPYING. my fsck.reiser4 -p from a drive known to be causing trouble Default profiles: format: format40 (id:0x0 type:0x8) journal:journal40 (id:0x0 type:0xf) oid:oid40 (id:0x0 type:0x9) alloc: alloc40 (id:0x0 type:0xe) key:key_large (id:0x1 type:0x10) node: node40 (id:0x0 type:0x2) statdata: stat40 (id:0x0 type:0x1) nodeptr:nodeptr40 (id:0x3 type:0x1) direntry: cde40 (id:0x2 type:0x1) tail: plain40 (id:0x6 type:0x1) extent: extent40 (id:0x5 type:0x1) acl:absent (id:0x4 type:0x1) permission: absent (id:0x0 type:0x6) regular:reg40 (id:0x0 type:0x0) directory: dir40 (id:0x1 type:0x0) symlink:sym40 (id:0x2 type:0x0) special:spl40 (id:0x3 type:0x0) hash: r5_hash (id:0x1 type:0x3) fibration: ext_1_fibre (id:0x2 type:0x4) formatting: smart (id:0x2 type:0x5) the partition in question is 232903996 kilobytes large (that's 223 gigabytes), and does not fail badblocks or fsck.reiser4 testing. my system holds up to memtest86, and does not crash if the drive is not mounted and used. my question: can this be fixed, has it been fixed, or can i convert this drive online to reiserfs. sorry for such a long message, but this is very important to me. thank you!
Re: reiser4 bug in 2.6.16-rc2-mm1
Hello On Tue, 2006-02-21 at 19:58 -0600, John Cohorn wrote: First off, thank you for taking the time to respond. Basically any heavy activity on the partition seems to trigger it. Here's a short transcript: dually ~ # cat /proc/version Linux version 2.6.16-rc2-mm1 ([EMAIL PROTECTED]) (gcc version 3.3.6 (Gentoo 3.3.6, ssp-3.3.6-1.0, pie-8.7.8)) #7 SMP PREEMPT Thu Feb 16 21:19:53 Local time zone must be set-- dually ~ # mkfs.reiser4 --version mkfs.reiser4 1.0.5 Copyright (C) 2001, 2002, 2003, 2004 by Hans Reiser, licensing governed by reiser4progs/COPYING. dually ~ # mkfs.reiser4 /dev/sdb1 --yes dually ~ # mount /dev/sdb1 /data1 dually ~ # klogd -d -f ~/kernel.log dually ~ # cd /data1 dually data1 # cp -ra /usr/src/linux-2.6.16-rc2-mm1/ . dually data1 # tar cf linux.bz2 linux-2.6.16-rc2-mm1/ ...after about half a minute of activity I receive the message which I've attached as a small JPEG reiser4 panicked cowardly: reiser4[tar(4078)] commit_current_atom (fs/reiser4/txnmgr.c:1130) [zam-597]:. Would you please try your test on ide disk? I can't see whether it was line 1130 or 1138. I'll take a quick look myself, but I'm completely unfamiliar with reiser4 sources and the kernel sources in general. After this my system is completely locked up. When I mount the partition after restarting it no longer contains any files. I have few or no problems when creating, mounting, and working on reiser4 partitions when booted from the System recovery liveCD so it must be something in the latest sources or in my kernel configuration IIRC, there were changes (not in reiser4) which caused kernel to notice i/o errors on SCSI devices where older kernels did not see any problems. (which is 100% stable for everything else under reiser3, but I haven't bothered to perfect yet). Apologies about the JPEG, I could not figure out how to get the kernel output logged into a file(maybe it freezes before the file can flush to disk, i dunno). If you know of a way, I welcome instruction. reiserfs-list@namesys.com does not accept binary attachments. Please send the image to [EMAIL PROTECTED] and [EMAIL PROTECTED] You should setup either serial or netconsole to catch kernel output. Please look at instructions: linux/Documentation/serial-console.txt linux/Documentation/networking/netconsole.txt
Re: reiser4 bug in 2.6.16-rc2-mm1
Hello On Fri, 2006-02-17 at 10:43 -0600, John Cohorn wrote: Is there a known issue with 2.6.16-xxx-mm1? I too am having severe and very random, but reproducible problems which will completely lock up my machine minutes after mounting an R4 partition if not seconds. I did try disabling 4K stacks in my .config. Also, this is an Athlon 64 X2 I'm running with so maybe there are SMP issues? no, it does not look so. One of my friends said that Reiser4 became unstable for him also when he upgraded to a dual core Athlon64...but until I test for myself that's just a wild theory that it's an SMP bug. This machine is otherwise stable in Memtest, in Windows playing the latest games, and on Linux compiling huge amounts of software. If there are not known issues with the latest kernels then I will happily try to send detailed configurations and logs of what's happening this evening. I have two shiny new 250GB SATA drives on my machine and I can test freely with one of them until I need the space. I'd love to help squash any bugs that I can :) If you described the exact sequence of commands which makes reiser4 to crash - that would help a lot.
Re: reiser4 bug in 2.6.16-rc2-mm1
First off, thank you for taking the time to respond. Basically any heavy activity on the partition seems to trigger it. Here's a short transcript:dually ~ # cat /proc/versionLinux version 2.6.16-rc2-mm1 ([EMAIL PROTECTED]) (gcc version 3.3.6 (Gentoo 3.3.6, ssp-3.3.6-1.0, pie-8.7.8)) #7 SMP PREEMPT Thu Feb 16 21:19:53 Local time zone must be set--dually ~ # mkfs.reiser4 --versionmkfs.reiser4 1.0.5Copyright (C) 2001, 2002, 2003, 2004 by Hans Reiser, licensing governed by reiser4progs/COPYING. dually ~ # mkfs.reiser4 /dev/sdb1 --yesdually ~ # mount /dev/sdb1 /data1 dually ~ # klogd -d -f ~/kernel.logdually ~ # cd /data1dually data1 # cp -ra /usr/src/linux-2.6.16-rc2-mm1/ .dually data1 # tar cf linux.bz2 linux-2.6.16-rc2-mm1/...after about half a minute of activity I receive the message which I've attached as a small JPEG reiser4 panicked cowardly: reiser4[tar(4078)] commit_current_atom (fs/reiser4/txnmgr.c:1130) [zam-597]:. I can't see whether it was line 1130 or 1138. I'll take a quick look myself, but I'm completely unfamiliar with reiser4 sources and the kernel sources in general. After this my system is completely locked up. When I mount the partition after restarting it no longer contains any files. I have few or no problems when creating, mounting, and working on reiser4 partitions when booted from the System recovery liveCD so it must be something in the latest sources or in my kernel configuration(which is 100% stable for everything else under reiser3, but I haven't bothered to perfect yet). Apologies about the JPEG, I could not figure out how to get the kernel output logged into a file(maybe it freezes before the file can flush to disk, i dunno). If you know of a way, I welcome instruction. On 2/21/06, Vladimir V. Saveliev [EMAIL PROTECTED] wrote: HelloOn Fri, 2006-02-17 at 10:43 -0600, John Cohorn wrote: Is there a known issue with 2.6.16-xxx-mm1? I too am having severe and very random, but reproducible problems which will completely lock up my machine minutes after mounting an R4 partition if not seconds. I did try disabling 4K stacks in my .config. Also, this is an Athlon 64 X2 I'm running with so maybe there are SMP issues? no, it does not look so. One of my friends said that Reiser4 became unstable for him also when he upgraded to a dual core Athlon64...but until I test for myself that's just a wild theory that it's an SMP bug. This machine is otherwise stable in Memtest, in Windows playing the latest games, and on Linux compiling huge amounts of software. If there are not known issues with the latest kernels then I will happily try to send detailed configurations and logs of what's happening this evening. I have two shiny new 250GB SATA drives on my machine and I can test freely with one of them until I need the space. I'd love to help squash any bugs that I can :) If you described the exact sequence of commands which makes reiser4 tocrash - that would help a lot.
Re: reiser4 bug in 2.6.16-rc2-mm1
Is there a known issue with 2.6.16-xxx-mm1? I too am having severe and very random, but reproducible problems which will completely lock up my machine minutes after mounting an R4 partition if not seconds. I did try disabling 4K stacks in my .config. Also, this is an Athlon 64 X2 I'm running with so maybe there are SMP issues? One of my friends said that Reiser4 became unstable for him also when he upgraded to a dual core Athlon64...but until I test for myself that's just a wild theory that it's an SMP bug. This machine is otherwise stable in Memtest, in Windows playing the latest games, and on Linux compiling huge amounts of software. If there are not known issues with the latest kernels then I will happily try to send detailed configurations and logs of what's happening this evening. I have two shiny new 250GB SATA drives on my machine and I can test freely with one of them until I need the space. I'd love to help squash any bugs that I can :) On 2/16/06, Andrew James Wade [EMAIL PROTECTED] wrote: On Friday 10 February 2006 09:22, Maarten Deprez wrote: Hello, reiser4 on linux 2.6.16-rc2-mm1 bugs for me in plugins/file/tail_conversion.c line 29, locking up a process sometimes, when it is reading a file. Greetings, Maarten DeprezStill present in 2.6.16-rc3-mm1:[ cut here ]kernel BUG at fs/reiser4/plugin/file/tail_conversion.c:81!invalid opcode: [#1] PREEMPTlast sysfs file: /devices/pci:00/:00:01.0/:01:00.0/i2c-0/nameCPU:0EIP:0060:[get_nonexclusive_access+30/49]Not tainted VLIEFLAGS: 00010286 (2.6.16-rc3-mm1 #2)EIP is at get_nonexclusive_access+0x1e/0x31 eax: cc2ec288 ebx: ecx: cb87d4e8 edx: esi: cb87d4e8 edi: ebp: d30ba574 esp: d3934e00ds: 007b es: 007b ss: 0068Process kmail (pid: 21299, threadinfo=d3934000 task=d8577570) Stack: 0c01ca9ae d94d157c d3934ed8 cb87d540 000f 00320af1 df136ef4 f000 d20b2494 1000 d94d158c 0127 43f543c8 2ac0f373 d94d158c Call Trace: c01ca9ae write_extent+0x68d/0xbc3 c01cd0e2 item_length_by_coord+0xb/0xf c01c8125 nr_units_extent+0x5/0xd c01c94ef init_coord_extension_extent+0x60/0xdf c01b5031 set_file_state+0x26/0x5b c01b5128 find_file_item+0xc2/0xd4 c01ca321 write_extent+0x0/0xbc3 c01b6c46 write_flow+0x248/0x2df c01b74bc write_unix_file+0x343/0x4cc c01345f6 lru_cache_add_active+0x47/0x5d c01b7179 write_unix_file+0x0/0x4cc c01488f5 vfs_write+0x83/0x122 c014910e sys_write+0x3c/0x63 c0102ac7 sysenter_past_esp+0x54/0x75Code: 81 c4 b0 00 00 00 89 e8 5b 5e 5f 5d c3 85 d2 89 c1 75 20 b8 00 f0 ff ff 21 e0 8b 00 8b 80 c4 04 00 00 8b 40 40 83 78 08 00 74 08 0f 0b 51 00 b8 b8 38 c0 89 c8 ff 00 0f 88 e1 06 00 00 c3 55 ba 44reiser4[kmail(21299)]: release_unix_file (fs/reiser4/plugin/file/file.c:2674)[vs-44]:WARNING: out of memory?4reiser4[kmail(21299)]: release_unix_file (fs/reiser4/plugin/file/file.c:2674)[vs-44]: WARNING: out of memory?...
Re: reiser4 bug in 2.6.16-rc2-mm1
On Friday 10 February 2006 09:22, Maarten Deprez wrote: Hello, reiser4 on linux 2.6.16-rc2-mm1 bugs for me in plugins/file/tail_conversion.c line 29, locking up a process sometimes, when it is reading a file. Greetings, Maarten Deprez Still present in 2.6.16-rc3-mm1: [ cut here ] kernel BUG at fs/reiser4/plugin/file/tail_conversion.c:81! invalid opcode: [#1] PREEMPT last sysfs file: /devices/pci:00/:00:01.0/:01:00.0/i2c-0/name CPU:0 EIP:0060:[get_nonexclusive_access+30/49]Not tainted VLI EFLAGS: 00010286 (2.6.16-rc3-mm1 #2) EIP is at get_nonexclusive_access+0x1e/0x31 eax: cc2ec288 ebx: ecx: cb87d4e8 edx: esi: cb87d4e8 edi: ebp: d30ba574 esp: d3934e00 ds: 007b es: 007b ss: 0068 Process kmail (pid: 21299, threadinfo=d3934000 task=d8577570) Stack: 0c01ca9ae d94d157c d3934ed8 cb87d540 000f 00320af1 df136ef4 f000 d20b2494 1000 d94d158c 0127 43f543c8 2ac0f373 d94d158c Call Trace: c01ca9ae write_extent+0x68d/0xbc3 c01cd0e2 item_length_by_coord+0xb/0xf c01c8125 nr_units_extent+0x5/0xd c01c94ef init_coord_extension_extent+0x60/0xdf c01b5031 set_file_state+0x26/0x5b c01b5128 find_file_item+0xc2/0xd4 c01ca321 write_extent+0x0/0xbc3 c01b6c46 write_flow+0x248/0x2df c01b74bc write_unix_file+0x343/0x4cc c01345f6 lru_cache_add_active+0x47/0x5d c01b7179 write_unix_file+0x0/0x4cc c01488f5 vfs_write+0x83/0x122 c014910e sys_write+0x3c/0x63 c0102ac7 sysenter_past_esp+0x54/0x75 Code: 81 c4 b0 00 00 00 89 e8 5b 5e 5f 5d c3 85 d2 89 c1 75 20 b8 00 f0 ff ff 21 e0 8b 00 8b 80 c4 04 00 00 8b 40 40 83 78 08 00 74 08 0f 0b 51 00 b8 b8 38 c0 89 c8 ff 00 0f 88 e1 06 00 00 c3 55 ba 44reiser4[kmail(21299)]: release_unix_file (fs/reiser4/plugin/file/file.c:2674)[vs-44]: WARNING: out of memory? 4reiser4[kmail(21299)]: release_unix_file (fs/reiser4/plugin/file/file.c:2674)[vs-44]: WARNING: out of memory? ...
reiser4 bug in 2.6.16-rc2-mm1
Hello, reiser4 on linux 2.6.16-rc2-mm1 bugs for me in plugins/file/tail_conversion.c line 29, locking up a process sometimes, when it is reading a file. Greetings, Maarten Deprez
reiser4 bug
Hi! after some days of uptime i just had a soft lockup with / and /usr on reiser4 while running an emerge (which quit on make install with a segmentation fault). RAM was at 192/512 mb, lots of diskspace left on both partitions. I thought it will be able to sync after some seconds, but after 5 minutes hanging at this state, i had to reset the system. kernel is 2.6.15-rc2 with reiser4 from 2.6.15-rc2-mm1. reiser4 is mounted noatime,nodiratime (otherwhise it wouldn't be usable at all atm because of hanging at tasks like 'ebuild foo.ebuild digest' and the already known vi :wq! - where it seems to take forever to return to the prompt). following is the bug message: http://phpfi.com/89498 regards! tom
Re: reiser4 bug
Hello, On Sun, Feb 27, 2005 at 07:34:11PM +, Szabolcs Illes wrote: Hi All, I have been using reiser4fs for 4 months without any problem since today. After tryinig to open a movie with mplayer a have this in my log: how large was the movie file? We have another similar bug report related to writing/reading of a large (20GB) tar file, so file size may be important. Feb 27 14:00:22 sunset kernel: reiser4[mc(9192)]: write_sd_by_inode_common (fs/reiser4/plugin/object.c:480)[nikita-2221]: Feb 27 14:00:22 sunset kernel: WARNING: Failed to save sd for 1599330: -5 Feb 27 14:00:22 sunset kernel: reiser4[mc(9192)]: create_child_common (fs/reiser4/plugin/dir/dir.c:501)[nikita-2219]: Feb 27 14:00:22 sunset kernel: WARNING: Failed to create sd for 1599330 Feb 27 14:00:23 sunset kernel: reiser4[cupsd(3296)]: parse_node40 (fs/reiser4/plugin/node/node40.c:767)[nikita-494]: Feb 27 14:00:23 sunset kernel: WARNING: Wrong level found in node: 1 != 95 Feb 27 14:00:23 sunset kernel: reiser4[cupsd(3296)]: parse_node40 (fs/reiser4/plugin/node/node40.c:767)[nikita-494]: Feb 27 14:00:23 sunset kernel: WARNING: Wrong level found in node: 1 != 95 Feb 27 14:00:24 sunset kernel: reiser4[cupsd(3296)]: parse_node40 (fs/reiser4/plugin/node/node40.c:767)[nikita-494]: Feb 27 14:00:24 sunset kernel: WARNING: Wrong level found in node: 1 != 95 Thanks, -- Alex.
Re: reiser4 bug
Alex Zarochentsev wrote: Hello, On Sun, Feb 27, 2005 at 07:34:11PM +, Szabolcs Illes wrote: Hi All, I have been using reiser4fs for 4 months without any problem since today. After tryinig to open a movie with mplayer a have this in my log: how large was the movie file? It was about 300MB. But previously I have copied a lot of relative big movies (600-700 MB). We have another similar bug report related to writing/reading of a large (20GB) tar file, so file size may be important. I think the larges file I have ever stored using reiser4 was about 3 or 4 GB. ( for example: vmware disk images, dvd isos ...) Feb 27 14:00:22 sunset kernel: reiser4[mc(9192)]: write_sd_by_inode_common (fs/reiser4/plugin/object.c:480)[nikita-2221]: Feb 27 14:00:22 sunset kernel: WARNING: Failed to save sd for 1599330: -5 Feb 27 14:00:22 sunset kernel: reiser4[mc(9192)]: create_child_common (fs/reiser4/plugin/dir/dir.c:501)[nikita-2219]: Feb 27 14:00:22 sunset kernel: WARNING: Failed to create sd for 1599330 Feb 27 14:00:23 sunset kernel: reiser4[cupsd(3296)]: parse_node40 (fs/reiser4/plugin/node/node40.c:767)[nikita-494]: Feb 27 14:00:23 sunset kernel: WARNING: Wrong level found in node: 1 != 95 Feb 27 14:00:23 sunset kernel: reiser4[cupsd(3296)]: parse_node40 (fs/reiser4/plugin/node/node40.c:767)[nikita-494]: Feb 27 14:00:23 sunset kernel: WARNING: Wrong level found in node: 1 != 95 Feb 27 14:00:24 sunset kernel: reiser4[cupsd(3296)]: parse_node40 (fs/reiser4/plugin/node/node40.c:767)[nikita-494]: Feb 27 14:00:24 sunset kernel: WARNING: Wrong level found in node: 1 != 95 Thanks,
Re: reiser4 bug
Alex Zarochentsev wrote: On Sun, Feb 27, 2005 at 07:34:11PM +, Szabolcs Illes wrote: Hi All, I have been using reiser4fs for 4 months without any problem since today. After tryinig to open a movie with mplayer a have this in my log: no other reiser4 messages in the log for 4 months? I have checked my logs. I was not able to find anything from reiserfs. But I have uploaded my syslog since feb 27 (this is when the bug occured): http://nuk.teteny.elte.hu/~selli/syslog.1.bz2 ( 1.8 compressed, 300M uncompresed ) It may has more information about the bug, you should check it. For example the very first message from reiser4 is: Feb 27 12:59:38 sunset kernel: WARNING: Wrong level found in node: 1 != 95 Feb 27 12:59:38 sunset kernel: reiser4[cupsd(3296)]: parse_node40 (fs/reiser4/plugin/node/node40.c:767)[nikita-494]: like delimitingkey mismatch reports? there was wrong bitmap checksum. actually I haven't seen such reports until now. are you sure your computer's disk/ram work well? My machine (seems to be) is fun, I have run some tests. Another info: I was using an extrenal usb2 disk (120Gb/ reiser4) when this affair occurd. After rebooting I had to repair all my reiser4 partitions. ( / hda2, /home hda5/, /mnt/Movies sda1). For the / I had to use --build-sb, then --build-fs, then --fix For /home, and the external /mnt/Movies just the --build-sb Feb 27 14:00:22 sunset kernel: reiser4[mc(9192)]: write_sd_by_inode_common (fs/reiser4/plugin/object.c:480)[nikita-2221]: Feb 27 14:00:22 sunset kernel: WARNING: Failed to save sd for 1599330: -5 Feb 27 14:00:22 sunset kernel: reiser4[mc(9192)]: create_child_common (fs/reiser4/plugin/dir/dir.c:501)[nikita-2219]: Feb 27 14:00:22 sunset kernel: WARNING: Failed to create sd for 1599330 Feb 27 14:00:23 sunset kernel: reiser4[cupsd(3296)]: parse_node40 (fs/reiser4/plugin/node/node40.c:767)[nikita-494]: Feb 27 14:00:23 sunset kernel: WARNING: Wrong level found in node: 1 != 95 Feb 27 14:00:23 sunset kernel: reiser4[cupsd(3296)]: parse_node40 (fs/reiser4/plugin/node/node40.c:767)[nikita-494]: Feb 27 14:00:23 sunset kernel: WARNING: Wrong level found in node: 1 != 95 Feb 27 14:00:24 sunset kernel: reiser4[cupsd(3296)]: parse_node40 (fs/reiser4/plugin/node/node40.c:767)[nikita-494]: Feb 27 14:00:24 sunset kernel: WARNING: Wrong level found in node: 1 != 95 .. 1000x times were there unclear reboots? I had some crash ( I think it was because of my bug ethernet driver, I have no crash since I replaced it) That shouldn't be a problem unless disk write caching is disabled. How can I chech this? Szabolcs After trying to reboot: http://nuk.teteny.elte.hu/~selli/dscf0437.jpg (using 2.6.9-cko3) http://nuk.teteny.elte.hu/~selli/dscf0438.jpg http://nuk.teteny.elte.hu/~selli/dscf0439.jpg I had to install a new root filesystem, and using that I was able to fix the filesystem. ( --build-sb, build-fs, and fix ) I have lost some file but fortunatly I was able to use my system again. The affair happend when I was using 2.6.10 with reiser4 patch(from namesys.com). I tried to reboot with my previos kernel (2.6.9-cko3) but had the some bug (see screenshots). Sorry but I forgot to save the outout of the fsck prog :(. If U need any information do not hesitate contacting me.( I am not on the mailing list) Some info in advanve: Hardware: Dell Inpiron 8600c (1600Mhz Centrion processor) Kernel config: [...]
Re: reiser4 bug
On Sun, Feb 27, 2005 at 07:34:11PM +, Szabolcs Illes wrote: Hi All, I have been using reiser4fs for 4 months without any problem since today. After tryinig to open a movie with mplayer a have this in my log: no other reiser4 messages in the log for 4 months? like delimiting keys mismatch reports? there was wrong bitmap checksum. actually I haven't seen such reports until now. are you sure your computer's disk/ram work well? Feb 27 14:00:22 sunset kernel: reiser4[mc(9192)]: write_sd_by_inode_common (fs/reiser4/plugin/object.c:480)[nikita-2221]: Feb 27 14:00:22 sunset kernel: WARNING: Failed to save sd for 1599330: -5 Feb 27 14:00:22 sunset kernel: reiser4[mc(9192)]: create_child_common (fs/reiser4/plugin/dir/dir.c:501)[nikita-2219]: Feb 27 14:00:22 sunset kernel: WARNING: Failed to create sd for 1599330 Feb 27 14:00:23 sunset kernel: reiser4[cupsd(3296)]: parse_node40 (fs/reiser4/plugin/node/node40.c:767)[nikita-494]: Feb 27 14:00:23 sunset kernel: WARNING: Wrong level found in node: 1 != 95 Feb 27 14:00:23 sunset kernel: reiser4[cupsd(3296)]: parse_node40 (fs/reiser4/plugin/node/node40.c:767)[nikita-494]: Feb 27 14:00:23 sunset kernel: WARNING: Wrong level found in node: 1 != 95 Feb 27 14:00:24 sunset kernel: reiser4[cupsd(3296)]: parse_node40 (fs/reiser4/plugin/node/node40.c:767)[nikita-494]: Feb 27 14:00:24 sunset kernel: WARNING: Wrong level found in node: 1 != 95 .. 1000x times were there unclear reboots? That shouldn't be a problem unless disk write caching is disabled. After trying to reboot: http://nuk.teteny.elte.hu/~selli/dscf0437.jpg (using 2.6.9-cko3) http://nuk.teteny.elte.hu/~selli/dscf0438.jpg http://nuk.teteny.elte.hu/~selli/dscf0439.jpg I had to install a new root filesystem, and using that I was able to fix the filesystem. ( --build-sb, build-fs, and fix ) I have lost some file but fortunatly I was able to use my system again. The affair happend when I was using 2.6.10 with reiser4 patch(from namesys.com). I tried to reboot with my previos kernel (2.6.9-cko3) but had the some bug (see screenshots). Sorry but I forgot to save the outout of the fsck prog :(. If U need any information do not hesitate contacting me.( I am not on the mailing list) Some info in advanve: Hardware: Dell Inpiron 8600c (1600Mhz Centrion processor) Kernel config: [...] -- Alex.
reiser4 bug
Hi All, I have been using reiser4fs for 4 months without any problem since today. After tryinig to open a movie with mplayer a have this in my log: Feb 27 14:00:22 sunset kernel: reiser4[mc(9192)]: write_sd_by_inode_common (fs/reiser4/plugin/object.c:480)[nikita-2221]: Feb 27 14:00:22 sunset kernel: WARNING: Failed to save sd for 1599330: -5 Feb 27 14:00:22 sunset kernel: reiser4[mc(9192)]: create_child_common (fs/reiser4/plugin/dir/dir.c:501)[nikita-2219]: Feb 27 14:00:22 sunset kernel: WARNING: Failed to create sd for 1599330 Feb 27 14:00:23 sunset kernel: reiser4[cupsd(3296)]: parse_node40 (fs/reiser4/plugin/node/node40.c:767)[nikita-494]: Feb 27 14:00:23 sunset kernel: WARNING: Wrong level found in node: 1 != 95 Feb 27 14:00:23 sunset kernel: reiser4[cupsd(3296)]: parse_node40 (fs/reiser4/plugin/node/node40.c:767)[nikita-494]: Feb 27 14:00:23 sunset kernel: WARNING: Wrong level found in node: 1 != 95 Feb 27 14:00:24 sunset kernel: reiser4[cupsd(3296)]: parse_node40 (fs/reiser4/plugin/node/node40.c:767)[nikita-494]: Feb 27 14:00:24 sunset kernel: WARNING: Wrong level found in node: 1 != 95 .. 1000x times After trying to reboot: http://nuk.teteny.elte.hu/~selli/dscf0437.jpg (using 2.6.9-cko3) http://nuk.teteny.elte.hu/~selli/dscf0438.jpg http://nuk.teteny.elte.hu/~selli/dscf0439.jpg I had to install a new root filesystem, and using that I was able to fix the filesystem. ( --build-sb, build-fs, and fix ) I have lost some file but fortunatly I was able to use my system again. The affair happend when I was using 2.6.10 with reiser4 patch(from namesys.com). I tried to reboot with my previos kernel (2.6.9-cko3) but had the some bug (see screenshots). Sorry but I forgot to save the outout of the fsck prog :(. If U need any information do not hesitate contacting me.( I am not on the mailing list) Some info in advanve: Hardware: Dell Inpiron 8600c (1600Mhz Centrion processor) Kernel config: # # Automatically generated make config: don't edit # Linux kernel version: 2.6.10 # Mon Jan 24 08:26:43 2005 # CONFIG_X86=y CONFIG_MMU=y CONFIG_UID16=y CONFIG_GENERIC_ISA_DMA=y CONFIG_GENERIC_IOMAP=y # # Code maturity level options # CONFIG_EXPERIMENTAL=y # CONFIG_CLEAN_COMPILE is not set CONFIG_BROKEN=y CONFIG_BROKEN_ON_SMP=y CONFIG_LOCK_KERNEL=y # # General setup # CONFIG_LOCALVERSION= CONFIG_SWAP=y CONFIG_SYSVIPC=y # CONFIG_POSIX_MQUEUE is not set # CONFIG_BSD_PROCESS_ACCT is not set CONFIG_SYSCTL=y CONFIG_AUDIT=y CONFIG_AUDITSYSCALL=y CONFIG_LOG_BUF_SHIFT=14 CONFIG_HOTPLUG=y CONFIG_KOBJECT_UEVENT=y # CONFIG_IKCONFIG is not set # CONFIG_EMBEDDED is not set CONFIG_KALLSYMS=y # CONFIG_KALLSYMS_ALL is not set # CONFIG_KALLSYMS_EXTRA_PASS is not set CONFIG_FUTEX=y CONFIG_EPOLL=y # CONFIG_CC_OPTIMIZE_FOR_SIZE is not set CONFIG_SHMEM=y CONFIG_CC_ALIGN_FUNCTIONS=0 CONFIG_CC_ALIGN_LABELS=0 CONFIG_CC_ALIGN_LOOPS=0 CONFIG_CC_ALIGN_JUMPS=0 # CONFIG_TINY_SHMEM is not set # # Loadable module support # CONFIG_MODULES=y CONFIG_MODULE_UNLOAD=y # CONFIG_MODULE_FORCE_UNLOAD is not set CONFIG_OBSOLETE_MODPARM=y # CONFIG_MODVERSIONS is not set # CONFIG_MODULE_SRCVERSION_ALL is not set CONFIG_KMOD=y # # Processor type and features # CONFIG_X86_PC=y # CONFIG_X86_ELAN is not set # CONFIG_X86_VOYAGER is not set # CONFIG_X86_NUMAQ is not set # CONFIG_X86_SUMMIT is not set # CONFIG_X86_BIGSMP is not set # CONFIG_X86_VISWS is not set # CONFIG_X86_GENERICARCH is not set # CONFIG_X86_ES7000 is not set # CONFIG_M386 is not set # CONFIG_M486 is not set # CONFIG_M586 is not set # CONFIG_M586TSC is not set # CONFIG_M586MMX is not set # CONFIG_M686 is not set # CONFIG_MPENTIUMII is not set # CONFIG_MPENTIUMIII is not set CONFIG_MPENTIUMM=y # CONFIG_MPENTIUM4 is not set # CONFIG_MK6 is not set # CONFIG_MK7 is not set # CONFIG_MK8 is not set # CONFIG_MCRUSOE is not set # CONFIG_MEFFICEON is not set # CONFIG_MWINCHIPC6 is not set # CONFIG_MWINCHIP2 is not set # CONFIG_MWINCHIP3D is not set # CONFIG_MCYRIXIII is not set # CONFIG_MVIAC3_2 is not set # CONFIG_X86_GENERIC is not set CONFIG_X86_CMPXCHG=y CONFIG_X86_XADD=y CONFIG_X86_L1_CACHE_SHIFT=6 CONFIG_RWSEM_XCHGADD_ALGORITHM=y CONFIG_X86_WP_WORKS_OK=y CONFIG_X86_INVLPG=y CONFIG_X86_BSWAP=y CONFIG_X86_POPAD_OK=y CONFIG_X86_GOOD_APIC=y CONFIG_X86_INTEL_USERCOPY=y CONFIG_X86_USE_PPRO_CHECKSUM=y CONFIG_HPET_TIMER=y CONFIG_HPET_EMULATE_RTC=y # CONFIG_SMP is not set CONFIG_PREEMPT=y # CONFIG_X86_UP_APIC is not set CONFIG_X86_TSC=y CONFIG_X86_MCE=y CONFIG_X86_MCE_NONFATAL=y # CONFIG_TOSHIBA is not set CONFIG_I8K=m # CONFIG_MICROCODE is not set # CONFIG_X86_MSR is not set # CONFIG_X86_CPUID is not set # # Firmware Drivers # # CONFIG_EDD is not set CONFIG_NOHIGHMEM=y # CONFIG_HIGHMEM4G is not set # CONFIG_HIGHMEM64G is not set # CONFIG_MATH_EMULATION is not set CONFIG_MTRR=y # CONFIG_EFI is not set CONFIG_HAVE_DEC_LOCK=y # CONFIG_REGPARM is not set # # Power management options (ACPI, APM) # CONFIG_PM=y # CONFIG_PM_DEBUG is not set # CONFIG_SOFTWARE_SUSPEND is not set # # ACPI (Advanced
reiserfs bug? ALSO Reiser4 bug!?
Hi I set up a symlink /var/log - /boot/log where /boot is a reiserfs system. That was because Reiser4 lost log data when we were chasing the last bug with VS. Now the partition grew full and it was shown by my computer completely locking up. Out of the blue. At least that's what I assume caused the lock-up and the following. On a reboot I got a trace on the screen from dmesg dying. Unfortunately I can't use netconsole for some reason, but it was there. Sometimes it boots but screams a lot that the logfiles are not found, and for some reason /boot never mounted. That's mysterious because /boot must be mounted for the kernel to be loaded, I don't have a copy in the mount point directory, but dmesg says nothing about anything but Reiser4 being mounted on / I don't frankly have the time now to reproduce this, it is somewhat sporadic and I tried it too many times already. The system is a Debian Sid (Unstable) and the command was dmesg -s 524288 /var/log/dmesg on a full reiserfs /var/log - /boot/log partition/directory. Now that I got my system up, it did not mount /boot or /home/mjt, only / and it also lost the host name. (none):/etc# is my prompt. The filesystem is 128480 blocks with just as many used. Hope this helps in any way! PS. Now that I kicked in a reboot, I'm getting: reiser4[run-parts(2707)]: traverse_tree (fs/reiser4/search.c:755)[nikita-1481]: WARNING: Too many iterations: 1048576 Where the iteration count varies. Warnings are usually not lethal, right? But this seems to end never. Time for the sync-umount-reboot sysrq magic.. Thanks! -- mjt