NFS oops
Hello.

I know that NFS is undergoing continuing changes and updates, and I don't know whether the oops I'm getting is already on someone's todo list.

I run a database archive, then push it to a NAS box via NFS. It looks like most of the data arrives, but then the file transfer never completes.

It initially ends like this:

nfs: server 192.168.20.114 not responding, still trying

And then becomes this:

[ 5801.324005] INFO: task cp:5368 blocked for more than 1208 seconds.
[ 5801.324011]       Tainted: G I 5.2.0-rc6 #1
[ 5801.324013] "echo 0 > /proc/sys/kernel/hung_task_timeout_secs" disables this message.
[ 5801.324015] cp D 0 5368 5095 0x4000
[ 5801.324019] Call Trace:
[ 5801.324030]  ? __schedule+0x284/0x660
[ 5801.324033]  schedule+0x29/0x90
[ 5801.324037]  io_schedule+0x12/0x40
[ 5801.324041]  wait_on_page_bit+0x10e/0x1c0
[ 5801.324044]  ? file_fdatawait_range+0x20/0x20
[ 5801.324047]  __filemap_fdatawait_range+0x8b/0xf0
[ 5801.324050]  filemap_write_and_wait+0x42/0x70
[ 5801.324072]  nfs_wb_all+0x1a/0x120 [nfs]
[ 5801.324076]  filp_close+0x2a/0x70
[ 5801.324078]  __x64_sys_close+0x1e/0x50
[ 5801.324081]  do_syscall_64+0x4f/0x130
[ 5801.324085]  entry_SYSCALL_64_after_hwframe+0x44/0xa9
[ 5801.324088] RIP: 0033:0x7efcd8e908d4
[ 5801.324094] Code: Bad RIP value.
[ 5801.324095] RSP: 002b:7ffdfc946108 EFLAGS: 0246 ORIG_RAX: 0003
[ 5801.324097] RAX: ffda RBX: RCX: 7efcd8e908d4
[ 5801.324098] RDX: 0002 RSI: 7efcd99ac000 RDI: 0004
[ 5801.324100] RBP: 7ffdfc9464e0 R08: 0002 R09: 0001
[ 5801.324101] R10: 0002 R11: 0246 R12: 7ffdfc9466b0
[ 5801.324102] R13: R14: 7ffdfc946600 R15: 7ffdfc9482df

...and the oops repeats.

I'm running kernel

Linux freedom 5.2.0-rc6 #1 SMP Sun Jun 23 18:31:56 MDT 2019 x86_64 x86_64 x86_64 GNU/Linux

If you need more info, please reply to me directly, as I'm not on the list.

Thanks,
Bob
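[Editor's sketch] When triaging a report like the one above, it can help to pull the task name, PID, blocked time, and call-trace frames out of the log mechanically. The following is a minimal sketch, not a kernel tool: the function name `parse_hung_task` and the regular expressions are assumptions written to fit the lines quoted in this message, and other kernel versions may format these lines differently.

```python
import re

# Matches the hung-task header line as it appears in the report above.
HUNG_RE = re.compile(
    r"INFO: task (?P<comm>\S+):(?P<pid>\d+) blocked for more than (?P<secs>\d+) seconds"
)

# Matches one frame: optional "? " (unreliable frame), symbol+0xoffset/0xsize,
# and an optional trailing "[module]".
FRAME_RE = re.compile(
    r"(?P<unreliable>\? )?(?P<sym>[A-Za-z_][\w.]*)\+0x(?P<off>[0-9a-f]+)/0x(?P<size>[0-9a-f]+)"
    r"(?: \[(?P<mod>\w+)\])?"
)


def parse_hung_task(log: str) -> dict:
    """Extract task info and call-trace frames from a hung-task report."""
    header = HUNG_RE.search(log)
    if header is None:
        raise ValueError("no hung-task report found")
    frames = [
        {
            "symbol": f["sym"],
            "module": f["mod"],                   # None for core-kernel symbols
            "reliable": f["unreliable"] is None,  # "? " marks a guessed frame
        }
        for f in FRAME_RE.finditer(log)
    ]
    return {
        "comm": header["comm"],
        "pid": int(header["pid"]),
        "seconds": int(header["secs"]),
        "frames": frames,
    }


# A few lines copied from the report above.
SAMPLE = """\
[ 5801.324005] INFO: task cp:5368 blocked for more than 1208 seconds.
[ 5801.324030]  ? __schedule+0x284/0x660
[ 5801.324033]  schedule+0x29/0x90
[ 5801.324072]  nfs_wb_all+0x1a/0x120 [nfs]
[ 5801.324076]  filp_close+0x2a/0x70
"""

report = parse_hung_task(SAMPLE)
module_frames = [f["symbol"] for f in report["frames"] if f["module"]]
```

Here `module_frames` collects the frames that live in a loadable module, which in this trace points at `nfs_wb_all` in the `nfs` module: `cp` is waiting in `close()` for dirty pages to be written back to a server that has stopped responding.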
Re: NFS Oops 2.6.20.14
On 06/18/2007 04:05 PM, Aaron Porter wrote:

> Reproducible, every time nfs-kernel-server exits:
>
> nfsd: unexporting all filesystems
> BUG: unable to handle kernel paging request at virtual address 6b6b6b6f
>  printing eip:
> f92a7751
> *pde = 6b6b6b6b
> Oops: [#1]
> PREEMPT SMP
> Modules linked in: xt_tcpudp iptable_nat nf_nat nf_conntrack_ipv4
> nf_conntrack nfnetlink ip_tables x_tables nfs nfsd exportfs lockd sunrpc
> capability commoncap ipv6 ohci_hcd usbcore amd_rng rng_core i2c_amd8111
> generic amd74xx shpchp pci_hotplug ide_scsi w83627hf eeprom lm85 hwmon_vid
> i2c_isa i2c_amd756 i2c_core rtc unix
> CPU:    0
> EIP:    0060:[<f92a7751>]    Not tainted VLI
> EFLAGS: 00010202   (2.6.20.14-20070612 #2)
> EIP is at cache_clean+0x101/0x1e6 [sunrpc]
> eax: ebx: 6b6b6b6b ecx: 0001 edx: f92eb940
> esi: dfd9e800 edi: ebp: efc7e000 esp: f56bbf84
> ds: 007b es: 007b ss: 0068
> Process nfsd (pid: 2856, ti=f56ba000 task=c74daaa0 task.ti=f56ba000)
> Stack: f92eb940 f70ddd50 f92a7880 f92a78aa f70ddd24 f92cae61 f92a1b8c
>        f92a1fe1 0009 0009 f92c36b2 f92dd07d efc7e000
>        feff fef8 f92c3448
>
> Call Trace:
>  [<f92a7880>] cache_flush+0xc/0x1f [sunrpc]
>  [<f92a78aa>] cache_purge+0x17/0x20 [sunrpc]
>  [<f92cae61>] nfsd_export_flush+0x14/0x28 [nfsd]
>  [<f92a1b8c>] svc_destroy+0x96/0x100 [sunrpc]
>  [<f92a1fe1>] svc_exit_thread+0x69/0x7d [sunrpc]
>  [<f92c36b2>] nfsd+0x26a/0x27b [nfsd]
>  [<f92c3448>] nfsd+0x0/0x27b [nfsd]
>  [<c010364b>] kernel_thread_helper+0x7/0x10

  25:   8b 15 60 ea 2b f9       mov    0xf92bea60,%edx
   0:   8b 43 04                mov    0x4(%ebx),%eax     <=
   3:   39 42 54                cmp    %eax,0x54(%edx)
   6:   7e 04                   jle    c <_EIP+0xc>
   8:   40                      inc    %eax
   9:   89 42 54                mov    %eax,0x54(%edx)

net/sunrpc/cache.c::cache_clean():

        cp = & current_detail->hash_table[current_index];
        ch = *cp;
        for (; ch; cp= & ch->next, ch= *cp) {

=> ch has been freed

==>             if (current_detail->nextcheck > ch->expiry_time)
                        current_detail->nextcheck = ch->expiry_time+1;

-
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to [EMAIL PROTECTED]
More majordomo info at http://vger.kernel.org/majordomo-info.html
Please read the FAQ at http://www.tux.org/lkml/
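[Editor's sketch] The "ch has been freed" conclusion follows from the numbers in the oops: `ebx` holds 6b6b6b6b, the faulting instruction reads `0x4(%ebx)`, and the fault address is 6b6b6b6f. The byte 0x6b is the value the kernel's slab debugging writes over freed objects (`POISON_FREE` in include/linux/poison.h), so a pointer made of repeated 0x6b bytes was most likely loaded from freed memory. The check below is only an illustration of that arithmetic; the helper name `poison_offset` and the 4096-byte offset limit are assumptions, and the check is meaningful only when slab poisoning is enabled in the kernel that produced the oops.

```python
POISON_FREE = 0x6B  # slab poison byte for freed objects (include/linux/poison.h)


def poison_offset(addr, width_bytes=4, max_offset=4096):
    """Return the field offset if addr looks like a poisoned pointer plus a
    small structure offset, otherwise None.

    width_bytes is the pointer size (4 on the 32-bit x86 kernel in this
    thread); max_offset is an arbitrary cut-off chosen for this sketch.
    """
    poisoned_pointer = int.from_bytes(bytes([POISON_FREE]) * width_bytes, "big")
    offset = addr - poisoned_pointer
    return offset if 0 <= offset < max_offset else None


FAULT_ADDR = 0x6B6B6B6F  # "virtual address 6b6b6b6f" from the oops
EIP_ADDR = 0xF92A7751    # the faulting EIP, an ordinary module address

fault_offset = poison_offset(FAULT_ADDR)
eip_offset = poison_offset(EIP_ADDR)
```

For the fault address the offset is 4, which agrees with the `0x4(%ebx)` displacement in the disassembly; the EIP value is not poison-like and yields `None`.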
Re: NFS Oops 2.6.20.14
On Mon, 18 Jun 2007 13:05:07 -0700 Aaron Porter wrote:

>
> Reproducible, every time nfs-kernel-server exits:

Kernel .config file, please...

> nfsd: unexporting all filesystems
> BUG: unable to handle kernel paging request at virtual address 6b6b6b6f
>  printing eip:
> f92a7751
> *pde = 6b6b6b6b
> Oops: [#1]
> PREEMPT SMP
> Modules linked in: xt_tcpudp iptable_nat nf_nat nf_conntrack_ipv4
> nf_conntrack nfnetlink ip_tables x_tables nfs nfsd exportfs lockd sunrpc
> capability commoncap ipv6 ohci_hcd usbcore amd_rng rng_core i2c_amd8111
> generic amd74xx shpchp pci_hotplug ide_scsi w83627hf eeprom lm85 hwmon_vid
> i2c_isa i2c_amd756 i2c_core rtc unix
> CPU:    0
> EIP:    0060:[<f92a7751>]    Not tainted VLI
> EFLAGS: 00010202   (2.6.20.14-20070612 #2)
> EIP is at cache_clean+0x101/0x1e6 [sunrpc]
> eax: ebx: 6b6b6b6b ecx: 0001 edx: f92eb940
> esi: dfd9e800 edi: ebp: efc7e000 esp: f56bbf84
> ds: 007b es: 007b ss: 0068
> Process nfsd (pid: 2856, ti=f56ba000 task=c74daaa0 task.ti=f56ba000)
> Stack: f92eb940 f70ddd50 f92a7880 f92a78aa f70ddd24 f92cae61 f92a1b8c
>        f92a1fe1 0009 0009 f92c36b2 f92dd07d efc7e000
>        feff fef8 f92c3448
>
> Call Trace:
>  [<f92a7880>] cache_flush+0xc/0x1f [sunrpc]
>  [<f92a78aa>] cache_purge+0x17/0x20 [sunrpc]
>  [<f92cae61>] nfsd_export_flush+0x14/0x28 [nfsd]
>  [<f92a1b8c>] svc_destroy+0x96/0x100 [sunrpc]
>  [<f92a1fe1>] svc_exit_thread+0x69/0x7d [sunrpc]
>  [<f92c36b2>] nfsd+0x26a/0x27b [nfsd]
>  [<f92c3448>] nfsd+0x0/0x27b [nfsd]
>  [<c010364b>] kernel_thread_helper+0x7/0x10
>  ===
> Code: f9 0f 8d f9 00 00 00 8d 42 0c e8 dd 26 11 c7 a1 60 ea 2b f9 8b 50 08 a1
> 64 ea 2b f9 8d 34 82 8b 1e 85 db 74 6d 8b 15 60 ea 2b f9 <8b> 43 04 39 42 54
> 7e 04 40 89 42 54 8b 43 04 3b 05 00 f6 52 c0
> EIP: [<f92a7751>] cache_clean+0x101/0x1e6 [sunrpc] SS:ESP 0068:f56bbf84
>  <6>note: nfsd[2856] exited with preempt_count 2

---
~Randy
*** Remember to use Documentation/SubmitChecklist when testing your code ***
NFS Oops 2.6.20.14
Reproducible, every time nfs-kernel-server exits:

nfsd: unexporting all filesystems
BUG: unable to handle kernel paging request at virtual address 6b6b6b6f
 printing eip:
f92a7751
*pde = 6b6b6b6b
Oops: [#1]
PREEMPT SMP
Modules linked in: xt_tcpudp iptable_nat nf_nat nf_conntrack_ipv4
nf_conntrack nfnetlink ip_tables x_tables nfs nfsd exportfs lockd sunrpc
capability commoncap ipv6 ohci_hcd usbcore amd_rng rng_core i2c_amd8111
generic amd74xx shpchp pci_hotplug ide_scsi w83627hf eeprom lm85 hwmon_vid
i2c_isa i2c_amd756 i2c_core rtc unix
CPU:    0
EIP:    0060:[<f92a7751>]    Not tainted VLI
EFLAGS: 00010202   (2.6.20.14-20070612 #2)
EIP is at cache_clean+0x101/0x1e6 [sunrpc]
eax: ebx: 6b6b6b6b ecx: 0001 edx: f92eb940
esi: dfd9e800 edi: ebp: efc7e000 esp: f56bbf84
ds: 007b es: 007b ss: 0068
Process nfsd (pid: 2856, ti=f56ba000 task=c74daaa0 task.ti=f56ba000)
Stack: f92eb940 f70ddd50 f92a7880 f92a78aa f70ddd24 f92cae61 f92a1b8c
       f92a1fe1 0009 0009 f92c36b2 f92dd07d efc7e000
       feff fef8 f92c3448
Call Trace:
 [<f92a7880>] cache_flush+0xc/0x1f [sunrpc]
 [<f92a78aa>] cache_purge+0x17/0x20 [sunrpc]
 [<f92cae61>] nfsd_export_flush+0x14/0x28 [nfsd]
 [<f92a1b8c>] svc_destroy+0x96/0x100 [sunrpc]
 [<f92a1fe1>] svc_exit_thread+0x69/0x7d [sunrpc]
 [<f92c36b2>] nfsd+0x26a/0x27b [nfsd]
 [<f92c3448>] nfsd+0x0/0x27b [nfsd]
 [<c010364b>] kernel_thread_helper+0x7/0x10
 ===
Code: f9 0f 8d f9 00 00 00 8d 42 0c e8 dd 26 11 c7 a1 60 ea 2b f9 8b 50 08 a1
64 ea 2b f9 8d 34 82 8b 1e 85 db 74 6d 8b 15 60 ea 2b f9 <8b> 43 04 39 42 54
7e 04 40 89 42 54 8b 43 04 3b 05 00 f6 52 c0
EIP: [<f92a7751>] cache_clean+0x101/0x1e6 [sunrpc] SS:ESP 0068:f56bbf84
 <6>note: nfsd[2856] exited with preempt_count 2

[Attachment: oops-config.gz, binary data]
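[Editor's sketch] On the `Code:` line of an x86 oops, the byte wrapped in angle brackets is the first byte of the instruction at the faulting EIP; the bytes before it are earlier instructions and the bytes after it are what would have run next. A small helper can locate that byte so the dump can be split before being fed to a disassembler. This is an illustrative sketch; `faulting_byte` is a hypothetical name, and the sample line is the `Code:` line from this report as quoted in the replies.

```python
def faulting_byte(code_line):
    """Return (index, value) of the <xx>-marked byte on an oops 'Code:' line,
    or None if no byte is marked."""
    hex_tokens = code_line.split(":", 1)[1].split()
    for index, token in enumerate(hex_tokens):
        if token.startswith("<") and token.endswith(">"):
            return index, int(token[1:-1], 16)
    return None


CODE_LINE = (
    "Code: f9 0f 8d f9 00 00 00 8d 42 0c e8 dd 26 11 c7 a1 60 ea 2b f9 "
    "8b 50 08 a1 64 ea 2b f9 8d 34 82 8b 1e 85 db 74 6d 8b 15 60 ea 2b f9 "
    "<8b> 43 04 39 42 54 7e 04 40 89 42 54 8b 43 04 3b 05 00 f6 52 c0"
)

marked = faulting_byte(CODE_LINE)
```

The marked byte here is 0x8b, the start of the bytes `8b 43 04`, which is the `mov 0x4(%ebx),%eax` instruction flagged in the disassembly posted in reply.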
NFS oops with 2.6.12-rc2-mm3
Hello,

Upon unmounting some NFSv3 shares, the following oops happened, and I could not remount any NFS shares after this:

lockd: cannot unmonitor 192.168.10.10
Unable to handle kernel NULL pointer dereference at virtual address
 printing eip:
*pde =
Oops: [#1]
PREEMPT
Modules linked in: pdc202xx_old snd_ens1371 snd_rawmidi snd_ac97_codec
usbmouse usbkbd uhci_hcd via82cxxx via_agp agpgart usbcore autofs4
snd_pcm_oss snd_pcm snd_timer snd_page_alloc snd_mixer_oss snd soundcore
ide_cd cdrom ide_disk ide_core
CPU:    0
EIP:    0060:[<>]    Not tainted VLI
EFLAGS: 00010286   (2.6.12-rc2-mm3)
EIP is at 0x0
eax: c1617ec0 ebx: c1617ec0 ecx: edx: c1771a50
esi: edi: c1617f34 ebp: df0fee40 esp: dec6ff2c
ds: 007b es: 007b ss: 0068
Process rpciod/0 (pid: 8166, threadinfo=dec6f000 task=c1771a50)
Stack: c03976db c1771a50 c1771b78 0296 c1617f3c 0293 c1617f40
       c012a0ae dffc6550 dec6f000 df0fee58 df0fee48 df0fee50
       dec6f000 c1617ec0 c0397810 dec6f000 0001 c0116ac0
Call Trace:
 [<c03976db>] __rpc_execute+0x14b/0x250
 [<c012a0ae>] worker_thread+0x1ae/0x280
 [<c0397810>] rpc_async_schedule+0x0/0x10
 [<c0116ac0>] default_wake_function+0x0/0x10
 [<c0116b07>] __wake_up_common+0x37/0x60
 [<c0116ac0>] default_wake_function+0x0/0x10
 [<c0129f00>] worker_thread+0x0/0x280
 [<c012e3a5>] kthread+0x95/0xd0
 [<c012e310>] kthread+0x0/0xd0
 [<c010132d>] kernel_thread_helper+0x5/0x18
Code: Bad EIP value.

I rarely use -mm kernels, so I don't have much experience with them yet. Some more details are here:

http://nerdbynature.de/bits/prinz/2.6.12-rc2-mm3/

Thank you,
Christian.
--
BOFH excuse #67: descramble code needed from software company
[PATCH 2.4.5] Fix NFS Oops w.r.t. unhashed inodes
Hi Linus,

One consequence of the removal of the 'put_inode: force_delete' made for 2.4.5 mmap() is that we now use the 'magic nfs' codepath in iput(). The result is that when we unhash inodes due to staleness in nfs_revalidate_inode(), we now end up calling clear_inode() in iput() without first calling truncate_inode_pages(), and thus trigger the BUG() on line 486 in fs/inode.c.

After discussion with Al, I think the minimal solution would be to add a call to truncate_inode_pages() to the magic nfs code. To do the call in nfs_revalidate_inode() itself (Al's suggestion) would be racy w.r.t. adding pages in read or write itself.

A second consequence of the removal of force_delete was the fact that file or directory deletion no longer results in the inode getting thrown out of the icache upon last iput(). This gave problems due to inode number reuse on the userland nfsd. The solution is to update i_nlink when we delete or rmdir.

Cheers,
  Trond

--- linux-2.4.5-pre6/fs/inode.c.orig	Fri May 25 14:15:38 2001
+++ linux-2.4.5-pre6/fs/inode.c	Wed May 30 12:17:29 2001
@@ -1044,6 +1044,8 @@
 				inode->i_state|=I_FREEING;
 				inodes_stat.nr_inodes--;
 				spin_unlock(&inode_lock);
+				if (inode->i_data.nrpages)
+					truncate_inode_pages(&inode->i_data, 0);
 				clear_inode(inode);
 			}
 		}
--- linux-2.4.5-pre6/fs/nfs/dir.c.orig	Fri May 25 14:15:38 2001
+++ linux-2.4.5-pre6/fs/nfs/dir.c	Thu May 31 14:53:32 2001
@@ -753,6 +753,8 @@
 	nfs_zap_caches(dir);
 	error = NFS_PROTO(dir)->rmdir(dir, &dentry->d_name);
+	if (!error)
+		dentry->d_inode->i_nlink = 0;

 	return error;
 }
@@ -870,6 +872,8 @@
 	error = NFS_PROTO(dir)->remove(dir, &dentry->d_name);
 	if (error < 0)
 		goto out;
+	if (inode)
+		inode->i_nlink--;

  out_delete:
 	/*
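[Editor's sketch] The ordering problem described in this patch can be illustrated with a toy model. This is not kernel code: the Python classes and functions below only borrow the kernel names, `BugError` stands in for BUG(), and the `patched` flag is a device invented for this sketch to show the before and after behaviour of the first hunk (truncate the page cache before clear_inode() on the final iput() of an unhashed inode).

```python
class BugError(RuntimeError):
    """Stand-in for the kernel's BUG() in this toy model."""


class Inode:
    def __init__(self, nrpages=0):
        self.nrpages = nrpages  # pages still held in the inode's page cache
        self.freeing = False


def truncate_inode_pages(inode):
    """Toy version: drop every cached page."""
    inode.nrpages = 0


def clear_inode(inode):
    """Toy version of the check described in the mail: an inode must not
    reach clear_inode() while it still has cached pages."""
    if inode.nrpages:
        raise BugError("clear_inode() called with pages still cached")


def final_iput_unhashed(inode, patched):
    """Model of the last iput() on an unhashed inode.

    patched=False mimics the path the mail complains about,
    patched=True mimics the path after the fs/inode.c hunk.
    """
    inode.freeing = True
    if patched and inode.nrpages:
        truncate_inode_pages(inode)
    clear_inode(inode)


def hits_bug(patched):
    """Return True if a stale inode with cached pages trips the check."""
    stale = Inode(nrpages=3)
    try:
        final_iput_unhashed(stale, patched)
    except BugError:
        return True
    return False
```

In this model `hits_bug(False)` reports the failure and `hits_bug(True)` does not, which is the whole effect of the two added lines in the first hunk.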
Re: [NFS] Oops in 2.4.2 - please give advice
On Mon, 12 Mar 2001 18:36:33 +0100,
Jean-Eric Cuendet <[EMAIL PROTECTED]> wrote:
>Warning (compare_maps): mismatch on symbol partition_name , ksyms_base says
>c02086c0, System.map says c014f200. Ignoring ksyms_base entry

That message is suspicious. It looks like you have md built into the kernel but some module has redefined partition_name. Since the only code that defines partition_name in 2.4.2 is drivers/md/md.c, it looks like you managed to build md.o into the kernel and as a module. Please double check your config and modules_install.
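[Editor's sketch] The compare_maps warning amounts to a comparison of two symbol tables: the running kernel's exported symbols and the System.map on disk. The sketch below shows the idea only; it is not how ksymoops is implemented. The input formats are assumptions ("address name" lines for the ksyms-style table, "address type name" lines for System.map), the function names are hypothetical, and the `sys_read` entry and its address are invented filler so that the sample has one symbol that agrees. The `partition_name` addresses are the ones from the warning.

```python
def parse_symbols(text):
    """Parse symbol lines into {name: address}.

    Accepts 'address type name' (System.map style, single-letter type) and
    'address name [module]' (assumed ksyms style).
    """
    symbols = {}
    for line in text.splitlines():
        parts = line.split()
        if len(parts) < 2:
            continue
        has_type_column = len(parts) >= 3 and len(parts[1]) == 1
        name = parts[2] if has_type_column else parts[1]
        symbols[name] = int(parts[0], 16)
    return symbols


def mismatches(ksyms, system_map):
    """Return {name: (ksyms_addr, map_addr)} for symbols present in both
    tables with different addresses."""
    return {
        name: (ksyms[name], system_map[name])
        for name in sorted(ksyms.keys() & system_map.keys())
        if ksyms[name] != system_map[name]
    }


KSYMS_SAMPLE = """\
c02086c0 partition_name
c012a2fc sys_read
"""

SYSTEM_MAP_SAMPLE = """\
c014f200 T partition_name
c012a2fc T sys_read
"""

bad = mismatches(parse_symbols(KSYMS_SAMPLE), parse_symbols(SYSTEM_MAP_SAMPLE))
```

Only `partition_name` is reported, with the two addresses from the warning. A symbol that resolves to two different addresses is consistent with the explanation above: the same object built into the kernel image and also loaded as a module.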
[NFS] Oops in 2.4.2 - please give advice
Hi,

I have this Oops with the following config:
- Stock 2.4.2 (no patch)
- I have LVM volumes (0.9.1b5)
- I have Raid0 + Raid5 volumes (v0.90)
- Filesystems are ext2
- The machine is a PII 733 with SCSI and IDE disks.
- The chipset is a VIA but DMA is disabled (for IDE disks).

Thanks for any help
-jec

ksymoops 2.4.0 on i686 2.4.2-lb5-n3-6. Options used
     -V (default)
     -k /proc/ksyms (default)
     -l /proc/modules (default)
     -o /lib/modules/2.4.2-lb5-n3-6/ (default)
     -m /boot/System.map-2.4.2-lb5-n3-6 (specified)

Warning (compare_maps): mismatch on symbol partition_name , ksyms_base says c02086c0, System.map says c014f200. Ignoring ksyms_base entry
Mar 12 10:00:02 fatboy kernel: Unable to handle kernel paging request at virtual address 00b50008
Mar 12 10:00:02 fatboy kernel: c0124407
Mar 12 10:00:02 fatboy kernel: *pde =
Mar 12 10:00:02 fatboy kernel: Oops:
Mar 12 10:00:02 fatboy kernel: CPU:    0
Mar 12 10:00:02 fatboy kernel: EIP:    0010:[generic_file_readahead+415/708]
Mar 12 10:00:02 fatboy kernel: EFLAGS: 00010206
Mar 12 10:00:02 fatboy kernel: eax: c7fa ebx: 00b5 ecx: 000f edx: c7fb25e0
Mar 12 10:00:02 fatboy kernel: esi: 022b edi: c2f9e8a0 ebp: c200bea4 esp: c377fef8
Mar 12 10:00:02 fatboy kernel: ds: 0018 es: 0018 ss: 0018
Mar 12 10:00:02 fatboy kernel: Process tar (pid: 20695, stackpage=c377f000)
Mar 12 10:00:02 fatboy kernel: Stack: c10f36a8 0001 c2f9e8a0 0600 c119c9e8 0124 c7fb25e0 001f
Mar 12 10:00:02 fatboy kernel:        0111 0013 0020 00f2 022b c0124753 0001 c2f9e8a0
Mar 12 10:00:02 fatboy kernel:        c200be00 c10f36a8 2800 0806d9c0 baac c10f36a8
Mar 12 10:00:02 fatboy kernel: Call Trace: [do_generic_file_read+551/1412] [generic_file_read+99/128] [file_read_actor+0/84] [sys_read+142/196] [system_call+51/56]
Mar 12 10:00:02 fatboy kernel: Code: 39 6b 08 75 f4 8b 74 24 14 39 73 0c 75 eb 53 e8 c9 4e 00 00
Using defaults from ksymoops -t elf32-i386 -a i386

Code;  Before first symbol
 <_EIP>:
Code;  Before first symbol
   0:   39 6b 08          cmp    %ebp,0x8(%ebx)
Code;  0003 Before first symbol
   3:   75 f4             jne    fff9 <_EIP+0xfff9> fff9 <END_OF_CODE+37764c3a/???>
Code;  0005 Before first symbol
   5:   8b 74 24 14       mov    0x14(%esp,1),%esi
Code;  0009 Before first symbol
   9:   39 73 0c          cmp    %esi,0xc(%ebx)
Code;  000c Before first symbol
   c:   75 eb             jne    fff9 <_EIP+0xfff9> fff9 <END_OF_CODE+37764c3a/???>
Code;  000e Before first symbol
   e:   53                push   %ebx
Code;  000f Before first symbol
   f:   e8 c9 4e 00 00    call   4edd <_EIP+0x4edd> 4edd Before first symbol

1 warning issued.  Results may not be reliable.

Jean-Eric Cuendet
Linkvest SA
Av des Baumettes 19, 1020 Renens Switzerland
Tel +41 21 632 9043  Fax +41 21 632 9090
http://www.linkvest.com  E-mail: [EMAIL PROTECTED]

NFS maillist - [EMAIL PROTECTED]
http://lists.sourceforge.net/lists/listinfo/nfs
NFS oops with 2.3.99-pre3
While copying a file to an NFS filesystem, cp got stuck and I later found an oops on the console.

These are mounted:

pc-sw8:(pid282) on /net type auto (intr,rw,port=1023,timeo=8,retrans=110,indirect,map=/etc/amd/amd.net)
inxs4:/export/home on /amd/inxs4/root/export/home type nfs (rw)

This seems to be the hung process. There is no PID 6569, and there were no messages about PID 6571. Anyway, cp is stuck in __down().

  F STAT   PID  PPID %CPU PRI WCHAN  WCHAN  COMMAND
000 D     6571     1  0.0  39 down   107ab0 cp

Running "uname -a" on the server called inxs4 reports:

SunOS inxs4 5.6 Generic_105181-11 sun4u sparc

I'm using an "amd" that comes from Debian, and it reports:

Unofficial patch level 102.
amd 5.2.2.2 of 1992/05/31 16:53:21 bsd44-beta #0: Wed Aug 9 14:16:02 PDT 2000

Here is everything I could get off the console. Since gpm is still working, this ought to be accurate.

nfs warning: mount version newer than kernel
nfs_read_super: get root fattr failed
INIT: version 2.78 reloading
Unable to handle kernel NULL pointer dereference at virtual address
 printing eip:
c015cc28
*pde =
Oops: 0002
CPU:    0
EIP:    0010:[<c015cc28>]
EFLAGS: 00010286
eax: ebx: c0a8e064 ecx: c0b0e818 edx: c1f03dd0
esi: c1b36780 edi: c015cc18 ebp: c0a8e064 esp: c1f03c9c
ds: 0018 es: 0018 ss: 0018
Process cp (pid: 6569, stackpage=c1f03000)
Stack: c1f03d1c c01f5be6 c0a8e064 c0b0e818 c1f03dd0 0246 c1f03cec c1f03d1c
       0286 c01f86c3 c1f03d1c c1f03d14 c1f03d1c c1b36780 c1f02000 c01f5232
       c1f02000 c1f03d84 c1f03d84 c1f03cec c01f84a4 c01f8a02 c1f03d1c
Call Trace: [<c01f5be6>] [<c01f86c3>] [<c01f5232>] [<c01f84a4>] [<c01f8a02>] [<c01f5242>] [<c01f6e78>]
       [<c01f9a3d>] [<c015bb1a>] [<c0157f79>] [<c015a8c1>] [<c015a9a5>] [<c0159270>] [<c0157775>]
       [<c0122aca>] [<c013bb28>] [<c0157829>] [<c012a2fc>] [<c01094bc>]
Code: c7 00 02 00 00 00 8b 02 50 51 53 e8 fc fc ff ff 83 c4 0c 5b

First column is the address from above, second is the function:

c01094bc system_call
c0122aca generic_file_write
c012a2fc sys_write
c013bb28 update_atime
c0157775 nfs_commit_write
c0157829 nfs_file_write
c0157f79 nfs_writepage_sync
c0159270 nfs_updatepage
c015a8c1 nfs_instantiate
c015a9a5 nfs_create
c015bb1a nfs_proc_write
c015cc18 nfs_xdr_writeres
c015cc28 nfs_xdr_writeres
c01f5232 rpc_call_sync
c01f5242 rpc_call_sync
c01f5be6 call_decode
c01f6e78 xprt_timer
c01f84a4 __rpc_wake_up
c01f86c3 __rpc_execute
c01f8a02 rpc_execute
c01f9a3d rpc_init_task

These didn't map to anything. (many are stack addresses)

c0a8e064 ?
c0b0e818 ?
c1b36780 ?
c1f02000 ?
c1f03000 ?
c1f03c9c ?
c1f03cec ?
c1f03d14 ?
c1f03d1c ?
c1f03d84 ?
c1f03dd0 ?
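[Editor's sketch] The address-to-function table in this message was built by hand; the same lookup can be done with a sorted System.map and a binary search for the last symbol that starts at or below each address. The sketch below assumes "address type name" lines. The symbol start addresses and the end-of-text value in the sample are invented for illustration (the message gives only addresses inside the functions, not where the functions begin); only the looked-up addresses come from the oops.

```python
import bisect


def build_index(system_map_text):
    """Return a sorted list of (start_address, name) from System.map text."""
    entries = []
    for line in system_map_text.splitlines():
        parts = line.split()
        if len(parts) == 3:
            entries.append((int(parts[0], 16), parts[2]))
    entries.sort()
    return entries


def resolve(entries, addr, text_end):
    """Map addr to 'symbol+0xoffset', or None if it is outside kernel text.

    text_end is the first address past the code (hypothetical here); without
    it, stack and data addresses would be attributed to the last symbol.
    """
    if not entries or addr < entries[0][0] or addr >= text_end:
        return None
    starts = [start for start, _ in entries]
    position = bisect.bisect_right(starts, addr) - 1
    start, name = entries[position]
    return f"{name}+0x{addr - start:x}"


# Hypothetical symbol start addresses, chosen only to make the example run.
SAMPLE_MAP = """\
c015cc18 t nfs_xdr_writeres
c01f5b00 T call_decode
c01f8600 T __rpc_execute
"""
SAMPLE_TEXT_END = 0xC0200000  # hypothetical end of kernel text

index = build_index(SAMPLE_MAP)
eip_symbol = resolve(index, 0xC015CC28, SAMPLE_TEXT_END)
stack_symbol = resolve(index, 0xC1F03D1C, SAMPLE_TEXT_END)
```

With these sample values the EIP resolves to `nfs_xdr_writeres+0x10`, while c1f03d1c resolves to nothing, matching the observation above that many of the unmapped values are stack addresses.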