uvmfault (7.99.1/amd64)
My main machine suddenly hung last night and then rebooted. There was
no big load on it at that time. dmesg contains:

uvm_fault(0x810157c0, 0x8003393c8000, 1) -> e
fatal page fault in supervisor mode
trap type 6 code 0 rip 80264fc5 cs 8 rflags 10202 cr2 8003393c8000 ilevel 4 rsp fe813d81d720
curlwp 0xfe813dc10aa0 pid 0.143 lowest kstack 0xfe813d81a2c0
panic: trap
cpu7: Begin traceback...
vpanic() at netbsd:vpanic+0x13c
snprintf() at netbsd:snprintf
startlwp() at netbsd:startlwp
cpu7: End traceback...

dumping to dev 168,3 (offset=8, size=8373576):
dump Copyright (c) 1996, 1997, 1998, 1999, 2000, 2001, 2002, 2003, 2004, 2005,
(new kernel booting messages follow)

I did get a core dump, and I do have a kernel with symbols.

# gdb netbsd
GNU gdb (GDB) 7.7.1
Copyright (C) 2014 Free Software Foundation, Inc.
License GPLv3+: GNU GPL version 3 or later <http://gnu.org/licenses/gpl.html>
This is free software: you are free to change and redistribute it.
There is NO WARRANTY, to the extent permitted by law. Type "show copying"
and "show warranty" for details.
This GDB was configured as "x86_64--netbsd".
Type "show configuration" for configuration details.
For bug reporting instructions, please see:
<http://www.gnu.org/software/gdb/bugs/>.
Find the GDB manual and other documentation resources online at:
<http://www.gnu.org/software/gdb/documentation/>.
For help, type "help".
Type "apropos word" to search for commands related to "word"...
Reading symbols from netbsd...done.
(gdb) target kvm netbsd.core
0x805b6ac5 in cpu_reboot (howto=howto@entry=260, bootstr=bootstr@entry=0x0) at /archive/foreign/src/sys/arch/amd64/amd64/machdep.c:671
671 dumpsys();
(gdb) bt
#0 0x805b6ac5 in cpu_reboot (howto=howto@entry=260, bootstr=bootstr@entry=0x0) at /archive/foreign/src/sys/arch/amd64/amd64/machdep.c:671
#1 0x807b0ae4 in vpanic (fmt=fmt@entry=0x80c51a95 "trap", ap=ap@entry=0xfe813d81d510) at /archive/foreign/src/sys/kern/subr_prf.c:340
#2 0x807b0b9f in panic (fmt=fmt@entry=0x80c51a95 "trap") at /archive/foreign/src/sys/kern/subr_prf.c:256
#3 0x807fc037 in trap (frame=0xfe813d81d630) at /archive/foreign/src/sys/arch/amd64/amd64/trap.c:298
#4 0x8010108e in alltraps ()
#5 0x80264fc5 in .Mmbuf_inner_loop ()
#6 0xfe8692e23400 in ?? ()
#7 0xfe813d81d750 in ?? ()
#8 0x804c3b5e in in_delayed_cksum (m=0x8003393c8000) at /archive/foreign/src/sys/netinet/ip_output.c:791
Backtrace stopped: previous frame inner to this frame (corrupt stack?)

This does not really look like useful information, does it?

Thomas
Re: uvmfault (7.99.1/amd64)
Thomas Klausner wrote:
> My main machine suddenly hung last night and then rebooted. There was
> no big load on it at that time. dmesg contains:

[snip]

> #8 0x804c3b5e in in_delayed_cksum (m=0x8003393c8000) at /archive/foreign/src/sys/netinet/ip_output.c:791
> Backtrace stopped: previous frame inner to this frame (corrupt stack?)
>
> This does not really look like useful information, does it?

Can you tell which protocol family you were using at the time?

I was regularly getting a similar crash when using NFS over IPv6. This
was with a network controller that only offloads checksumming for IPv4;
the in_delayed_cksum() function is where the network stack does the
checksum in software.

I confess that the current way that I'm trying to fix it is by
switching to a network card with hardware checksumming for both IPv4
and IPv6.

Robert Swindells
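
For readers unfamiliar with what "the checksum in software" involves,
here is a minimal sketch of the ones-complement Internet checksum as
described in RFC 1071. It is not the NetBSD implementation:
in_delayed_cksum() works on mbuf chains, whereas this sketch assumes one
flat buffer, and the name inet_cksum is hypothetical.

```c
#include <stddef.h>
#include <stdint.h>

/*
 * Ones-complement Internet checksum over a flat buffer (RFC 1071).
 * Illustration only: the kernel walks an mbuf chain instead, and
 * inet_cksum is a hypothetical name, not a kernel function.
 */
static uint16_t
inet_cksum(const uint8_t *buf, size_t len)
{
	uint64_t sum = 0;

	/* Add the data as big-endian 16-bit words. */
	while (len > 1) {
		sum += ((uint32_t)buf[0] << 8) | buf[1];
		buf += 2;
		len -= 2;
	}

	/* A trailing odd byte is treated as if padded with a zero byte. */
	if (len == 1)
		sum += (uint32_t)buf[0] << 8;

	/* Fold the carries back into the low 16 bits. */
	while (sum >> 16)
		sum = (sum & 0xffff) + (sum >> 16);

	/* The checksum is the ones-complement of the folded sum. */
	return (uint16_t)~sum;
}
```

The loop dereferences every byte of the packet data, so a bad data
pointer in the buffer being summed would fault inside a loop of this
kind. Whether that is what happened in the trace above is not
established by this thread.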
Re: uvmfault (7.99.1/amd64)
On 09/13/14 07:55, Thomas Klausner wrote:
> My main machine suddenly hung last night and then rebooted. There was
> no big load on it at that time. dmesg contains:

[snip]

> This does not really look like useful information, does it?

Try crash(8). It does a better job of stack traces through traps.

Nick
Re: uvmfault (7.99.1/amd64)
On Sat, Sep 13, 2014 at 09:40:35AM +0100, Robert Swindells wrote:
> > #8 0x804c3b5e in in_delayed_cksum (m=0x8003393c8000) at /archive/foreign/src/sys/netinet/ip_output.c:791
> > Backtrace stopped: previous frame inner to this frame (corrupt stack?)
> >
> > This does not really look like useful information, does it?
>
> Can you tell which protocol family you were using at the time ?

I'm nfs-mounting via wm0:

wm0: flags=8843<UP,BROADCAST,RUNNING,SIMPLEX,MULTICAST> mtu 1500
        capabilities=7ff80<TSO4,IP4CSUM_Rx,IP4CSUM_Tx,TCP4CSUM_Rx>
        capabilities=7ff80<TCP4CSUM_Tx,UDP4CSUM_Rx,UDP4CSUM_Tx,TCP6CSUM_Rx>
        capabilities=7ff80<TCP6CSUM_Tx,UDP6CSUM_Rx,UDP6CSUM_Tx,TSO6>
        enabled=0
        ec_capabilities=7<VLAN_MTU,VLAN_HWTAGGING,JUMBO_MTU>
        ec_enabled=0
        address: ...
        media: Ethernet autoselect (1000baseT full-duplex,flowcontrol,rxpause,txpause)
        status: active
        inet ...
        inet6 ...

My /etc/fstab has IPv4 addresses for the NFS mounts, like this:

192.168.1.2:/volume1/music /disk/music nfs intr,nodev,nosuid,rw,soft,tcp

So it should be IPv4 only.

> I was regularly getting a similar crash when using NFS over IPv6, this
> was with a network controller that only offloads checksumming for IPv4,
> the in_delayed_cksum() function is where the network stack does the
> checksum in software.
>
> I confess that the current way that I'm trying to fix it is by
> switching to a network card with hardware checksumming for both IPv4
> and IPv6.

From the capabilities cited above, my card already should do that, right?

Thomas
Re: uvmfault (7.99.1/amd64)
On Sat, Sep 13, 2014 at 07:57:20AM +0100, Nick Hudson wrote:
> On 09/13/14 07:55, Thomas Klausner wrote:
> > My main machine suddenly hung last night and then rebooted. There was
> > no big load on it at that time. dmesg contains:

[snip]

> > This does not really look like useful information, does it?
>
> Try crash(8). It does a better job of stack traces through traps.

# crash -M netbsd.core -N netbsd
Crash version 7.99.1, image version 7.99.1.
System panicked: trap
Backtrace from time of crash is available.
crash> bt
_KERNEL_OPT_NVGA_RASTERCONSOLE() at 0
_KERNEL_OPT_IPFILTER_COMPAT() at _KERNEL_OPT_IPFILTER_COMPAT+0x3
vpanic() at vpanic+0x145
snprintf() at snprintf
startlwp() at startlwp
crash>

That looks weird.

Thomas
Re: Some drmkms success on my laptop
On Sat 13 Sep 2014 at 02:35:11 +0200, Rhialto wrote:
> pms0 at pckbc1 (aux slot)
> pms0: Synaptics touchpad version 6.5
> pms0: Palm detect, One button click pad
> pckbc1: using irq 12 for aux slot
> wsmouse0 at pms0 mux 0

"One button click pad" - I guess that explains why X only recognizes a
single button. The kernel mis-recognizes it. This is a regression,
since in the 6.x kernels it still works fine.

It is also nothing to do with DRMKMS, it also happens in the GENERIC
kernel. X uses the VESA driver there (which is sub-optimal since it
uses a resolution of 1024 x 768 instead of 1600 x 900).

The assertion failure I mentioned is in this fragment of code, for
reference, line 2833 of
src/sys/external/bsd/drm2/dist/drm/i915/i915_gem.c:

2826     static void
2827     i915_gem_object_move_to_inactive(struct drm_i915_gem_object *obj)
2828     {
2829             struct drm_i915_private *dev_priv = obj->base.dev->dev_private;
2830             struct i915_address_space *vm;
2831             struct i915_vma *vma;
2832
2833 -->         BUG_ON(obj->base.write_domain & ~I915_GEM_GPU_DOMAINS);
2834             BUG_ON(!obj->active);
2835
2836             list_for_each_entry(vm, &dev_priv->vm_list, global_link) {
2837                     vma = i915_gem_obj_to_vma(obj, vm);
2838                     if (vma && !list_empty(&vma->mm_list))
2839                             list_move_tail(&vma->mm_list, &vm->inactive_list);
2840             }

-Olaf.
-- 
___ Olaf 'Rhialto' Seibert  -- The Doctor: No, 'eureka' is Greek for
\X/ rhialto/at/xs4all.nl    -- 'this bath is too hot.'
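
To make the condition on line 2833 concrete, here is a small standalone
sketch of the mask test. The domain values and the composition of
I915_GEM_GPU_DOMAINS are written from memory of the i915 headers and
should be treated as assumptions to verify against the tree in use. The
helper name write_domain_trips_bug is hypothetical.

```c
#include <stdbool.h>
#include <stdint.h>

/*
 * Assumed values, recalled from the i915 headers. Check them against
 * the headers in your own source tree before relying on them.
 */
#define I915_GEM_DOMAIN_CPU		0x00000001
#define I915_GEM_DOMAIN_RENDER		0x00000002
#define I915_GEM_DOMAIN_SAMPLER		0x00000004
#define I915_GEM_DOMAIN_COMMAND		0x00000008
#define I915_GEM_DOMAIN_INSTRUCTION	0x00000010
#define I915_GEM_DOMAIN_VERTEX		0x00000020
#define I915_GEM_DOMAIN_GTT		0x00000040

#define I915_GEM_GPU_DOMAINS \
	(I915_GEM_DOMAIN_RENDER | I915_GEM_DOMAIN_SAMPLER | \
	 I915_GEM_DOMAIN_COMMAND | I915_GEM_DOMAIN_INSTRUCTION | \
	 I915_GEM_DOMAIN_VERTEX)

/*
 * Hypothetical helper: true when the first BUG_ON in the fragment
 * would fire, i.e. when any non-GPU bit is set in the write domain.
 */
static bool
write_domain_trips_bug(uint32_t write_domain)
{
	return (write_domain & ~I915_GEM_GPU_DOMAINS) != 0;
}
```

Under these assumptions, the assertion fires when the object reaches
this function with a CPU or GTT write domain still set, and passes when
the write domain is empty or GPU-only.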
Re: Wedges sysinst (Was: Wedges enabled on -current)
On Sat, Sep 13, 2014 at 08:25:24AM +0700, Robert Elz wrote:
> I cannot get any mention of gpt partitioning to appear anywhere - using
> an amd64 xen domU install kernel 7.99.1 (from a day or two ago).

sysinst checks whether some binaries are available in $PATH and
silently hides menus (for gpt, lvm, cgd, ...) if they are not found - I
haven't tried that install kernel image yet, and there may also be bugs
in the hiding code. Need to have a closer look...

Martin
Re: uvmfault (7.99.1/amd64)
Thomas Klausner wrote:
> On Sat, Sep 13, 2014 at 09:40:35AM +0100, Robert Swindells wrote:
> > > #8 0x804c3b5e in in_delayed_cksum (m=0x8003393c8000) at /archive/foreign/src/sys/netinet/ip_output.c:791
> > > Backtrace stopped: previous frame inner to this frame (corrupt stack?)
> > >
> > > This does not really look like useful information, does it?
> >
> > Can you tell which protocol family you were using at the time ?
>
> I'm nfs-mounting via wm0:
>
> wm0: flags=8843<UP,BROADCAST,RUNNING,SIMPLEX,MULTICAST> mtu 1500
>         capabilities=7ff80<TSO4,IP4CSUM_Rx,IP4CSUM_Tx,TCP4CSUM_Rx>
>         capabilities=7ff80<TCP4CSUM_Tx,UDP4CSUM_Rx,UDP4CSUM_Tx,TCP6CSUM_Rx>
>         capabilities=7ff80<TCP6CSUM_Tx,UDP6CSUM_Rx,UDP6CSUM_Tx,TSO6>
>         enabled=0
>         ec_capabilities=7<VLAN_MTU,VLAN_HWTAGGING,JUMBO_MTU>
>         ec_enabled=0
>         address: ...
>         media: Ethernet autoselect (1000baseT full-duplex,flowcontrol,rxpause,txpause)
>         status: active
>         inet ...
>         inet6 ...

I just added a wm card to my main system and it seems solid with all
the offload features turned on, even TSO. Obviously it doesn't help
with finding any problem in the kernel.

> My /etc/fstab has IPv4 addresses for the NFS mounts, like this:
>
> 192.168.1.2:/volume1/music /disk/music nfs intr,nodev,nosuid,rw,soft,tcp
>
> So it should be IPv4 only.

And TCP; I was using UDP over IPv6. A common factor is writing to NFS
though.

> > I was regularly getting a similar crash when using NFS over IPv6, this
> > was with a network controller that only offloads checksumming for IPv4,
> > the in_delayed_cksum() function is where the network stack does the
> > checksum in software.
> >
> > I confess that the current way that I'm trying to fix it is by
> > switching to a network card with hardware checksumming for both IPv4
> > and IPv6.
>
> From the capabilities cited above, my card already should do that, right?

No, the enabled=0 means they are all turned off.

To turn on the checksumming you can run:

# ifconfig wm0 ip4csum udp4csum tcp4csum udp6csum tcp6csum

Or put the options in your /etc/ifconfig.wm0 file.

Don't do this if you are using bridge(4) on this machine.

Robert Swindells
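
The difference between "capabilities" and "enabled" in the ifconfig
output can be illustrated with a small decoder. This is a sketch under
an assumption: that the twelve names in the output are listed in
ascending bit order starting at bit 7, which is what the 0x7ff80 mask
(bits 7 through 18) suggests. The names decode_caps, cap_names and
CAP_FIRST_BIT are hypothetical and not taken from the NetBSD sources.

```c
#include <stddef.h>
#include <string.h>

/*
 * Names as printed in the quoted ifconfig output. The bit positions
 * are an assumption inferred from the 0x7ff80 mask, not copied from
 * the kernel headers.
 */
static const char *const cap_names[] = {
	"TSO4", "IP4CSUM_Rx", "IP4CSUM_Tx", "TCP4CSUM_Rx",
	"TCP4CSUM_Tx", "UDP4CSUM_Rx", "UDP4CSUM_Tx", "TCP6CSUM_Rx",
	"TCP6CSUM_Tx", "UDP6CSUM_Rx", "UDP6CSUM_Tx", "TSO6"
};
#define CAP_FIRST_BIT	7

/*
 * Write a comma-separated list of the capability names set in mask
 * into out (truncating if it does not fit) and return how many bits
 * were set.
 */
static size_t
decode_caps(unsigned long mask, char *out, size_t outlen)
{
	size_t n = 0;

	if (outlen == 0)
		return 0;
	out[0] = '\0';

	for (size_t i = 0; i < sizeof(cap_names) / sizeof(cap_names[0]); i++) {
		if ((mask & (1UL << (CAP_FIRST_BIT + i))) == 0)
			continue;
		if (n > 0)
			strncat(out, ",", outlen - strlen(out) - 1);
		strncat(out, cap_names[i], outlen - strlen(out) - 1);
		n++;
	}
	return n;
}
```

Decoding 0x7ff80 yields all twelve names (what the hardware can do),
while decoding 0 yields an empty list (what is currently switched on),
which matches the "enabled=0" line in the output above.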
daily CVS update output
Updating src tree:
P src/distrib/amd64/cdroms/Makefile.cdrom
P src/distrib/amd64/installimage/Makefile
P src/distrib/amd64/installimage/spec.inst
P src/distrib/i386/cdroms/Makefile.cdrom
P src/distrib/i386/installimage/Makefile
P src/distrib/i386/installimage/spec.inst
P src/distrib/sparc64/cdroms/installcd/Makefile
P src/distrib/vax/cdroms/installcd/Makefile
P src/etc/defaults/rc.conf
P src/sys/arch/acorn26/acorn26/db_interface.c
P src/sys/arch/acorn26/include/vmparam.h
P src/sys/arch/acorn32/acorn32/rpc_machdep.c
P src/sys/arch/acorn32/eb7500atx/eb7500atx_machdep.c
P src/sys/arch/acorn32/podulebus/esc.c
P src/sys/arch/acorn32/podulebus/podulebus.c
P src/sys/arch/acorn32/podulebus/ptsc.c
P src/sys/arch/acorn32/podulebus/sfas.c
U src/sys/arch/arm/allwinner/awin_otg.c
U src/sys/arch/arm/allwinner/awin_otgreg.h
P src/sys/arch/arm/allwinner/awin_reg.h
P src/sys/arch/arm/allwinner/awin_usb.c
P src/sys/arch/arm/allwinner/files.awin
P src/sys/arch/arm/at91/at91bus.c
P src/sys/arch/arm/broadcom/bcm2835_emmc.c
P src/sys/arch/arm/broadcom/files.bcm2835
P src/sys/arch/arm/footbridge/isa/isa_io.c
P src/sys/arch/arm/include/cpufunc_proto.h
P src/sys/arch/arm/include/db_machdep.h
P src/sys/arch/arm/include/ofw.h
P src/sys/arch/arm/include/types.h
P src/sys/arch/arm/iomd/vidcaudio.c
P src/sys/arch/epoc32/epoc32/machdep.c
P src/sys/arch/evbarm/adi_brh/brh_machdep.c
P src/sys/arch/evbarm/armadillo/armadillo9_machdep.c
P src/sys/arch/evbarm/conf/CUBIEBOARD
P src/sys/arch/evbarm/conf/RPI
P src/sys/arch/evbarm/conf/std.awin
P src/sys/arch/evbarm/g42xxeb/g42xxeb_machdep.c
P src/sys/arch/evbarm/hdl_g/hdlg_machdep.c
P src/sys/arch/evbarm/imx31/imx31lk_machdep.c
P src/sys/arch/evbarm/iq80310/iq80310_machdep.c
P src/sys/arch/evbarm/iq80321/iq80321_machdep.c
P src/sys/arch/evbarm/ixdp425/ixdp425_machdep.c
P src/sys/arch/evbarm/ixm1200/ixm1200_machdep.c
P src/sys/arch/evbarm/lubbock/lubbock_machdep.c
P src/sys/arch/evbarm/mini2440/mini2440_machdep.c
P src/sys/arch/evbarm/npwr_fc/npwr_fc_machdep.c
P src/sys/arch/evbarm/nslu2/nslu2_machdep.c
P src/sys/arch/evbarm/smdk2xx0/smdk2410_machdep.c
P src/sys/arch/evbarm/smdk2xx0/smdk2800_machdep.c
P src/sys/arch/evbarm/tsarm/tsarm_machdep.c
P src/sys/arch/evbarm/tsarm/isa/isa_io.c
P src/sys/arch/evbarm/viper/viper_machdep.c
P src/sys/arch/hpcarm/hpcarm/hpc_machdep.c
P src/sys/arch/hpcarm/hpcarm/pxa2x0_hpc_machdep.c
P src/sys/arch/hpcarm/hpcarm/sa11x0_hpc_machdep.c
P src/sys/arch/iyonix/iyonix/iyonix_machdep.c
P src/sys/arch/netwinder/netwinder/netwinder_machdep.c
P src/sys/conf/files
P src/sys/dev/usb/ehci.c
P src/sys/dev/usb/motg.c
P src/sys/dev/usb/motgreg.h
P src/sys/dev/usb/motgvar.h
P src/sys/net/bpf.c
P src/usr.bin/make/var.c
P src/usr.sbin/sysinst/net.c

Updating xsrc tree:

Killing core files:

Running the SUP scanner:
SUP Scan for current starting at Sun Sep 14 03:06:00 2014
SUP Scan for current completed at Sun Sep 14 03:06:50 2014
SUP Scan for mirror starting at Sun Sep 14 03:06:50 2014
SUP Scan for mirror completed at Sun Sep 14 03:09:09 2014

Updating file list:
-rw-rw-r-- 1 srcmastr netbsd 43030493 Sep 14 03:17 ls-lRA.gz