uvmfault (7.99.1/amd64)

2014-09-13 Thread Thomas Klausner
My main machine suddenly hung last night and then rebooted. There was
no big load on it at that time. dmesg contains:

uvm_fault(0x810157c0, 0x8003393c8000, 1) - e
fatal page fault in supervisor mode
trap type 6 code 0 rip 80264fc5 cs 8 rflags 10202 cr2 8003393c8000 
ilevel 4 rsp fe813d81d720
curlwp 0xfe813dc10aa0 pid 0.143 lowest kstack 0xfe813d81a2c0
panic: trap
cpu7: Begin traceback...
vpanic() at netbsd:vpanic+0x13c
snprintf() at netbsd:snprintf
startlwp() at netbsd:startlwp
cpu7: End traceback...

dumping to dev 168,3 (offset=8, size=8373576):
dump Copyright (c) 1996, 1997, 1998, 1999, 2000, 2001, 2002, 2003, 2004, 2005,
(new kernel booting messages follow)

I did get a core dump, and I do have a kernel with symbols.
# gdb netbsd
GNU gdb (GDB) 7.7.1
Copyright (C) 2014 Free Software Foundation, Inc.
License GPLv3+: GNU GPL version 3 or later http://gnu.org/licenses/gpl.html
This is free software: you are free to change and redistribute it.
There is NO WARRANTY, to the extent permitted by law.  Type show copying
and show warranty for details.
This GDB was configured as x86_64--netbsd.
Type show configuration for configuration details.
For bug reporting instructions, please see:
http://www.gnu.org/software/gdb/bugs/.
Find the GDB manual and other documentation resources online at:
http://www.gnu.org/software/gdb/documentation/.
For help, type help.
Type apropos word to search for commands related to word...
Reading symbols from netbsd...done.
(gdb) target  kvm netbsd.core
0x805b6ac5 in cpu_reboot (howto=howto@entry=260, 
bootstr=bootstr@entry=0x0) at 
/archive/foreign/src/sys/arch/amd64/amd64/machdep.c:671
671 dumpsys();
(gdb) bt
#0  0x805b6ac5 in cpu_reboot (howto=howto@entry=260, 
bootstr=bootstr@entry=0x0) at 
/archive/foreign/src/sys/arch/amd64/amd64/machdep.c:671
#1  0x807b0ae4 in vpanic (fmt=fmt@entry=0x80c51a95 trap, 
ap=ap@entry=0xfe813d81d510) at /archive/foreign/src/sys/kern/subr_prf.c:340
#2  0x807b0b9f in panic (fmt=fmt@entry=0x80c51a95 trap) at 
/archive/foreign/src/sys/kern/subr_prf.c:256
#3  0x807fc037 in trap (frame=0xfe813d81d630) at 
/archive/foreign/src/sys/arch/amd64/amd64/trap.c:298
#4  0x8010108e in alltraps ()
#5  0x80264fc5 in .Mmbuf_inner_loop ()
#6  0xfe8692e23400 in ?? ()
#7  0xfe813d81d750 in ?? ()
#8  0x804c3b5e in in_delayed_cksum (m=0x8003393c8000) at 
/archive/foreign/src/sys/netinet/ip_output.c:791
Backtrace stopped: previous frame inner to this frame (corrupt stack?)

This does not really look like useful information, does it?
 Thomas


Re: uvmfault (7.99.1/amd64)

2014-09-13 Thread Robert Swindells

Thomas Klausner wrote:
My main machine suddenly hung last night and then rebooted. There was
no big load on it at that time. dmesg contains:

[snip]

#8  0x804c3b5e in in_delayed_cksum (m=0x8003393c8000) at 
/archive/foreign/src/sys/netinet/ip_output.c:791
Backtrace stopped: previous frame inner to this frame (corrupt stack?)

This does not really look like useful information, does it?

Can you tell which protocol family you were using at the time ?

I was regularly getting a similar crash when using NFS over IPv6, this
was with a network controller that only offloads checksumming for IPv4,
the in_delayed_cksum() function is where the network stack does the
checksum in software.

I confess that the current way that I'm trying to fix it is by
switching to a network card with hardware checksumming for both IPv4
and IPv6.

Robert Swindells


Re: uvmfault (7.99.1/amd64)

2014-09-13 Thread Nick Hudson

On 09/13/14 07:55, Thomas Klausner wrote:

My main machine suddenly hung last night and then rebooted. There was
no big load on it at that time. dmesg contains:

uvm_fault(0x810157c0, 0x8003393c8000, 1) - e
fatal page fault in supervisor mode
trap type 6 code 0 rip 80264fc5 cs 8 rflags 10202 cr2 8003393c8000 
ilevel 4 rsp fe813d81d720
curlwp 0xfe813dc10aa0 pid 0.143 lowest kstack 0xfe813d81a2c0
panic: trap
cpu7: Begin traceback...
vpanic() at netbsd:vpanic+0x13c
snprintf() at netbsd:snprintf
startlwp() at netbsd:startlwp
cpu7: End traceback...

dumping to dev 168,3 (offset=8, size=8373576):
dump Copyright (c) 1996, 1997, 1998, 1999, 2000, 2001, 2002, 2003, 2004, 2005,
(new kernel booting messages follow)

I did get a core dump, and I do have a kernel with symbols.
# gdb netbsd
GNU gdb (GDB) 7.7.1
Copyright (C) 2014 Free Software Foundation, Inc.
License GPLv3+: GNU GPL version 3 or later http://gnu.org/licenses/gpl.html
This is free software: you are free to change and redistribute it.
There is NO WARRANTY, to the extent permitted by law.  Type show copying
and show warranty for details.
This GDB was configured as x86_64--netbsd.
Type show configuration for configuration details.
For bug reporting instructions, please see:
http://www.gnu.org/software/gdb/bugs/.
Find the GDB manual and other documentation resources online at:
http://www.gnu.org/software/gdb/documentation/.
For help, type help.
Type apropos word to search for commands related to word...
Reading symbols from netbsd...done.
(gdb) target  kvm netbsd.core
0x805b6ac5 in cpu_reboot (howto=howto@entry=260, 
bootstr=bootstr@entry=0x0) at 
/archive/foreign/src/sys/arch/amd64/amd64/machdep.c:671
671 dumpsys();
(gdb) bt
#0  0x805b6ac5 in cpu_reboot (howto=howto@entry=260, 
bootstr=bootstr@entry=0x0) at 
/archive/foreign/src/sys/arch/amd64/amd64/machdep.c:671
#1  0x807b0ae4 in vpanic (fmt=fmt@entry=0x80c51a95 trap, 
ap=ap@entry=0xfe813d81d510) at /archive/foreign/src/sys/kern/subr_prf.c:340
#2  0x807b0b9f in panic (fmt=fmt@entry=0x80c51a95 trap) at 
/archive/foreign/src/sys/kern/subr_prf.c:256
#3  0x807fc037 in trap (frame=0xfe813d81d630) at 
/archive/foreign/src/sys/arch/amd64/amd64/trap.c:298
#4  0x8010108e in alltraps ()
#5  0x80264fc5 in .Mmbuf_inner_loop ()
#6  0xfe8692e23400 in ?? ()
#7  0xfe813d81d750 in ?? ()
#8  0x804c3b5e in in_delayed_cksum (m=0x8003393c8000) at 
/archive/foreign/src/sys/netinet/ip_output.c:791
Backtrace stopped: previous frame inner to this frame (corrupt stack?)

This does not really look like useful information, does it?
  Thomas



Try crash(8). It does a better job of stack traces through traps.

NIck


Re: uvmfault (7.99.1/amd64)

2014-09-13 Thread Thomas Klausner
On Sat, Sep 13, 2014 at 09:40:35AM +0100, Robert Swindells wrote:
 #8  0x804c3b5e in in_delayed_cksum (m=0x8003393c8000) at 
 /archive/foreign/src/sys/netinet/ip_output.c:791
 Backtrace stopped: previous frame inner to this frame (corrupt stack?)
 
 This does not really look like useful information, does it?
 
 Can you tell which protocol family you were using at the time ?

I'm nfs-mounting via wm0:
wm0: flags=8843UP,BROADCAST,RUNNING,SIMPLEX,MULTICAST mtu 1500
capabilities=7ff80TSO4,IP4CSUM_Rx,IP4CSUM_Tx,TCP4CSUM_Rx
capabilities=7ff80TCP4CSUM_Tx,UDP4CSUM_Rx,UDP4CSUM_Tx,TCP6CSUM_Rx
capabilities=7ff80TCP6CSUM_Tx,UDP6CSUM_Rx,UDP6CSUM_Tx,TSO6
enabled=0
ec_capabilities=7VLAN_MTU,VLAN_HWTAGGING,JUMBO_MTU
ec_enabled=0
address: ...
media: Ethernet autoselect (1000baseT 
full-duplex,flowcontrol,rxpause,txpause)
status: active
inet ...
inet6 ...

My /etc/fstab has IPv4 addresses for the NFS mounts, like this:

192.168.1.2:/volume1/music  /disk/music nfs 
intr,nodev,nosuid,rw,soft,tcp

So it should be IPv4 only.

 I was regularly getting a similar crash when using NFS over IPv6, this
 was with a network controller that only offloads checksumming for IPv4,
 the in_delayed_cksum() function is where the network stack does the
 checksum in software.
 
 I confess that the current way that I'm trying to fix it is by
 switching to a network card with hardware checksumming for both IPv4
 and IPv6.

From the capabilities cited above, my card already should do that, right?
 Thomas


Re: uvmfault (7.99.1/amd64)

2014-09-13 Thread Thomas Klausner
On Sat, Sep 13, 2014 at 07:57:20AM +0100, Nick Hudson wrote:
 On 09/13/14 07:55, Thomas Klausner wrote:
 My main machine suddenly hung last night and then rebooted. There was
 no big load on it at that time. dmesg contains:
 
 uvm_fault(0x810157c0, 0x8003393c8000, 1) - e
 fatal page fault in supervisor mode
 trap type 6 code 0 rip 80264fc5 cs 8 rflags 10202 cr2 
 8003393c8000 ilevel 4 rsp fe813d81d720
 curlwp 0xfe813dc10aa0 pid 0.143 lowest kstack 0xfe813d81a2c0
 panic: trap
 cpu7: Begin traceback...
 vpanic() at netbsd:vpanic+0x13c
 snprintf() at netbsd:snprintf
 startlwp() at netbsd:startlwp
 cpu7: End traceback...
 
 dumping to dev 168,3 (offset=8, size=8373576):
 dump Copyright (c) 1996, 1997, 1998, 1999, 2000, 2001, 2002, 2003, 2004, 
 2005,
 (new kernel booting messages follow)
 
 I did get a core dump, and I do have a kernel with symbols.
 # gdb netbsd
 GNU gdb (GDB) 7.7.1
 Copyright (C) 2014 Free Software Foundation, Inc.
 License GPLv3+: GNU GPL version 3 or later http://gnu.org/licenses/gpl.html
 This is free software: you are free to change and redistribute it.
 There is NO WARRANTY, to the extent permitted by law.  Type show copying
 and show warranty for details.
 This GDB was configured as x86_64--netbsd.
 Type show configuration for configuration details.
 For bug reporting instructions, please see:
 http://www.gnu.org/software/gdb/bugs/.
 Find the GDB manual and other documentation resources online at:
 http://www.gnu.org/software/gdb/documentation/.
 For help, type help.
 Type apropos word to search for commands related to word...
 Reading symbols from netbsd...done.
 (gdb) target  kvm netbsd.core
 0x805b6ac5 in cpu_reboot (howto=howto@entry=260, 
 bootstr=bootstr@entry=0x0) at 
 /archive/foreign/src/sys/arch/amd64/amd64/machdep.c:671
 671 dumpsys();
 (gdb) bt
 #0  0x805b6ac5 in cpu_reboot (howto=howto@entry=260, 
 bootstr=bootstr@entry=0x0) at 
 /archive/foreign/src/sys/arch/amd64/amd64/machdep.c:671
 #1  0x807b0ae4 in vpanic (fmt=fmt@entry=0x80c51a95 trap, 
 ap=ap@entry=0xfe813d81d510) at 
 /archive/foreign/src/sys/kern/subr_prf.c:340
 #2  0x807b0b9f in panic (fmt=fmt@entry=0x80c51a95 trap) at 
 /archive/foreign/src/sys/kern/subr_prf.c:256
 #3  0x807fc037 in trap (frame=0xfe813d81d630) at 
 /archive/foreign/src/sys/arch/amd64/amd64/trap.c:298
 #4  0x8010108e in alltraps ()
 #5  0x80264fc5 in .Mmbuf_inner_loop ()
 #6  0xfe8692e23400 in ?? ()
 #7  0xfe813d81d750 in ?? ()
 #8  0x804c3b5e in in_delayed_cksum (m=0x8003393c8000) at 
 /archive/foreign/src/sys/netinet/ip_output.c:791
 Backtrace stopped: previous frame inner to this frame (corrupt stack?)
 
 This does not really look like useful information, does it?
   Thomas
 
 
 Try crash(8). It does a better job of stack traces through traps.

# crash -M netbsd.core -N netbsd 
Crash version 7.99.1, image version 7.99.1.
System panicked: trap
Backtrace from time of crash is available.
crash bt
_KERNEL_OPT_NVGA_RASTERCONSOLE() at 0
_KERNEL_OPT_IPFILTER_COMPAT() at _KERNEL_OPT_IPFILTER_COMPAT+0x3
vpanic() at vpanic+0x145
snprintf() at snprintf
startlwp() at startlwp
crash 

That looks weird.
 Thomas


Re: Some drmkms success on my laptop

2014-09-13 Thread Rhialto
On Sat 13 Sep 2014 at 02:35:11 +0200, Rhialto wrote:
 pms0 at pckbc1 (aux slot)
 pms0: Synaptics touchpad version 6.5
 pms0: Palm detect, One button click pad
 pckbc1: using irq 12 for aux slot
 wsmouse0 at pms0 mux 0

One button click pad - I guess that explains why X only recognizes a
single button. The kernel mis-recognizes it. This is a regression, since
in the 6.x kernels it still works fine.

It is also nothing to do with DRMKMS, it also happens in the GENERIC
kernel. X uses the VESA driver there (which is sub-optimal since it uses
a resolution of 1024 x 768 instead of 1600 x 900).

The assertion failure I mentioned is in this fragment of code, for
reference, line 2833 of src/sys/external/bsd/drm2/dist/drm/i915/i915_gem.c:

2826 static void
2827 i915_gem_object_move_to_inactive(struct drm_i915_gem_object *obj)
2828 {
2829 struct drm_i915_private *dev_priv = obj-base.dev-dev_private;
2830 struct i915_address_space *vm;
2831 struct i915_vma *vma;
2832 
2833 -- BUG_ON(obj-base.write_domain  ~I915_GEM_GPU_DOMAINS);
2834 BUG_ON(!obj-active);
2835 
2836 list_for_each_entry(vm, dev_priv-vm_list, global_link) {
2837 vma = i915_gem_obj_to_vma(obj, vm);
2838 if (vma  !list_empty(vma-mm_list))
2839 list_move_tail(vma-mm_list, vm-inactive_list);
2840 }

-Olaf.
-- 
___ Olaf 'Rhialto' Seibert  -- The Doctor: No, 'eureka' is Greek for
\X/ rhialto/at/xs4all.nl-- 'this bath is too hot.'


pgpp_W7q6lZf0.pgp
Description: PGP signature


Re: Wedges sysinst (Was: Wedges enabled on -current)

2014-09-13 Thread Martin Husemann
On Sat, Sep 13, 2014 at 08:25:24AM +0700, Robert Elz wrote:
 I cannot get any mention of gpt partitioning to appear anywhere - using
 an amd64 xen domU install kernel 7.99.1 (from a day or two ago).

It checks wether some binaries are available in $PATH and hides menus
(for gpt, lvm, cgd, ...) silently if they are not found - I haven't
tried that install kernel image yet, and also there may be bugs in the
hiding code.

Need to have a closer look...

Martin


Re: uvmfault (7.99.1/amd64)

2014-09-13 Thread Robert Swindells

Thomas Klausner wrote:
On Sat, Sep 13, 2014 at 09:40:35AM +0100, Robert Swindells wrote:
 #8  0x804c3b5e in in_delayed_cksum (m=0x8003393c8000) at 
 /archive/foreign/src/sys/netinet/ip_output.c:791
 Backtrace stopped: previous frame inner to this frame (corrupt stack?)
 
 This does not really look like useful information, does it?
 
 Can you tell which protocol family you were using at the time ?

I'm nfs-mounting via wm0:
wm0: flags=8843UP,BROADCAST,RUNNING,SIMPLEX,MULTICAST mtu 1500
capabilities=7ff80TSO4,IP4CSUM_Rx,IP4CSUM_Tx,TCP4CSUM_Rx
capabilities=7ff80TCP4CSUM_Tx,UDP4CSUM_Rx,UDP4CSUM_Tx,TCP6CSUM_Rx
capabilities=7ff80TCP6CSUM_Tx,UDP6CSUM_Rx,UDP6CSUM_Tx,TSO6
enabled=0
ec_capabilities=7VLAN_MTU,VLAN_HWTAGGING,JUMBO_MTU
ec_enabled=0
address: ...
media: Ethernet autoselect (1000baseT 
 full-duplex,flowcontrol,rxpause,txpause)
status: active
inet ...
inet6 ...

I just added a wm card to my main system and it seems solid with all the
offload features turned on, even TSO.

Obviously it doesn't help with finding any problem in the kernel.

My /etc/fstab has IPv4 addresses for the NFS mounts, like this:

192.168.1.2:/volume1/music  /disk/music nfs 
intr,nodev,nosuid,rw,soft,tcp

So it should be IPv4 only.

And TCP, I was using UDP over IPv6.

A common factor is writing to NFS though.

 I was regularly getting a similar crash when using NFS over IPv6, this
 was with a network controller that only offloads checksumming for IPv4,
 the in_delayed_cksum() function is where the network stack does the
 checksum in software.
 
 I confess that the current way that I'm trying to fix it is by
 switching to a network card with hardware checksumming for both IPv4
 and IPv6.

From the capabilities cited above, my card already should do that, right?

No, the enabled=0 means they are all turned off.

To turn on the checksumming you can run:

# ifconfig wm0 ip4csum udp4csum tcp4csum udp6csum tcp6csum

Or put the options in you /etc/ifconfig.wm0 file.

Don't do this if you are using bridge(4) on this machine.

Robert Swindells


daily CVS update output

2014-09-13 Thread NetBSD source update

Updating src tree:
P src/distrib/amd64/cdroms/Makefile.cdrom
P src/distrib/amd64/installimage/Makefile
P src/distrib/amd64/installimage/spec.inst
P src/distrib/i386/cdroms/Makefile.cdrom
P src/distrib/i386/installimage/Makefile
P src/distrib/i386/installimage/spec.inst
P src/distrib/sparc64/cdroms/installcd/Makefile
P src/distrib/vax/cdroms/installcd/Makefile
P src/etc/defaults/rc.conf
P src/sys/arch/acorn26/acorn26/db_interface.c
P src/sys/arch/acorn26/include/vmparam.h
P src/sys/arch/acorn32/acorn32/rpc_machdep.c
P src/sys/arch/acorn32/eb7500atx/eb7500atx_machdep.c
P src/sys/arch/acorn32/podulebus/esc.c
P src/sys/arch/acorn32/podulebus/podulebus.c
P src/sys/arch/acorn32/podulebus/ptsc.c
P src/sys/arch/acorn32/podulebus/sfas.c
U src/sys/arch/arm/allwinner/awin_otg.c
U src/sys/arch/arm/allwinner/awin_otgreg.h
P src/sys/arch/arm/allwinner/awin_reg.h
P src/sys/arch/arm/allwinner/awin_usb.c
P src/sys/arch/arm/allwinner/files.awin
P src/sys/arch/arm/at91/at91bus.c
P src/sys/arch/arm/broadcom/bcm2835_emmc.c
P src/sys/arch/arm/broadcom/files.bcm2835
P src/sys/arch/arm/footbridge/isa/isa_io.c
P src/sys/arch/arm/include/cpufunc_proto.h
P src/sys/arch/arm/include/db_machdep.h
P src/sys/arch/arm/include/ofw.h
P src/sys/arch/arm/include/types.h
P src/sys/arch/arm/iomd/vidcaudio.c
P src/sys/arch/epoc32/epoc32/machdep.c
P src/sys/arch/evbarm/adi_brh/brh_machdep.c
P src/sys/arch/evbarm/armadillo/armadillo9_machdep.c
P src/sys/arch/evbarm/conf/CUBIEBOARD
P src/sys/arch/evbarm/conf/RPI
P src/sys/arch/evbarm/conf/std.awin
P src/sys/arch/evbarm/g42xxeb/g42xxeb_machdep.c
P src/sys/arch/evbarm/hdl_g/hdlg_machdep.c
P src/sys/arch/evbarm/imx31/imx31lk_machdep.c
P src/sys/arch/evbarm/iq80310/iq80310_machdep.c
P src/sys/arch/evbarm/iq80321/iq80321_machdep.c
P src/sys/arch/evbarm/ixdp425/ixdp425_machdep.c
P src/sys/arch/evbarm/ixm1200/ixm1200_machdep.c
P src/sys/arch/evbarm/lubbock/lubbock_machdep.c
P src/sys/arch/evbarm/mini2440/mini2440_machdep.c
P src/sys/arch/evbarm/npwr_fc/npwr_fc_machdep.c
P src/sys/arch/evbarm/nslu2/nslu2_machdep.c
P src/sys/arch/evbarm/smdk2xx0/smdk2410_machdep.c
P src/sys/arch/evbarm/smdk2xx0/smdk2800_machdep.c
P src/sys/arch/evbarm/tsarm/tsarm_machdep.c
P src/sys/arch/evbarm/tsarm/isa/isa_io.c
P src/sys/arch/evbarm/viper/viper_machdep.c
P src/sys/arch/hpcarm/hpcarm/hpc_machdep.c
P src/sys/arch/hpcarm/hpcarm/pxa2x0_hpc_machdep.c
P src/sys/arch/hpcarm/hpcarm/sa11x0_hpc_machdep.c
P src/sys/arch/iyonix/iyonix/iyonix_machdep.c
P src/sys/arch/netwinder/netwinder/netwinder_machdep.c
P src/sys/conf/files
P src/sys/dev/usb/ehci.c
P src/sys/dev/usb/motg.c
P src/sys/dev/usb/motgreg.h
P src/sys/dev/usb/motgvar.h
P src/sys/net/bpf.c
P src/usr.bin/make/var.c
P src/usr.sbin/sysinst/net.c

Updating xsrc tree:


Killing core files:

Running the SUP scanner:
SUP Scan for current starting at Sun Sep 14 03:06:00 2014
SUP Scan for current completed at Sun Sep 14 03:06:50 2014
SUP Scan for mirror starting at Sun Sep 14 03:06:50 2014
SUP Scan for mirror completed at Sun Sep 14 03:09:09 2014




Updating file list:
-rw-rw-r--  1 srcmastr  netbsd  43030493 Sep 14 03:17 ls-lRA.gz