daily CVS update output

2020-01-07 Thread NetBSD source update


Updating src tree:
P src/external/mit/xorg/bin/xtrap/Makefile.xtrap
P src/external/mit/xorg/lib/libXaw/Makefile
P src/external/mit/xorg/lib/libXaw6/Makefile
P src/sys/arch/arm/arm/arm_machdep.c
P src/sys/arch/arm/fdt/files.fdt
P src/sys/arch/arm/fdt/pcihost_fdt.c
P src/sys/arch/arm/nvidia/tegra_pcie.c
P src/sys/arch/arm/sunxi/sunxi_intc.c
P src/sys/arch/arm/sunxi/sunxi_nmi.c
P src/sys/arch/hppa/include/mutex.h
P src/sys/arch/sh3/include/userret.h
P src/sys/arch/x86/x86/pmap.c
P src/sys/arch/xen/x86/xen_pmap.c
P src/sys/compat/netbsd32/netbsd32_vm.c
P src/sys/dev/ic/mfi.c
P src/sys/dev/raidframe/rf_compat32.c
P src/sys/dev/usb/if_aue.c
P src/sys/dev/usb/if_axe.c
P src/sys/dev/usb/if_axen.c
P src/sys/dev/usb/if_cdce.c
P src/sys/dev/usb/if_cue.c
P src/sys/dev/usb/if_kue.c
P src/sys/dev/usb/if_mos.c
P src/sys/dev/usb/if_mue.c
P src/sys/dev/usb/if_smsc.c
P src/sys/dev/usb/if_udav.c
P src/sys/dev/usb/if_upl.c
P src/sys/dev/usb/if_ure.c
P src/sys/dev/usb/if_url.c
P src/sys/dev/usb/if_urndis.c
P src/sys/dev/usb/u3g.c
P src/sys/dev/usb/uark.c
P src/sys/dev/usb/ubsa.c
P src/sys/dev/usb/uchcom.c
P src/sys/dev/usb/uftdi.c
P src/sys/dev/usb/uhub.c
P src/sys/dev/usb/uipad.c
P src/sys/dev/usb/uipaq.c
P src/sys/dev/usb/ukyopon.c
P src/sys/dev/usb/umcs.c
P src/sys/dev/usb/umct.c
P src/sys/dev/usb/umodem.c
P src/sys/dev/usb/uplcom.c
P src/sys/dev/usb/usbnet.h
P src/sys/dev/usb/uvisor.c
P src/sys/external/bsd/drm2/dist/drm/radeon/radeon_atombios_dp.c
P src/sys/kern/kern_mutex.c
P src/sys/kern/tty.c
P src/sys/netsmb/smb_smb.c
P src/usr.bin/make/util.c
P src/usr.bin/make/unit-tests/escape.mk
P src/usr.bin/make/unit-tests/modorder.mk

Updating xsrc tree:


Killing core files:



Updating release-7 src tree (netbsd-7):

Updating release-7 xsrc tree (netbsd-7):



Updating release-8 src tree (netbsd-8):
U doc/CHANGES-8.2
P sys/kern/kern_ksyms.c

Updating release-8 xsrc tree (netbsd-8):




Updating file list:
-rw-rw-r--  1 srcmastr  netbsd  35392280 Jan  8 03:11 ls-lRA.gz


Re: panic on zpool create

2020-01-07 Thread Brian Buhrow
hello David.  I wonder if you're running into another manifestation
of kern/54724, which shows zfs corrupting kernel memory in NetBSD-9.  I
show the problem having to do with xen, but I now believe the problem is
entirely with zfs and it was only coincidental that I ran into it with xen.
I left the machine in question, running zfs, for a month, doing  pretty
much nothing, but when I came back to it, I got weird messages like, proc
table is full.  There were a lot of process running, but I couldn't
determine which one was the one that was "really" stuck, since the number
of commands I could run was extremely limited.

-thanks
-Brian

On Jan 8,  1:07am, David Brownlee wrote:
} Subject: Re: panic on zpool create
} On Tue, 7 Jan 2020 at 12:48, David Brownlee  wrote:
} >
} > On Tue, 7 Jan 2020 at 10:07, J. Hannken-Illjes  
wrote:
} > >
} > > For some reason locking the directory we want to mount on crashes.
} > >
} > > Anything special with the root on this machine?
} > >
} > > Does the directory (/angus_media I suppose) exist?
} >
} > /angus_media was present after the initial panic, removing it did not
} > seem to help.
} >
} > Ran a fsck -fyP on the system which picked up some issues, rebooted
} > and then tried again but the system still panics on zpool import (I'd
} > sent a follow up email noting that it appears the create is working
} > fine but the import panics, based on being able to create the pool on
} > another box running the same OS, but that pool then panics on import
} > on this box).
} >
} > The system is a Dell T320 with three filesystem on an LSI mfii0
} > controller, but I'm testing ZFS as the only two disk on the onboard
} > AHCI
} >
} > I'm going to try a current kernel in a bit (its a server and gateway
} > box so I'm trying to minimise downtime :)
} 
} O-Kaayyy... I have another data point.
} 
} The panic does not occur in single user, but does when the system is running.
} 
} Given this box takes approximately four minutes to complete POST to
} the point it actually starts to boot anything, and has the primary
} copy of quite a lot of data I value, my testing tomorrow is likely to
} be a little painful...
} 
} Will update when I know more :)
} 
} David
>-- End of excerpt from David Brownlee




Failure with ``build.sh -V USE_PIGZGZIP=yes install-image''

2020-01-07 Thread Paul Goyette

On Mon, 6 Jan 2020, Paul Goyette wrote:

Failure to build install-image - sources current as of 2020-01-06 at 13:03:23 
UTC


Creating rootfs...
chmod +r work/var/spool/ftp/hidden
/build/netbsd-local/tools/x86_64/amd64/bin/nbmakefs -M 1519386624 -m 
1519386624 -B 1234 -F 
work.spec -N work/etc -o bsize=16384,fsize=2048,density=8192 
work.rootfs work

nbmakefs: `work' size of 1994735616 is larger than the maxsize of 1519386624.

*** Failed target:  imgroot.fs


This seems to be a repeatable failure.  Running without specifying
USE_PIGZGZIP works correctly.



++--+---+
| Paul Goyette   | PGP Key fingerprint: | E-mail addresses: |
| (Retired)  | FA29 0E3B 35AF E8AE 6651 | p...@whooppee.com |
| Software Developer | 0786 F758 55DE 53BA 7731 | pgoye...@netbsd.org   |
++--+---+


Re: "route_enqueue: queue full, dropped message" blast from a 8.99.32 amd64 domU

2020-01-07 Thread Greg A. Woods
At Fri, 10 May 2019 13:02:05 -0700, "Greg A. Woods"  wrote:
Subject: Re: "route_enqueue: queue full, dropped message" blast from a 8.99.32 
amd64 domU
>
> During normal operation I see just over 8000/s (and with the console
> spewing there are only about 200/s more), and it all seems to add up
> since this is an 8-vcpu domU, and I have HZ=1000.  Before I rebooted I
> was seeing numbers ranging up to 25000/s.

So this happened again today.

However I didn't manage to run "systat vm" and see if there was an
interrupt storm again, and if so what device it was from.

But neither did I have to reboot.

It lasted long enough and strong enough to kill all my X11 connections,
though by the time emacs was dumping core (as it always does when it
loses touch with the X11 server), NFS was running well enough to store
the core files.

I was in the midst of trying to login on the console to investigate
further when things seemed to have returned to normal.  The console
isn't fully usable though -- I see output sent to it, but I can't seem
to send anything to the getty running on it.  I'm not yet sure if this
is a conserver, xen, or NetBSD problem.

This time there were occasional "xennet0: xennet_watchdog" messages too.

There's one other possibly related symptom -- sometimes X11 clients
running on this host seem to have trouble painting characters,
particularly "links" (but it has no trouble displaying images or
scrolling images).  Unfortunately I have not thought to check for a
slightly less voracious interrupt storm while this symptom is present,
but hopefully I will remember to do so next time it appears.  Until now
I've thought this might be some kind of problem with volumes of network
traffic interfering with the font server, though I've not observed any
truly excessive network traffic.

--
Greg A. Woods 

Kelowna, BC +1 250 762-7675   RoboHack 
Planix, Inc.  Avoncote Farms 


pgpewpFianOts.pgp
Description: OpenPGP Digital Signature


Re: panic on zpool create

2020-01-07 Thread David Brownlee
On Tue, 7 Jan 2020 at 12:48, David Brownlee  wrote:
>
> On Tue, 7 Jan 2020 at 10:07, J. Hannken-Illjes  
> wrote:
> >
> > For some reason locking the directory we want to mount on crashes.
> >
> > Anything special with the root on this machine?
> >
> > Does the directory (/angus_media I suppose) exist?
>
> /angus_media was present after the initial panic, removing it did not
> seem to help.
>
> Ran a fsck -fyP on the system which picked up some issues, rebooted
> and then tried again but the system still panics on zpool import (I'd
> sent a follow up email noting that it appears the create is working
> fine but the import panics, based on being able to create the pool on
> another box running the same OS, but that pool then panics on import
> on this box).
>
> The system is a Dell T320 with three filesystem on an LSI mfii0
> controller, but I'm testing ZFS as the only two disk on the onboard
> AHCI
>
> I'm going to try a current kernel in a bit (its a server and gateway
> box so I'm trying to minimise downtime :)

O-Kaayyy... I have another data point.

The panic does not occur in single user, but does when the system is running.

Given this box takes approximately four minutes to complete POST to
the point it actually starts to boot anything, and has the primary
copy of quite a lot of data I value, my testing tomorrow is likely to
be a little painful...

Will update when I know more :)

David


Re: pmap lock changes: Xen panic

2020-01-07 Thread Andrew Doran
Manuel,

On Tue, Jan 07, 2020 at 10:39:33AM +0100, Manuel Bouyer wrote:

> Hello,
> with 2020-01-05 00:40 UTC sources, Xen domUs panics because of what looks like
> locking changes in the pmap code (full log at
> http://www-soc.lip6.fr/~bouyer/NetBSD-tests/xen/HEAD/):
> mlock error: Mutex: mutex_vector_enter,506: assertion failed: !cpu_intr_p(): 
> lock 0xc0b46300 cpu 0 lwp 0xc1aac100
> [  39.3701414] cpu0: Begin traceback...
> [  39.3701414] 
> vpanic(c05b72e8,d7c7c728,d7c7c744,c041c699,c05b72e8,c05b04e0,c05746b4,1fa,c05b0454,c0b46300)
>  at netbsd:vpanic+0x134
> [  39.3701414] 
> panic(c05b72e8,c05b04e0,c05746b4,1fa,c05b0454,c0b46300,0,c1aac100,d7c7c768,c03d2ec9)
>  at netbsd:panic+0x18
> [  39.3701414] 
> lockdebug_abort(c05746b4,1fa,c0b46300,c0b414c8,c05b0454,c0b46240,c0b46300,d7c7c794,c03d3405,c05b0454)
>  at netbsd:lockdebug_abort+0xc9
> [  39.3701414] 
> mutex_abort(c05b0454,0,d39e7004,79339,c0b37340,c1aac100,c0b46240,c0b46300,c1a5e1e8,d7c7c7dc)
>  at netbsd:mutex_abort+0x39
> [  39.3701414] 
> mutex_vector_enter(c0b46300,d7c7c7b8,d7c7c7b4,c03cc135,86908857,d7c7a024,c0b37340,c1aac100,d7c7c7d4,c011924f)
>  at netbsd:mutex_vector_enter+0x355
> [  39.3701414] 
> pmap_extract_ma(c0b46240,d6db3000,d7c7c820,0,c1733508,6,d7e7e000,c1a5e1e0,6,c1a5e008)
>  at netbsd:pmap_extract_ma+0x1a
> [  39.3701414] 
> xbd_diskstart(c189e908,c2672e2c,1c0,d7c7c884,c011d25e,c0b35880,c1a5e018,32b0,c1a5e008,)
>  at netbsd:xbd_diskstart+0x234
> [  39.3701414] 
> dk_start(c1a5e008,0,c23fc4dc,23f1000,0,1,c2670558,c1a5e1e0,6,d7e7e000) at 
> netbsd:dk_start+0xea
> [  39.3701414] 
> xbd_handler(c1a5e008,6,d7c7c978,c18a6318,d7c7c924,c011cc99,c18a6318,d7c7c978,d7c7ca3c,c04a1a7d)
>  at netbsd:xbd_handler+0x12e
> [  39.3701414] 
> xen_intr_biglock_wrapper(c18a6318,d7c7c978,d7c7ca3c,c04a1a7d,c1cba008,23f,0,d7c7c9ec,d7c7ca10,c0c589b8)
>  at netbsd:xen_intr_biglock_wrapper+0x1f
> [  39.3701414] 
> evtchn_do_event(6,d7c7c978,c0e670fc,c0e62548,c0e68494,0,c0b37340,c0d9a000,c0b37340,0)
>  at netbsd:evtchn_do_event+0xf9
> [  39.3701414] 
> do_hypervisor_callback(d7c7c978,0,11,31,11,c0b40011,0,0,d7c7ca44,bfec0d70) at 
> netbsd:do_hypervisor_callback+0x15f
> [  39.3701414] 
> Xhypervisor_pvhvm_callback(c0b46240,d81ae000,696f3000,1,1ef3000,0,1,21,7ff0,21)
>  at netbsd:Xhypervisor_pvhvm_callback+0x67
> 
> 
> In rev 1.35 of xen/x86/xen_pmap.c the pmap lock is taken unconditionally,
> even in the pmap_kernel() case, which means pmap_extract_ma() can't be
> used from interrrupt context any more. I don't think we can impose such
> restrictions on pmap_kernel(); bus_dma(9) needs it.

This was a mistake on my part.  We should never need to lock pmap_kernel()
for pmap_extract() since it only touches the PTEs.  It should be fixed now
with xen_pmap.c 1.37.

Cheers,
Andrew


Re: panic on zpool create

2020-01-07 Thread David Brownlee
On Tue, 7 Jan 2020 at 10:07, J. Hannken-Illjes  wrote:
>
> For some reason locking the directory we want to mount on crashes.
>
> Anything special with the root on this machine?
>
> Does the directory (/angus_media I suppose) exist?

/angus_media was present after the initial panic, removing it did not
seem to help.

Ran a fsck -fyP on the system which picked up some issues, rebooted
and then tried again but the system still panics on zpool import (I'd
sent a follow up email noting that it appears the create is working
fine but the import panics, based on being able to create the pool on
another box running the same OS, but that pool then panics on import
on this box).

The system is a Dell T320 with three filesystem on an LSI mfii0
controller, but I'm testing ZFS as the only two disk on the onboard
AHCI

I'm going to try a current kernel in a bit (its a server and gateway
box so I'm trying to minimise downtime :)

Thanks


Re: panic on zpool create

2020-01-07 Thread J. Hannken-Illjes
> On 6. Jan 2020, at 15:04, David Brownlee  wrote:
> 
> I've just tried to create a zfs pool and had a panic (tried twice, was
> gifted a kernel core each time). This is on latest NetBSD-9.0_RC1 from
> nyftp (Thu Jan  2 10:02:26 UTC 2020)
> 
> The command were "zpool create angus_media wd0" and "zpool create -f
> angus_media wd0 wd1"
> 
> Moved disks to another machine (On which I'd used zfs before), on
> which the latter command completed fine (modulus wd1 and wd2 as
> different device numbers).
> 
> Disks were 6TB with any labels/gpt blanked.
> 
> Bit of a puzzler...
> 
> crash reports:
> 
> _KERNEL_OPT_NARCNET() at 0
> ?() at a9826c8f4000
> vpanic() at vpanic+0x169
> snprintf() at snprintf
> startlwp() at startlwp
> calltrap() at calltrap+0x11
> fstrans_start() at fstrans_start+0x64
> VOP_LOCK() at VOP_LOCK+0x52
> vn_lock() at vn_lock+0x11
> secmodel_extensions_system_cb() at secmodel_extensions_system_cb+0x70
> kauth_authorize_action() at kauth_authorize_action+0xaa
> kauth_authorize_system() at kauth_authorize_system+0x28
> zfs_mount() at zfs_mount+0xcf
> VFS_MOUNT() at VFS_MOUNT+0x4d
> mount_domount() at mount_domount+0xdf
> do_sys_mount() at do_sys_mount+0x580
> sys___mount50() at sys___mount50+0x33
> syscall() at syscall+0x157
> --- syscall (number 410) ---
> 7d6364687aba:

For some reason locking the directory we want to mount on crashes.

Anything special with the root on this machine?

Does the directory (/angus_media I suppose) exist?

--
J. Hannken-Illjes - hann...@eis.cs.tu-bs.de - TU Braunschweig (Germany)


pmap lock changes: Xen panic

2020-01-07 Thread Manuel Bouyer
Hello,
with 2020-01-05 00:40 UTC sources, Xen domUs panics because of what looks like
locking changes in the pmap code (full log at
http://www-soc.lip6.fr/~bouyer/NetBSD-tests/xen/HEAD/):
mlock error: Mutex: mutex_vector_enter,506: assertion failed: !cpu_intr_p(): 
lock 0xc0b46300 cpu 0 lwp 0xc1aac100
[  39.3701414] cpu0: Begin traceback...
[  39.3701414] 
vpanic(c05b72e8,d7c7c728,d7c7c744,c041c699,c05b72e8,c05b04e0,c05746b4,1fa,c05b0454,c0b46300)
 at netbsd:vpanic+0x134
[  39.3701414] 
panic(c05b72e8,c05b04e0,c05746b4,1fa,c05b0454,c0b46300,0,c1aac100,d7c7c768,c03d2ec9)
 at netbsd:panic+0x18
[  39.3701414] 
lockdebug_abort(c05746b4,1fa,c0b46300,c0b414c8,c05b0454,c0b46240,c0b46300,d7c7c794,c03d3405,c05b0454)
 at netbsd:lockdebug_abort+0xc9
[  39.3701414] 
mutex_abort(c05b0454,0,d39e7004,79339,c0b37340,c1aac100,c0b46240,c0b46300,c1a5e1e8,d7c7c7dc)
 at netbsd:mutex_abort+0x39
[  39.3701414] 
mutex_vector_enter(c0b46300,d7c7c7b8,d7c7c7b4,c03cc135,86908857,d7c7a024,c0b37340,c1aac100,d7c7c7d4,c011924f)
 at netbsd:mutex_vector_enter+0x355
[  39.3701414] 
pmap_extract_ma(c0b46240,d6db3000,d7c7c820,0,c1733508,6,d7e7e000,c1a5e1e0,6,c1a5e008)
 at netbsd:pmap_extract_ma+0x1a
[  39.3701414] 
xbd_diskstart(c189e908,c2672e2c,1c0,d7c7c884,c011d25e,c0b35880,c1a5e018,32b0,c1a5e008,)
 at netbsd:xbd_diskstart+0x234
[  39.3701414] 
dk_start(c1a5e008,0,c23fc4dc,23f1000,0,1,c2670558,c1a5e1e0,6,d7e7e000) at 
netbsd:dk_start+0xea
[  39.3701414] 
xbd_handler(c1a5e008,6,d7c7c978,c18a6318,d7c7c924,c011cc99,c18a6318,d7c7c978,d7c7ca3c,c04a1a7d)
 at netbsd:xbd_handler+0x12e
[  39.3701414] 
xen_intr_biglock_wrapper(c18a6318,d7c7c978,d7c7ca3c,c04a1a7d,c1cba008,23f,0,d7c7c9ec,d7c7ca10,c0c589b8)
 at netbsd:xen_intr_biglock_wrapper+0x1f
[  39.3701414] 
evtchn_do_event(6,d7c7c978,c0e670fc,c0e62548,c0e68494,0,c0b37340,c0d9a000,c0b37340,0)
 at netbsd:evtchn_do_event+0xf9
[  39.3701414] 
do_hypervisor_callback(d7c7c978,0,11,31,11,c0b40011,0,0,d7c7ca44,bfec0d70) at 
netbsd:do_hypervisor_callback+0x15f
[  39.3701414] 
Xhypervisor_pvhvm_callback(c0b46240,d81ae000,696f3000,1,1ef3000,0,1,21,7ff0,21) 
at netbsd:Xhypervisor_pvhvm_callback+0x67


In rev 1.35 of xen/x86/xen_pmap.c the pmap lock is taken unconditionally,
even in the pmap_kernel() case, which means pmap_extract_ma() can't be
used from interrrupt context any more. I don't think we can impose such
restrictions on pmap_kernel(); bus_dma(9) needs it.

-- 
Manuel Bouyer 
 NetBSD: 26 ans d'experience feront toujours la difference
--