Re: Missing file after postisnstall?

2018-02-12 Thread Paul Goyette

On Tue, 13 Feb 2018, Paul Goyette wrote:


On Tue, 13 Feb 2018, Valery Ushakov wrote:


On Tue, Feb 13, 2018 at 05:40:22 +0800, Paul Goyette wrote:

I recently updated from 8.99.7 to 8.99.12 and noticed that my daily 
security

job reported a missing file:

Checking special files and directories.
./etc/rc.d/dhcpd6 missing

Shouldn't this have been found and fixed by postinstall?


Why it should be "fixed" if it ain't broken?  :) Your system still
happily boots without DHCP6 server, isn't it?  If you want to update
your configuration files to the new etc.tgz you run etcupdate.

postinstall only does minimal configuration tweaks that are necessary
to keep the system working with the new userland, or at least that was
its original design goal as I remember it.


I also ran `etcupdate -al -s etc.gz -s xetc.gz` and it does not find the 
missing rc.d file, either.


The missing file is listed in an mtree specification file that was 
installed as part of the upgrade, so the file should exist.  But there 
doesn't seem to be any "sample" file anywhere in /usr/share so nothing 
that can be copied.


Since the postinstall/etcupdate process has previously found other 
missing rc.d files, and successfully fixed/installed them, I still 
consider this to be a bug in -current.  The mtree file says the file 
should exist, but it doesn't.




+--+--+----+
| Paul Goyette | PGP Key fingerprint: | E-mail addresses:  |
| (Retired)| FA29 0E3B 35AF E8AE 6651 | paul at whooppee dot com   |
| Kernel Developer | 0786 F758 55DE 53BA 7731 | pgoyette at netbsd dot org |
+--+--++


Missing file after postisnstall?

2018-02-12 Thread Paul Goyette
I recently updated from 8.99.7 to 8.99.12 and noticed that my daily 
security job reported a missing file:


Checking special files and directories.
./etc/rc.d/dhcpd6 missing

Shouldn't this have been found and fixed by postinstall?

+--+--++
| Paul Goyette | PGP Key fingerprint: | E-mail addresses:  |
| (Retired)| FA29 0E3B 35AF E8AE 6651 | paul at whooppee dot com   |
| Kernel Developer | 0786 F758 55DE 53BA 7731 | pgoyette at netbsd dot org |
+--+--++



Re: Status of 8.99.12

2018-02-12 Thread Paul Goyette

On Mon, 12 Feb 2018, Thor Lancelot Simon wrote:


On Mon, Feb 12, 2018 at 08:48:32AM +0800, Paul Goyette wrote:


1. Starting the gnucash program (from pkgsrc finance/gnucash) now takes
   about 3 times as long as before.  Even after successfully loading
   the image (to get libraries etc into the file system cache) it take
   more than three full minutes for the program to initialize.


It previously took 1 full minute?


Yes, at least on the initial load (before caching everything).  On 
subsequent loads it would take at least 20-30 seconds to start.




How does it look without LOCKDEBUG?


I would have to build a new kernel and check.  Do you thing it is truly 
relevant?  Would there be a lot of locking/contention occurring during 
the program start-up phase?


FWIW, someone also suggested that the "3-seconds needed to unmount a 
nullfs" problem could also be affected by LOCKDEBUG.  Interestingly, 
this problem does not seem to persist.  Some time after the system is 
booted, and some time after all the nullfs mounts are completed, the 
unmount process completes as expected;  all 20+ umount complete within 
total of 3-4 seconds, rather than 3-6 seconds each.  I'll try to 
characterize the "some time after" details later, after resolving the 
bigger outstanding issues.





+--+--+----+
| Paul Goyette | PGP Key fingerprint: | E-mail addresses:  |
| (Retired)| FA29 0E3B 35AF E8AE 6651 | paul at whooppee dot com   |
| Kernel Developer | 0786 F758 55DE 53BA 7731 | pgoyette at netbsd dot org |
+--+--++


Re: Missing file after postisnstall?

2018-02-12 Thread Paul Goyette

On Tue, 13 Feb 2018, Valery Ushakov wrote:


On Tue, Feb 13, 2018 at 05:40:22 +0800, Paul Goyette wrote:


I recently updated from 8.99.7 to 8.99.12 and noticed that my daily security
job reported a missing file:

Checking special files and directories.
./etc/rc.d/dhcpd6 missing

Shouldn't this have been found and fixed by postinstall?


Why it should be "fixed" if it ain't broken?  :) Your system still
happily boots without DHCP6 server, isn't it?  If you want to update
your configuration files to the new etc.tgz you run etcupdate.

postinstall only does minimal configuration tweaks that are necessary
to keep the system working with the new userland, or at least that was
its original design goal as I remember it.


I also ran `etcupdate -al -s etc.gz -s xetc.gz` and it does not find 
the missing rc.d file, either.




+--+--+----+
| Paul Goyette | PGP Key fingerprint: | E-mail addresses:  |
| (Retired)| FA29 0E3B 35AF E8AE 6651 | paul at whooppee dot com   |
| Kernel Developer | 0786 F758 55DE 53BA 7731 | pgoyette at netbsd dot org |
+--+--++


Status of 8.99.12

2018-02-11 Thread Paul Goyette
After an extended period of build breaks, I finally got a new release 
built from sources updated on 2018-02-10 at 04:02:43 UTC


I'm seeing several problems with this release that were not seen with my 
previous installation (from last November).


1. Starting the gnucash program (from pkgsrc finance/gnucash) now takes
   about 3 times as long as before.  Even after successfully loading
   the image (to get libraries etc into the file system cache) it take
   more than three full minutes for the program to initialize.

2. Whenever I try to shutdown the system, I get a networking-related
   panic.  The following is manually transcribed:

trap type 4 code 0 rip 0x802d3f75 cs 0x8 rflags 0x10282
  cr2 0x77e0e931c020 ilevel 0x4 rsp 0x80090a7e3c80
curlwp 0xe4afbb6e8700 pid 926.1 lowest kstack
  0x80090a7e0c20
kernel: protection fault trap, code = 0
stopped in 926.1 (avahi-daemon) at ip_setmoptions+0x237: movq
  360(%rax),%rdi
traceback:
ip_setmoptions + 0x237
ip_rtloutput + 0x218
udp_ctloutput + 0x82
udp_ctloutput_wrapper + 0x2c
sosetopt + 0x67
sys_setsockopt + 0x91
syscall + 0x1ed (syscall #105)

3. After getting the above, as soon as I type a single character as
   command input to ddb(4), I get a LOCKDEBUG panic.  I didn't yet
   transcribe the 40+ lines of output, but the backtrace clearly
   includes a couple entries from the xhci (USB-3) driver.

4. While the system is running, I have noticed that un-mounting nullfs
   mounts is very slow.  Using mksandbox (from pkgsrc), I create a
   sandbox with about 22 null mounts.  Creating/mounting is no problem,
   and everything runs as expected.  However, when unmounting these
   nullfs, each one takes between 3 and 6 wall-seconds, during which
   the umount process is running at 100% of one CPU.  Additionally,
   some of these umounts seem to grab the CPU with interrupts disabled,
   resulting in total stall of the machine for the duration (and, in X,
   cursor movement stalls/gets "jerky").  All the unmounts eventually
   complete successflly.




+--+--+----+
| Paul Goyette | PGP Key fingerprint: | E-mail addresses:  |
| (Retired)| FA29 0E3B 35AF E8AE 6651 | paul at whooppee dot com   |
| Kernel Developer | 0786 F758 55DE 53BA 7731 | pgoyette at netbsd dot org |
+--+--++


Re: Status of 8.99.12

2018-02-11 Thread Paul Goyette

On Mon, 12 Feb 2018, Ryota Ozaki wrote:


2. Whenever I try to shutdown the system, I get a networking-related
   panic.  The following is manually transcribed:

trap type 4 code 0 rip 0x802d3f75 cs 0x8 rflags 0x10282
  cr2 0x77e0e931c020 ilevel 0x4 rsp 0x80090a7e3c80
curlwp 0xe4afbb6e8700 pid 926.1 lowest kstack
  0x80090a7e0c20
kernel: protection fault trap, code = 0
stopped in 926.1 (avahi-daemon) at ip_setmoptions+0x237: movq
  360(%rax),%rdi
traceback:
ip_setmoptions + 0x237
ip_rtloutput + 0x218
udp_ctloutput + 0x82
udp_ctloutput_wrapper + 0x2c
sosetopt + 0x67
sys_setsockopt + 0x91
syscall + 0x1ed (syscall #105)


Is the panic fixed by the following patch?

diff --git a/sys/netinet/ip_output.c b/sys/netinet/ip_output.c
index 44d8032f387..2e5e346af91 100644
--- a/sys/netinet/ip_output.c
+++ b/sys/netinet/ip_output.c
@@ -1927,9 +1927,13 @@ ip_drop_membership(struct ip_moptions *imo,
const struct sockopt *sopt)
* Give up the multicast address record to which the
* membership points.
*/
-   IFNET_LOCK(imo->imo_membership[i]->inm_ifp);
+{
+   struct ifnet *inm_ifp = imo->imo_membership[i]->inm_ifp;
+   IFNET_LOCK(inm_ifp);
   in_delmulti(imo->imo_membership[i]);
-   IFNET_UNLOCK(imo->imo_membership[i]->inm_ifp);
+   /* ifp should not leave thanks to solock */
+   IFNET_UNLOCK(inm_ifp);
+}

   /*
* Remove the gap in the membership array.



Yes it appears to address the problem.  Without this patch, the above 
crash was 100% reproducible (5 out of 5).  With this patch applied (and 
no other changes) I have had 3 consecutive reboots without and problem!


Thanks for the quick turn-around.



+--+--+--------+
| Paul Goyette | PGP Key fingerprint: | E-mail addresses:  |
| (Retired)| FA29 0E3B 35AF E8AE 6651 | paul at whooppee dot com   |
| Kernel Developer | 0786 F758 55DE 53BA 7731 | pgoyette at netbsd dot org |
+--+--++


Re: Status of 8.99.12

2018-02-11 Thread Paul Goyette

On Mon, 12 Feb 2018, Paul Goyette wrote:


2. Whenever I try to shutdown the system, I get a networking-related
  panic.  The following is manually transcribed:

trap type 4 code 0 rip 0x802d3f75 cs 0x8 rflags 0x10282
  cr2 0x77e0e931c020 ilevel 0x4 rsp 0x80090a7e3c80
curlwp 0xe4afbb6e8700 pid 926.1 lowest kstack
  0x80090a7e0c20
kernel: protection fault trap, code = 0
stopped in 926.1 (avahi-daemon) at ip_setmoptions+0x237: movq
  360(%rax),%rdi
traceback:
ip_setmoptions + 0x237
ip_rtloutput + 0x218
udp_ctloutput + 0x82
udp_ctloutput_wrapper + 0x2c
sosetopt + 0x67
sys_setsockopt + 0x91
syscall + 0x1ed (syscall #105)


This appears to be fixed by a patch provided by ozakir@


3. After getting the above, as soon as I type a single character as
  command input to ddb(4), I get a LOCKDEBUG panic.  I didn't yet
  transcribe the 40+ lines of output, but the backtrace clearly
  includes a couple entries from the xhci (USB-3) driver.


Here's the console output from the LOCKDEBUG panic - all transcribed by 
hand, but hopefully without too many typos!


Mutex error: mutex_vector_enter,523: spin lock held
lock address: 0xe410e9d1d9a0   type: spin
initialized:  0x802bac06
shared holds:  0   exclusive:  1
shares wanted: 0   exclusive:  0
current CPU:  11   last held: 11
curlwp:   0xe41fc09ad2c0   last held: 0xe41fc09ad2c0
last locked*: 0x802b81de   unlocked:  0x80291179
owner field:  0x00010600   wait/spin:0/1
panic: LOCKDEBUG: Mutex error: mutex_vector_enter,523: spin lock held

And the backtrace is

vpanic+0x140
snprintf
lockdebug_more
mutex_enter+0x69d
xhci_device_intr_start+0x125
usbd_start_next+0x65
xhci_soft_intr+0x49b
xhci_poll+0x37
ukbd_cngetc+0x19
cngetc+0x34
db_readline+0x65
db_read_line+0x15
db_command_loop+0x84
db_trap+0xe3
kbd_trap+0xe2
trap (number 4)

followed by the original backtrace (see above, starting with 
ip_setmoptions).







4. While the system is running, I have noticed that un-mounting nullfs
  mounts is very slow.  Using mksandbox (from pkgsrc), I create a
  sandbox with about 22 null mounts.  Creating/mounting is no problem,
  and everything runs as expected.  However, when unmounting these
  nullfs, each one takes between 3 and 6 wall-seconds, during which
  the umount process is running at 100% of one CPU.  Additionally,
  some of these umounts seem to grab the CPU with interrupts disabled,
  resulting in total stall of the machine for the duration (and, in X,
  cursor movement stalls/gets "jerky").  All the unmounts eventually
  complete successflly.




+--+--+----+
| Paul Goyette | PGP Key fingerprint: | E-mail addresses:  |
| (Retired)| FA29 0E3B 35AF E8AE 6651 | paul at whooppee dot com   |
| Kernel Developer | 0786 F758 55DE 53BA 7731 | pgoyette at netbsd dot org |
+--+--++



+--+--+----+
| Paul Goyette | PGP Key fingerprint: | E-mail addresses:  |
| (Retired)| FA29 0E3B 35AF E8AE 6651 | paul at whooppee dot com   |
| Kernel Developer | 0786 F758 55DE 53BA 7731 | pgoyette at netbsd dot org |
+--+--++


Re: Automated report: NetBSD-current/i386 build failure

2018-02-10 Thread Paul Goyette

On Sat, 10 Feb 2018, Andreas Gustafsson wrote:


Christos Zoulas wrote:

|   i486--netbsdelf-install: CA.pl: stat: No such file or directory
|   *** 
[/tmp/bracket/build/2018.02.09.17.14.26-i386/destdir/usr/share/examples/openssl/CA.pl]
 Error code 1

Looking into it.


The build now gets past that point, but later fails at:

--- kern-XEN3_DOM0 ---
/tmp/bracket/build/2018.02.10.06.22.22-i386/src/sys/arch/i386/i386/db_interface.c:198:12:
 error: unused variable 'dbreg' [-Werror=unused-variable]
 db_regs_t dbreg;
   ^


This was already fixed - thanks to Christos@



+--+--++
| Paul Goyette | PGP Key fingerprint: | E-mail addresses:  |
| (Retired)| FA29 0E3B 35AF E8AE 6651 | paul at whooppee dot com   |
| Kernel Developer | 0786 F758 55DE 53BA 7731 | pgoyette at netbsd dot org |
+--+--++


Re: 'drvctl -d' problem

2018-02-20 Thread Paul Goyette

What happens if you cd to /dev first?  Or, specify

drvctl -d /dev/iwn0

There could be a problem with lookup of the device node...  It's 
happened before, although I thought it had been fixed.



On Wed, 21 Feb 2018, Makoto Fujiwara wrote:


I'm often use /sbin/drvctl, usually with -d device argument.
(the situation is after waked up from sleep).

Recently I'm getting syntax error and resulting help shown.
The good version is at 8.99.4, and bad example is at 8.99.12.
my 8.99.12 version is kept with the name drvctl-8.99.12.

Following is the log for above symptom.

cf-j10@makoto 11:16:56/180221(..sbin/drvctl)% sudo /sbin/drvctl-8.99.12 -d iwn0
Usage: drvctl-8.99.12 -r [-a attribute] busdevice [locator ...]
  drvctl-8.99.12 -d device
  drvctl-8.99.12 [-nt] -l [device]
  drvctl-8.99.12 [-n] -p device [property]
  drvctl-8.99.12 -Q device
  drvctl-8.99.12 -R device
  drvctl-8.99.12 -S device

cf-j10@makoto 11:17:12/180221(..sbin/drvctl)% sudo /sbin/drvctl  -d iwn0
cf-j10@makoto 11:17:29/180221(..sbin/drvctl)% sudo /sbin/drvctl  -r pci3
cf-j10@makoto 11:17:36/180221(..sbin/drvctl)%
cf-j10@makoto 11:17:51/180221(..sbin/drvctl)% ls -l /sbin/drvctl*
-r-xr-xr-x  1 root  wheel  19384 Oct 12 18:53 /sbin/drvctl
-r-xr-xr-x  1 root  wheel  19376 Jan 29 00:00 /sbin/drvctl-8.99.12

checked the commit log for drvctl.c, but I did not get the meaningfull
diffs. Please help me to fix the problem, thank you.
--
mak...@ki.nu
m...@netbsd.org
-
Makoto Fujiwara,
Chiba, Japan, Narita Airport and Disneyland prefecture.

!DSPAM:5a8cdcdb185711760256429!




+--+--++
| Paul Goyette | PGP Key fingerprint: | E-mail addresses:  |
| (Retired)| FA29 0E3B 35AF E8AE 6651 | paul at whooppee dot com   |
| Kernel Developer | 0786 F758 55DE 53BA 7731 | pgoyette at netbsd dot org |
+--+--++


dig(1) broken - openssl fallout?

2018-02-20 Thread Paul Goyette
It doesn't seem to matter what host I try to lookup, I get the following 
error messages:


# dig www.netbsd.org.
21-Feb-2018 08:01:50.585 ENGINE_by_id failed (crypto failure)
21-Feb-2018 08:01:50.585 error:25070067:DSO support routines:DSO_load:could not 
load the shared 
library:/build/netbsd-local/src_ro/crypto/external/bsd/openssl/dist/crypto/dso/dso_lib.c:161:
21-Feb-2018 08:01:50.585 error:260B6084:engine routines:dynamic_load:dso not 
found:/build/netbsd-local/src_ro/crypto/external/bsd/openssl/dist/crypto/engine/eng_dyn.c:414:
21-Feb-2018 08:01:50.585 error:2606A074:engine routines:ENGINE_by_id:no such 
engine:/build/netbsd-local/src_ro/crypto/external/bsd/openssl/dist/crypto/engine/eng_list.c:339:id=gost
dig: dst_lib_init: crypto failure
#

+--+--++
| Paul Goyette | PGP Key fingerprint: | E-mail addresses:  |
| (Retired)| FA29 0E3B 35AF E8AE 6651 | paul at whooppee dot com   |
| Kernel Developer | 0786 F758 55DE 53BA 7731 | pgoyette at netbsd dot org |
+--+--++


Re: i386 config w/tpm* at acpi? build fails.

2018-02-21 Thread Paul Goyette

Probably the same issue as what I ran into with dbcool(4) last week.

Look in files.acpi for a line that looks like

define acpi {}

and remove it (or comment it out).


On Wed, 21 Feb 2018, John D. Baker wrote:


Shortly after i386-current switched to GCC-6, my first (non-update) build
went fine.  Since then, something changed and now configs which include:

 tpm* at acpi?

fail with:

[...]
--- tpm_acpi.o ---
/x/current/src/sys/dev/acpi/tpm_acpi.c:88:27: error: 'tpm_acpi_ids' defined but 
not used [-Werror=unused-const-variable=]
static const char * const tpm_acpi_ids[] = {
  ^~~~
[...]


--
|/"\ John D. Baker, KN5UKS   NetBSD Darwin/MacOS X
|\ / jdbaker[snail]mylinuxisp[flyspeck]comOpenBSDFreeBSD
| X  No HTML/proprietary data in email.   BSD just sits there and works!
|/ \ GPGkeyID:  D703 4A7E 479F 63F8 D3F4  BD99 9572 8F23 E4AD 1645


!DSPAM:5a8e1a0450141313947420!




+--+--+----+
| Paul Goyette | PGP Key fingerprint: | E-mail addresses:  |
| (Retired)| FA29 0E3B 35AF E8AE 6651 | paul at whooppee dot com   |
| Kernel Developer | 0786 F758 55DE 53BA 7731 | pgoyette at netbsd dot org |
+--+--++


Re: i386 config w/tpm* at acpi? build fails.

2018-02-21 Thread Paul Goyette

Hmmm, that's not going to work!

First, the suggestion should have been to look for

define tpm {}

but I looked, and it's not there!  Instead, this looks more like an 
issue that mrg@ fixed with pmax's tc driver yesterday.  It has to do 
with the line that reads


attach tpm at acpinodebus with tpm_acpi

I'm not sure what the real, proper fix is for this one.




On Thu, 22 Feb 2018, Paul Goyette wrote:


Probably the same issue as what I ran into with dbcool(4) last week.

Look in files.acpi for a line that looks like

define acpi {}

and remove it (or comment it out).


On Wed, 21 Feb 2018, John D. Baker wrote:


Shortly after i386-current switched to GCC-6, my first (non-update) build
went fine.  Since then, something changed and now configs which include:

 tpm* at acpi?

fail with:

[...]
--- tpm_acpi.o ---
/x/current/src/sys/dev/acpi/tpm_acpi.c:88:27: error: 'tpm_acpi_ids' defined 
but not used [-Werror=unused-const-variable=]

static const char * const tpm_acpi_ids[] = {
  ^~~~
[...]


--
|/"\ John D. Baker, KN5UKS   NetBSD Darwin/MacOS X
|\ / jdbaker[snail]mylinuxisp[flyspeck]comOpenBSDFreeBSD
| X  No HTML/proprietary data in email.   BSD just sits there and works!
|/ \ GPGkeyID:  D703 4A7E 479F 63F8 D3F4  BD99 9572 8F23 E4AD 1645


!DSPAM:5a8e1a0450141313947420!




+--+--+----+
| Paul Goyette | PGP Key fingerprint: | E-mail addresses:  |
| (Retired)| FA29 0E3B 35AF E8AE 6651 | paul at whooppee dot com   |
| Kernel Developer | 0786 F758 55DE 53BA 7731 | pgoyette at netbsd dot org |
+--+--++



+--+--+----+
| Paul Goyette | PGP Key fingerprint: | E-mail addresses:  |
| (Retired)| FA29 0E3B 35AF E8AE 6651 | paul at whooppee dot com   |
| Kernel Developer | 0786 F758 55DE 53BA 7731 | pgoyette at netbsd dot org |
+--+--++


Re: Xen Domu kernel crash at start of boot

2018-06-21 Thread Paul Goyette

Stopped in pid 0.15 (system) at 80205968:   fxsavel


I seem to recall some recent changes related to save/restore of fpu 
state.




On Thu, 21 Jun 2018, Chuck Zmudzinski wrote:

I am getting a kernel crash almost immediately after booting the current 
kernel. I am running NetBSD/xen amd64 on a Debian Linux 8.10 DOM0 which uses 
Xen-4.4. Last week's kernel was good. I built a kernel from a cvs update a 
couple of days ago and tried it. It crashed. I tried the most recent daily 
snapshot available from NetBSD daily builds. It crashed too. Here is the 
information from the console about the daily snapshot kernel that crashed (it 
was built earlier today):


[   1.000] NetBSD 8.99.19 (XEN3_DOMU) #0: Thu Jun 21 11:48:05 UTC 2018
[   1.000] 
mkre...@mkrepro.netbsd.org:/usr/src/sys/arch/xen/compile/XEN3_DOMU


Here is the information I got from the console about the crash:

[   1.000] total memory = 3072 MB
[   1.000] avail memory = 2963 MB
[   1.000] cpu_rng: RDRAND
[   1.000] running cgd selftest aes-xts-256 aes-xts-512 done
[   1.000] mainbus0 (root)
[   1.000] hypervisor0 at mainbus0: Xen version 4.4.1
[   1.000] vcpu0 at hypervisor0
[   1.000] vcpu0: Intel(R) Core(TM) i5-4590S CPU @ 3.00GHz, id 0x306c3
[   1.000] vcpu0: package 0, core 3, smt 0
[   1.000] vcpu1 at hypervisor0
[   1.000] vcpu1: Intel(R) Core(TM) i5-4590S CPU @ 3.00GHz, id 0x306c3
[   1.000] vcpu1: package 0, core 3, smt 0
[   1.000] xenbus0 at hypervisor0: Xen Virtual Bus Interface
[   1.000] xencons0 at hypervisor0: Xen Virtual Console Driver
[   1.030] fatal protection fault in supervisor mode
[   1.030] trap type 4 code 0 rip 0x80205968 cs 0x1e030 
rflags 0x10046 cr2 0 ilevel 0 rsp 0xa000a570fbf0
[   1.030] curlwp 0xa642d4a0 pid 0.15 lowest kstack 
0xa000a570b2c0

kernel: protection fault trap, code=0
Stopped in pid 0.15 (system) at 80205968:   fxsavel
ds  0
es  0
fs  fd00
gs  0
rdi a000a570fbf8
rsi 1
rbp a000a570fe68
rbx 0
rdx 2
rcx 0
rax 0
r8  a668
r9  cccd
r10 64
r11 0
r12 a000a570fbf8
r13 1
r14 0
r15 0
rip 80205968
cs  e030
rflags  10046
rsp a000a570fbf0
ss  e02b
80205968:   fxsavel
db{1}> bt
?() at 80205968
?() at 802313f0
?() at 8023155e

I could not get any useful info from addr2line. Any ideas why it crashes?

Thanks,

Chuck Zmudzinski


!DSPAM:5b2c48e0136191148838969!




+--+--++
| Paul Goyette | PGP Key fingerprint: | E-mail addresses:  |
| (Retired)| FA29 0E3B 35AF E8AE 6651 | paul at whooppee dot com   |
| Kernel Developer | 0786 F758 55DE 53BA 7731 | pgoyette at netbsd dot org |
+--+--++

Re: Problem with shutting down the Xserver

2018-07-27 Thread Paul Goyette

On Fri, 27 Jul 2018, Martin Husemann wrote:


On Fri, Jul 27, 2018 at 08:26:12AM +0800, Paul Goyette wrote:

Has anyone else seen anything similar?  Any suggestions on how to debug this
further?


For debugging: make sure you have the debug and xdebug sets installed.
Make the system accessible via ssh. When the problem happens again:
from another machine ssh in, pgrep for the X server, use gdb -p to attach
to it and get a backtrace.


"ssh in" won't work - the machine is non-responsive to network.   :(

Also, unfortunately, the only "other machine" I have is a Windoze 
laptop;  I doubt that it would be a suitable place on which to run the 
"gdb -p" command.




+--+--+--------+
| Paul Goyette | PGP Key fingerprint: | E-mail addresses:  |
| (Retired)| FA29 0E3B 35AF E8AE 6651 | paul at whooppee dot com   |
| Kernel Developer | 0786 F758 55DE 53BA 7731 | pgoyette at netbsd dot org |
+--+--++


Re: Problem with shutting down the Xserver

2018-07-27 Thread Paul Goyette

On Fri, 27 Jul 2018, Martin Husemann wrote:


For debugging: make sure you have the debug and xdebug sets installed.
Make the system accessible via ssh. When the problem happens again:
from another machine ssh in, pgrep for the X server, use gdb -p to attach
to it and get a backtrace.


"ssh in" won't work - the machine is non-responsive to network.   :(

Also, unfortunately, the only "other machine" I have is a Windoze laptop;  I
doubt that it would be a suitable place on which to run the "gdb -p"
command.


Oh, so the kernel driver actually locks up?
Notebook or desktop? Next suggestion would be serial console and try
to enter ddb ...


The problem machine is a desktop.  It has a serial port.  Unfortunately 
the Windoze laptop does not, so nothing to which the other end of a 
serial cable could be attached.




+--+--+--------+
| Paul Goyette | PGP Key fingerprint: | E-mail addresses:  |
| (Retired)| FA29 0E3B 35AF E8AE 6651 | paul at whooppee dot com   |
| Kernel Developer | 0786 F758 55DE 53BA 7731 | pgoyette at netbsd dot org |
+--+--++


Problem with shutting down the Xserver

2018-07-26 Thread Paul Goyette
Sometime not so long ago, my old video card died and I had to get a new 
one.  Ever since then, when I "log out" the Xserver locks up my system.


Details:

1. This has been happening at least since the end of May, running a
   -current 6.99.18 kernel.  I've just completed an update to
   yesterday's 6.99.22 and the problem still existgs.

2. I'm using "native" X from xsrc, _not_ using pkgsrc.

3. I use xdm to manage my one display.

4. The problem occurs whether I'm using my personal account (which has
   fvwm window manager) or root (which uses the default twm).  So it
   is unlikely to be related to the window manager.

5. When I terminate my X session, the screen resets and then goes
   completely black and a simple underline cursor appears at the
   top-left corner.  A few seconds later the cursor disappears.  As
   far as I can tell, the video card is still generating a valid
   signal, as the monitor does not display its "No signal" warning.

6. The video card being used is a nVidia GeForce GTX 1050 Ti:

   dmesg:
vga0 at pci1 dev 0 function 0: vendor 10de product 1c82 (rev.
0xa1)
wsdisplay0 at vga0 kbdmux 1: console (80x25, vt100 emulation)

   lspci:
   +-03.0-[01]--+-00.0  NVIDIA Corporation GP107 [GeForce GTX 1050 Ti]
   |\-00.1  NVIDIA Corporation GP107GL High Definition 
Audio Controller

   From the above you can see that I'm using the basic vga driver,
   since the nouveau driver doesn't handle such a modern card.

7. As far as I can tell, the X server is using the vesa display driver,
   with the vbe sub-module:

...
[   162.803] (II) LoadModule: "vesa"
[   162.804] (II) Loading /usr/X11R7/lib/modules/drivers/vesa_drv.so
[   162.815] (II) Module vesa: vendor="X.Org Foundation"
[   162.815]compiled for 1.18.4, module version = 2.4.0
[   162.815]Module class: X.Org Video Driver
[   162.815]ABI class: X.Org Video Driver, version 20.0
...
[   162.857] (II) VESA: driver for VESA chipsets: vesa
[   162.857] (--) Using wscons driver on /dev/ttyE4 in pcvt 
compatibility mode (version 3.32)
[   162.857] (--) using VT number 5
[   162.886] (EE) [drm] Failed to open DRM device for pci:0001:01:00.0: 
-19
[   162.890] (EE) [drm] Failed to open DRM device for pci:0001:01:00.0: 
-19
[   162.890] (WW) VGA arbiter: cannot open kernel arbiter, no 
multi-card support
[   162.890] (II) Loading sub module "vbe"
[   162.890] (II) LoadModule: "vbe"
...
[   163.001] (II) VESA(0): Creating default Display subsection in 
Screen section
"Builtin Default vesa Screen 0" for depth/fbbpp 24/32
[   163.001] (==) VESA(0): Depth 24, (--) framebuffer bpp 32
[   163.001] (==) VESA(0): RGB weight 888
[   163.001] (==) VESA(0): Default visual is TrueColor
[   163.001] (==) VESA(0): Using gamma correction (1.0, 1.0, 1.0)
...

8. The problem occurs whether or not I execute a ``vesa on'' command
   in the boot loader.

9. The system doesn't reboot, and does not respond to network pings.
   It also does not seem to have entered DDB(4) since it appears to
   have (tried to) reset the video card and switch to vt0.  (I have
   ddb.onpanic set to "2" for some reason I cannot remember, and now
   I can't even find the docs on what "2" means!)


If I ``kill -KILL ...'' all of the processes related to xdm (including 
the X server itself), then the system does not hang, and I can log in on 
the console and manually execute ``/etc/rc.d/xdm start'' and everything 
is back to normal.  This would indicate to me that the X server is doing 
some sort of clean-up processing and that that is failing.


Has anyone else seen anything similar?  Any suggestions on how to debug 
this further?



+--+--++
| Paul Goyette | PGP Key fingerprint: | E-mail addresses:  |
| (Retired)| FA29 0E3B 35AF E8AE 6651 | paul at whooppee dot com   |
| Kernel Developer | 0786 F758 55DE 53BA 7731 | pgoyette at netbsd dot org |
+--+--++


tools build failure?

2018-08-15 Thread Paul Goyette

Anyone else seeing this?

#  link  mandoc/mandoc
cc -O -DOSNAME=\"NetBSD\ 8.99\" -DHAVE_CONFIG_H -I. 
-I/build/netbsd-local/tools/x86_64/amd64/include/compat 
-I/build/netbsd-local/src_ro/tools/compat -DHAVE_NBTOOL_CONFIG_H=1 -D_FILE_OFFSET_BITS=64 
  -o mandoc eqn_html.lo eqn_term.lo html.lo main.lo man_html.lo man_term.lo mandoc_xr.lo 
manpath.lo mdoc_html.lo mdoc_markdown.lo mdoc_term.lo out.lo roff_html.lo roff_term.lo 
tbl_html.lo tbl_term.lo term.lo term_ascii.lo term_ps.lo term_tab.lo tree.lo att.lo 
chars.lo compat_ohash.lo eqn.lo lib.lo man.lo man_macro.lo man_validate.lo mandoc.lo 
mandoc_aux.lo mandoc_ohash.lo mdoc.lo mdoc_argv.lo mdoc_macro.lo mdoc_man.lo 
mdoc_state.lo mdoc_validate.lo msec.lo preconv.lo read.lo roff.lo roff_validate.lo st.lo 
tag.lo tbl.lo tbl_data.lo tbl_layout.lo tbl_opts.lo compat_strtonum.lo 
compat_reallocarray.lo -L/build/netbsd-local/tools/x86_64/amd64/lib -lnbcompat -lrt -lz
mandoc_aux.lo: In function `mandoc_recallocarray':
mandoc_aux.c:(.text+0x13c): undefined reference to `recallocarray'
*** [mandoc] Error code 1

nbmake[3]: stopped in /build/netbsd-local/src_ro/tools/mandoc
1 error

It's repeatable at least on my system, and a cvs update shows no new 
changes...



+--+--+----+
| Paul Goyette | PGP Key fingerprint: | E-mail addresses:  |
| (Retired)| FA29 0E3B 35AF E8AE 6651 | paul at whooppee dot com   |
| Kernel Developer | 0786 F758 55DE 53BA 7731 | pgoyette at netbsd dot org |
+--+--++


Build break - kasan issue in rump

2018-08-20 Thread Paul Goyette

Always rump finds stuff!

This is on amd64, with sources updated as of 2018-08-20 at 22:44:31 UTC

#create  librump/kern_malloc.d
.
.
.
/build/netbsd-local/src_ro/lib/librump/../../sys/rump/../kern/kern_malloc.c:75:23:
 fatal error: opt_kasan.h: No such file or directory
 #include "opt_kasan.h"
   ^
compilation terminated.


+--+--+----+
| Paul Goyette | PGP Key fingerprint: | E-mail addresses:  |
| (Retired)| FA29 0E3B 35AF E8AE 6651 | paul at whooppee dot com   |
| Kernel Developer | 0786 F758 55DE 53BA 7731 | pgoyette at netbsd dot org |
+--+--++


Re: Build break - kasan issue in rump

2018-08-20 Thread Paul Goyette

FWIW, I suspect the following will fix the build break:

--- kern_malloc.c   20 Aug 2018 15:04:52 -  1.148
+++ kern_malloc.c   21 Aug 2018 01:19:22 -
@@ -72,7 +72,9 @@
 #include 
 __KERNEL_RCSID(0, "$NetBSD: kern_malloc.c,v 1.148 2018/08/20 15:04:52 maxv Exp 
$");

+#ifdef _KERNEL_OPT
 #include "opt_kasan.h"
+#endif

 #include 
 #include 

On Tue, 21 Aug 2018, Paul Goyette wrote:


Always rump finds stuff!

This is on amd64, with sources updated as of 2018-08-20 at 22:44:31 UTC

#create  librump/kern_malloc.d
.
.
.
/build/netbsd-local/src_ro/lib/librump/../../sys/rump/../kern/kern_malloc.c:75:23: 
fatal error: opt_kasan.h: No such file or directory

#include "opt_kasan.h"
  ^
compilation terminated.


+--+--+--------+
| Paul Goyette | PGP Key fingerprint: | E-mail addresses:  |
| (Retired)| FA29 0E3B 35AF E8AE 6651 | paul at whooppee dot com   |
| Kernel Developer | 0786 F758 55DE 53BA 7731 | pgoyette at netbsd dot org |
+--+--++



+--+--+--------+
| Paul Goyette | PGP Key fingerprint: | E-mail addresses:  |
| (Retired)| FA29 0E3B 35AF E8AE 6651 | paul at whooppee dot com   |
| Kernel Developer | 0786 F758 55DE 53BA 7731 | pgoyette at netbsd dot org |
+--+--++


Re: Build failure due to DRM intel i915

2018-08-29 Thread Paul Goyette
I just did a build.sh release on sources updated as of 2018-08-29 at 
07:46:17 UTC without any issue.



On Wed, 29 Aug 2018, Riccardo Mottola wrote:


Hi,

I upgraded CVS this morning (3 hours ago), built tools and kernel.
When building Kernl Modules, I get this failure:

#   compile  i915drmkms/intel_dsi.o
/usr/src/obj/tooldir.NetBSD-8.99.23-amd64/bin/x86_64--netbsd-gcc -O2 
-march=core2 -g   -std=gnu99    -Wall -Wstrict-prototypes 
-Wmissing-prototypes -Wpointer-arith -Wno-sign-compare -Wsystem-headers   
-Wno-traditional   -Wa,--fatal-warnings -Wreturn-type -Wswitch -Wshadow 
-Wcast-qual -Wwrite-strings -Wextra -Wno-unused-parameter -Wno-sign-compare 
-Werror -Wno-shadow -ffreestanding  -fno-strict-aliasing -Wno-pointer-sign 
-mno-red-zone -mno-mmx -mno-sse -mno-avx -msoft-float -mcmodel=kernel 
-fno-omit-frame-pointer   -I/usr/src/common/include 
--sysroot=/usr/src/obj/destdir.amd64 -I/usr/src/sys/external/bsd/drm2/include 
-I/usr/src/sys/external/bsd/common/include 
-I/usr/src/sys/external/bsd/drm2/dist/include 
-I/usr/src/sys/external/bsd/drm2/dist/include/drm 
-I/usr/src/sys/external/bsd/drm2/dist/uapi 
-I/usr/src/sys/external/bsd/drm2/dist -D__KERNEL__ 
-DCONFIG_BACKLIGHT_CLASS_DEVICE=0 -DCONFIG_BACKLIGHT_CLASS_DEVICE_MODULE=0 
-DCONFIG_DRM_FBDEV_EMULATION=0 -DCONFIG_FB=0 -DDIAGNOSTIC 
-I/usr/src/sys/sys/modules/drmkms -I/usr/src/sys/external/bsd/drm2/i915drm 
-I/usr/src/sys/external/bsd/drm2/dist/drm/i915 -DCONFIG_DRM_I915_FBDEV=1 
-DCONFIG_DRM_I915_PRELIMINARY_HW_SUPPORT=0 -DNACPICA=1 -DNVGA=1 
-I/usr/src/common/include  -nostdinc -I. -I/usr/src/sys/modules/i915drmkms 
-isystem /usr/src/sys -isystem /usr/src/sys/arch -isystem 
/usr/src/sys/../common/include -D_KERNEL -D_LKM -D_MODULE 
-DSYSCTL_INCLUDE_DESCR -c 
/usr/src/sys/external/bsd/drm2/dist/drm/i915/intel_dsi.c
In file included from 
/usr/src/sys/external/bsd/drm2/dist/drm/i915/intel_dsi.c:42:0:
/usr/src/sys/external/bsd/drm2/dist/drm/i915/intel_dsi.h:107:23: error: field 
'base' has incomplete type

  struct mipi_dsi_host base;
   ^~~~
/usr/src/sys/external/bsd/drm2/dist/drm/i915/intel_dsi.c:97:25: error: 
'struct mipi_dsi_msg' declared inside parameter list will not be visible 
outside of this definition or declaration [-Werror]

    const struct mipi_dsi_msg *msg)
 ^~~~
/usr/src/sys/external/bsd/drm2/dist/drm/i915/intel_dsi.c: In function 
'intel_dsi_host_transfer':
/usr/src/sys/external/bsd/drm2/dist/drm/i915/intel_dsi.c:103:25: error: 
storage size of 'packet' isn't known

  struct mipi_dsi_packet packet;
 ^~
/usr/src/sys/external/bsd/drm2/dist/drm/i915/intel_dsi.c:108:8: error: 
implicit declaration of function 'mipi_dsi_create_packet' 
[-Werror=implicit-function-declaration]

  ret = mipi_dsi_create_packet(, msg);
    ^~
/usr/src/sys/external/bsd/drm2/dist/drm/i915/intel_dsi.c:115:9: error: 
dereferencing pointer to incomplete type 'const struct mipi_dsi_msg'

  if (msg->flags & MIPI_DSI_MSG_USE_LPM) {
 ^~
/usr/src/sys/external/bsd/drm2/dist/drm/i915/intel_dsi.c:115:19: error: 
'MIPI_DSI_MSG_USE_LPM' undeclared (first use in this function)

  if (msg->flags & MIPI_DSI_MSG_USE_LPM) {
   ^~~~
/usr/src/sys/external/bsd/drm2/dist/drm/i915/intel_dsi.c:115:19: note: each 
undeclared identifier is reported only once for each function it appears in
/usr/src/sys/external/bsd/drm2/dist/drm/i915/intel_dsi.c:105:21: error: 
variable 'data' set but not used [-Werror=unused-but-set-variable]

  const u8 *header, *data;
 ^~~~
/usr/src/sys/external/bsd/drm2/dist/drm/i915/intel_dsi.c:103:25: error: 
unused variable 'packet' [-Werror=unused-variable]

  struct mipi_dsi_packet packet;
 ^~
/usr/src/sys/external/bsd/drm2/dist/drm/i915/intel_dsi.c: At top level:
/usr/src/sys/external/bsd/drm2/dist/drm/i915/intel_dsi.c:172:21: error: 
variable 'intel_dsi_host_ops' has initializer but incomplete type

 static const struct mipi_dsi_host_ops intel_dsi_host_ops = {
 ^
/usr/src/sys/external/bsd/drm2/dist/drm/i915/intel_dsi.c:173:2: error: 
unknown field 'attach' specified in initializer

  .attach = intel_dsi_host_attach,
  ^
/usr/src/sys/external/bsd/drm2/dist/drm/i915/intel_dsi.c:173:12: error: 
excess elements in struct initializer [-Werror]

  .attach = intel_dsi_host_attach,


I suppose this is due to the recent import!


cheers,
Riccardo

!DSPAM:5b867a60142972046016392!




+--+--+----+
| Paul Goyette | PGP Key fingerprint: | E-mail addresses:  |
| (Retired)| FA29 0E3B 35AF E8AE 6651 | paul at whooppee dot com   |
| Kernel Developer | 0786 F758 55DE 53BA 7731 | pgoyette at netbsd dot org |
+--+--++

Re: panic when removing a file in current

2018-07-18 Thread Paul Goyette

On Thu, 19 Jul 2018, Paul Goyette wrote:


Let me update my source tree, re-build, and check.

What port are you using?  i386? amd64? other?


Hmmm.  A -currrent from just a few minutes ago builds and runs correctly 
on amd64 (using QEMU).


# dd if=/dev/zero of=test.test bs=8192 count=1k
1024+0 records in
1024+0 records 
# rm test.test

# uname -a
NetBSD  8.99.22 NetBSD 8.99.22 (GENERIC) #1: Thu Jul 19 03:30:19 UTC 2018  
p...@speedy.whooppee.com:/build/netbsd-local/obj/amd64/sys/arch/amd64/compile/GENERIC
 amd64
#




On Thu, 19 Jul 2018, Johnny Billquist wrote:


Anyone seen this, or know what it's about?

On NetBSD/vax, with 8.99.22 from today.

Removing any file that has disk blocks allocated to it:

[ 653.3285523] ufs_inactive: unlinked ino 50313 on "/home" has non zero 
size 0 or blocks 1ac0 with allerror 0

[ 653.3484633] panic: ufs_inactive: dirty filesystem?
[ 653.3788284] cpu0: Begin traceback...
[ 653.3984724] panic: ufs_inactive: dirty filesystem?
[ 653.4090004] Stack traceback :
[ 653.4231115]   Process is executing in user space.
[ 653.4286045] cpu0: End traceback...
Stopped in pid 39.1 (rm) at netbsd:vpanic+0xc5: pushl   $0


If a file is small enough to have all the data in the inode itself, rm 
survives fine.


 Johnny

--
Johnny Billquist  || "I'm on a bus
 ||  on a psychedelic trip
email: b...@softjar.se ||  Reading murder books
pdp is alive! ||  tryin' to stay hip" - B. Idol






+--+--+----+
| Paul Goyette | PGP Key fingerprint: | E-mail addresses:  |
| (Retired)| FA29 0E3B 35AF E8AE 6651 | paul at whooppee dot com   |
| Kernel Developer | 0786 F758 55DE 53BA 7731 | pgoyette at netbsd dot org |
+--+--++

!DSPAM:5b50040990413217635383!




+--+--+----+
| Paul Goyette | PGP Key fingerprint: | E-mail addresses:  |
| (Retired)| FA29 0E3B 35AF E8AE 6651 | paul at whooppee dot com   |
| Kernel Developer | 0786 F758 55DE 53BA 7731 | pgoyette at netbsd dot org |
+--+--++


Re: panic when removing a file in current

2018-07-18 Thread Paul Goyette

Let me update my source tree, re-build, and check.

What port are you using?  i386? amd64? other?


On Thu, 19 Jul 2018, Johnny Billquist wrote:


Anyone seen this, or know what it's about?

On NetBSD/vax, with 8.99.22 from today.

Removing any file that has disk blocks allocated to it:

[ 653.3285523] ufs_inactive: unlinked ino 50313 on "/home" has non zero size 
0 or blocks 1ac0 with allerror 0

[ 653.3484633] panic: ufs_inactive: dirty filesystem?
[ 653.3788284] cpu0: Begin traceback...
[ 653.3984724] panic: ufs_inactive: dirty filesystem?
[ 653.4090004] Stack traceback :
[ 653.4231115]   Process is executing in user space.
[ 653.4286045] cpu0: End traceback...
Stopped in pid 39.1 (rm) at netbsd:vpanic+0xc5: pushl   $0


If a file is small enough to have all the data in the inode itself, rm 
survives fine.


 Johnny

--
Johnny Billquist  || "I'm on a bus
 ||  on a psychedelic trip
email: b...@softjar.se ||  Reading murder books
pdp is alive! ||  tryin' to stay hip" - B. Idol

!DSPAM:5b50017885611524031356!




+--+--+----+
| Paul Goyette | PGP Key fingerprint: | E-mail addresses:  |
| (Retired)| FA29 0E3B 35AF E8AE 6651 | paul at whooppee dot com   |
| Kernel Developer | 0786 F758 55DE 53BA 7731 | pgoyette at netbsd dot org |
+--+--++


Something got too big - build.sh install-image broke!

2018-09-05 Thread Paul Goyette
With sources updated on 2018-09-05 at 10:59:37 UTC (about 11 hours ago) 
I'm seeing


...
Creating rootfs...
chmod +r work/var/spool/ftp/hidden
/build/netbsd-local/tools/x86_64/amd64/bin/nbmakefs -M 1488977920 -m 1488977920 
-B 1234 -F work.spec -N work/etc -o bsize=16384,fsize=2048,density=8192 
work.rootfs work
nbmakefs: `work' size of 1496743936 is larger than the maxsize of 1488977920.

*** Failed target:  imgroot.fs


Any clues on what to bump?


+--+--++
| Paul Goyette | PGP Key fingerprint: | E-mail addresses:  |
| (Retired)| FA29 0E3B 35AF E8AE 6651 | paul at whooppee dot com   |
| Kernel Developer | 0786 F758 55DE 53BA 7731 | pgoyette at netbsd dot org |
+--+--++


Re: incomplete obsolescence of "viadrm"

2018-07-11 Thread Paul Goyette
While we're on the subject of viadrm, I noticed that it was removed from 
the amd64 GENERIC and ALL configs.


Since one of the premises of removal was that viadrmums was a suitable 
replacement, shouldn't that have been added to the relevant configs?



On Wed, 11 Jul 2018, John D. Baker wrote:


Following the removal of "viadrm" from sources and i386 GENERIC and ALL
configs, the module form is only being obsoleted and removed from:

 /stand/i386/8.99.21/modules/viadrm/viadrm.kmod

and its immediate parent directory.

There are two other instances which are not being accounted for:

 /stand/i386-xen/8.99.21/modules/viadrm/viadrm.kmod
 /stand/i386pae-xen/8.99.21/modules/viadrm/viadrm.kmod

Following the latest 'postinstall ... fix obsolete', only the first
instance was removed while the other two were left intact.

--
|/"\ John D. Baker, KN5UKS   NetBSD Darwin/MacOS X
|\ / jdbaker[snail]mylinuxisp[flyspeck]comOpenBSDFreeBSD
| X  No HTML/proprietary data in email.   BSD just sits there and works!
|/ \ GPGkeyID:  D703 4A7E 479F 63F8 D3F4  BD99 9572 8F23 E4AD 1645


!DSPAM:5b46ae64281411907016811!




+--+--+----+
| Paul Goyette | PGP Key fingerprint: | E-mail addresses:  |
| (Retired)| FA29 0E3B 35AF E8AE 6651 | paul at whooppee dot com   |
| Kernel Developer | 0786 F758 55DE 53BA 7731 | pgoyette at netbsd dot org |
+--+--++


Re: incomplete obsolescence of "viadrm"

2018-07-12 Thread Paul Goyette

On Thu, 12 Jul 2018, m...@netbsd.org wrote:


On Thu, Jul 12, 2018 at 09:32:47AM +0800, Paul Goyette wrote:

While we're on the subject of viadrm, I noticed that it was removed from the
amd64 GENERIC and ALL configs.

Since one of the premises of removal was that viadrmums was a suitable
replacement, shouldn't that have been added to the relevant configs?


If you think it's good, we can enable it by default.
On my machine it produces a corrupt display, although I think vesa
failed harder.


It should be included in ALL, whether or not it works 100%.

I have no problem with adding it as a comment line in GENERIC.



+--+--++
| Paul Goyette | PGP Key fingerprint: | E-mail addresses:  |
| (Retired)| FA29 0E3B 35AF E8AE 6651 | paul at whooppee dot com   |
| Kernel Developer | 0786 F758 55DE 53BA 7731 | pgoyette at netbsd dot org |
+--+--++


Re: running out of file descriptors

2018-04-18 Thread Paul Goyette

On Wed, 18 Apr 2018, Thomas Klausner wrote:


Hi!

I've recently updated to a NetBSD built on April 3rd. In my latest bulk builds 
I noticed

/netbsd: file: table is full - increase kern.maxfiles or MAXFILES

It was around 3700, I've bumped it to 8000.

I wonder why I needed to do that though. Did something start using
more file descriptors, or is something leaking file descriptors?

Did anyone else notice something similar?


The other day I had an instance of the shell (tcsh) reporting "No more 
processes" for a brief period.  The problem self-corrected so I didn't 
research any further.




+--+--+----+
| Paul Goyette | PGP Key fingerprint: | E-mail addresses:  |
| (Retired)| FA29 0E3B 35AF E8AE 6651 | paul at whooppee dot com   |
| Kernel Developer | 0786 F758 55DE 53BA 7731 | pgoyette at netbsd dot org |
+--+--++


Bad man-page cross-refs?

2018-04-20 Thread Paul Goyette

fixmount(8) seems to refer to mtab(5) and rmtab(5), neither exist.


+--+--++
| Paul Goyette | PGP Key fingerprint: | E-mail addresses:  |
| (Retired)| FA29 0E3B 35AF E8AE 6651 | paul at whooppee dot com   |
| Kernel Developer | 0786 F758 55DE 53BA 7731 | pgoyette at netbsd dot org |
+--+--++


Two fatal errors with yesterday's -current

2018-04-22 Thread Paul Goyette
With sources built from 2018-04-20 23:11:20 UTC I encountered two fatal 
errors while booting:


First, the system was unable to identify either of my hard drives:

...
[   6.4285988] ums0 at uhidev2: 3 buttons and Z dir
[   6.4285988] wsmouse0 at ums0 mux 0
[   6.6787124] wd0: IDENTIFY failed
[   6.6787124] wd0: fixing 0 sector size
[   6.6787124] wd0: secperunit and ncylinders are zero
[   9.6800696] wd0(ahcisata0:0:0): using PIO mode 0
[   9.6800696] wd1 at atabus1 drive 0
[   9.7100829] ehci_sync_hc: timed out
[  10.7105355] ehci_sync_hc: timed out
[  12.6814270] wd1: IDENTIFY failed
[  12.6814270] wd1: fixing 0 sector size
[  12.6814270] wd1: secperunit and ncylinders are zero
...


Second, I received a whole bunch of the "ehci_sync_hc: timed out"
messages.  The two occurrences above were the first, but there were
several more later on in the boot.

After all the messages, the boot process prompted me for a root device
(since the real root device suffered from the IDENTIFY failed).  And
just as soon as I entered one character on the keyboard, BOOM it crashed
and dropped into ddb.

Unfortunately, I was unable to get a crash dump (no hard drive!) , and I
also forgot to transcribe the backtrace.  If necessary, I can repeat the
experiment.

For comparison purposes, I have attached the dmesg from my earlier 
(sources dated 2018-03-20 11:25:00 UTC) working kernel.



+--+--+----+
| Paul Goyette | PGP Key fingerprint: | E-mail addresses:  |
| (Retired)| FA29 0E3B 35AF E8AE 6651 | paul at whooppee dot com   |
| Kernel Developer | 0786 F758 55DE 53BA 7731 | pgoyette at netbsd dot org |
+--+--++Copyright (c) 1996, 1997, 1998, 1999, 2000, 2001, 2002, 2003, 2004, 2005,
2006, 2007, 2008, 2009, 2010, 2011, 2012, 2013, 2014, 2015, 2016, 2017,
2018 The NetBSD Foundation, Inc.  All rights reserved.
Copyright (c) 1982, 1986, 1989, 1991, 1993
The Regents of the University of California.  All rights reserved.

NetBSD 8.99.14 (SPEEDY 2018-03-20 11:25:00 UTC) #0: Wed Mar 21 10:38:29 UTC 2018

p...@speedy.whooppee.com:/build/netbsd-local/obj/amd64/sys/arch/amd64/compile/SPEEDY
total memory = 127 GB
avail memory = 124 GB
cpu_rng: RDSEED
timecounter: Timecounters tick every 10.000 msec
timecounter: Timecounter "i8254" frequency 1193182 Hz quality 100
ASUS All Series (System Version)
mainbus0 (root)
ACPI: RSDP 0x000F0540 24 (v02 ALASKA)
ACPI: XSDT 0xCB235090 94 (v01 ALASKA A M I01072009 AMI  
00010013)
ACPI: FACP 0xCB26B9D0 00010C (v05 ALASKA A M I01072009 AMI  
00010013)
ACPI: DSDT 0xCB2351B8 036814 (v02 ALASKA A M I01072009 INTL 
20091013)
ACPI: FACS 0xCD48CF80 40
ACPI: APIC 0xCB26BAE0 000138 (v03 ALASKA A M I01072009 AMI  
00010013)
ACPI: FPDT 0xCB26BC18 44 (v01 ALASKA A M I01072009 AMI  
00010013)
ACPI: FIDT 0xCB26BC60 9C (v01 ALASKA A M I01072009 AMI  
00010013)
ACPI: MCFG 0xCB26BD00 3C (v01 ALASKA A M I01072009 MSFT 
0097)
ACPI: ASF! 0xCB2825A8 A0 (v32 INTEL   HCG 0001 TFSM 
000F4240)
ACPI: SSDT 0xCB26BD98 00036D (v01 SataRe SataTabl 1000 INTL 
20120913)
ACPI: UEFI 0xCB26C108 42 (v01 ALASKA A M I01072009  
)
ACPI: HPET 0xCB26C150 38 (v01 ALASKA A M I0001 INTL 
20091013)
ACPI: MSCT 0xCB26C188 90 (v01 ALASKA A M I0001 INTL 
20091013)
ACPI: SLIT 0xCB26C218 2D (v01 ALASKA A M I0001 INTL 
20091013)
ACPI: SRAT 0xCB26C248 001158 (v03 ALASKA A M I0001 INTL 
20091013)
ACPI: WDDT 0xCB26D3A0 40 (v01 ALASKA A M I INTL 
20091013)
ACPI: SSDT 0xCB26D3E0 0151C1 (v02 ALASKA PmMgt0001 INTL 
20120913)
ACPI: 3 ACPI AML tables successfully acquired and loaded
ioapic0 at mainbus0 apid 1: pa 0xfec0, version 0x20, 24 pins
ioapic1 at mainbus0 apid 2: pa 0xfec01000, version 0x20, 24 pins
cpu0 at mainbus0 apid 0
cpu0: Intel(R) Core(TM) i7-6900K CPU @ 3.20GHz, id 0x406f1
cpu0: package 0, core 0, smt 0
cpu1 at mainbus0 apid 2
cpu1: Intel(R) Core(TM) i7-6900K CPU @ 3.20GHz, id 0x406f1
cpu1: package 0, core 1, smt 0
cpu2 at mainbus0 apid 4
cpu2: Intel(R) Core(TM) i7-6900K CPU @ 3.20GHz, id 0x406f1
cpu2: package 0, core 2, smt 0
cpu3 at mainbus0 apid 6
cpu3: Intel(R) Core(TM) i7-6900K CPU @ 3.20GHz, id 0x406f1
cpu3: package 0, core 3, smt 0
cpu4 at mainbus0 apid 8
cpu4: Intel(R) Core(TM) i7-6900K CPU @ 3.20GHz, id 0x406f1
cpu4: package 0, core 4, smt 0
cpu5 at mainbus0 apid 10
cpu5: Intel(R) Core(TM) i7-6900K CPU @ 3.20GHz, id 0x406f1
cpu5: package 0, core 5, smt 0
cpu6 at mainbus0 apid 12
cpu6: Intel(R) Core(TM) i7-6900K CPU @ 3.20GHz, id 0x406f1
cpu6: package 0, core 6, smt 0
cpu7 at mainbus0 apid 14
cpu7: Intel(R) Core(TM) i7-6900K CPU @ 3.20GHz, id 0x4

Radeon video card support

2018-06-20 Thread Paul Goyette
I'm unfortunately well aware that modern nVidia graphics cardds aren't 
working with the noveaudrmkms stuff.  (I've got this nice spiffy new 
GTX-1050 running in vesa mode.)


I've located a source for a reasonably current, reasonably-priced, 
radeon card with RX580.  But before I rush out and purchase one, I 
figured it might be a good idea to find out how well it performs from 
other users.


Anyone got any success stories?  Or horror stories?  :)



+--+--++
| Paul Goyette | PGP Key fingerprint: | E-mail addresses:  |
| (Retired)| FA29 0E3B 35AF E8AE 6651 | paul at whooppee dot com   |
| Kernel Developer | 0786 F758 55DE 53BA 7731 | pgoyette at netbsd dot org |
+--+--++


Re: flist issue for current build

2018-10-08 Thread Paul Goyette
Hmmm, since you are building on macos, it could be a case-sensitive file 
system issue.  I have a hmac man page in section 3, but not HMAC.



On Mon, 8 Oct 2018, Adam wrote:


Greetings,

This is what I get while trying to build amd64 or i386 (today sources).

===  3 extra files in DESTDIR  =
Files in DESTDIR but missing from flist.
File is obsolete or flist is out of date ?
--
./usr/share/man/man3/DES_random_key.3.gz
./usr/share/man/man3/HMAC.3.gz
./usr/share/man/man3/MD5.3.gz
=  end of 3 extra files  ===

Please, fix. :)


Did you start with an empty destdir?

Martin


Yes. I usually point the destdir to a ram-disk, so it's always empty.
Also, I cross-build on macOS, but that shouldn't matter, right?

Kind regards,
Adam

!DSPAM:5bbb25ce195973638832627!




+--+--++
| Paul Goyette | PGP Key fingerprint: | E-mail addresses:  |
| (Retired)| FA29 0E3B 35AF E8AE 6651 | paul at whooppee dot com   |
| Kernel Developer | 0786 F758 55DE 53BA 7731 | pgoyette at netbsd dot org |
+--+--++


Strange build error on port evbarm64

2018-10-17 Thread Paul Goyette

While trying to make sure I haven't broken anything, I've been building
the same 67 builds as the releng cluster, on my pgoyette-compat branch.
I get the same success/failure as the releng cluster, which tells me
that my changes are unlikely to produce a build failure.

With one exception - the evbarm-aarch64 build...

I consistently get the following failure during the "release" target
(this is with build.sh's noise level set to 3, so I can see the actual
command that fails):

/build/netbsd-compat/tools/x86_64/evbarm64/bin/aarch64--netbsd-install -c -r -m 
444 bootaa64.efi /build/netbsd-compat/release/evbarm/installation/misc
aarch64--netbsd-install: 
/build/netbsd-compat/release/evbarm/installation/misc.inst.8ySV7u: mkstemp: No 
such file or directory

The error message would seem to indicate that the directory
$RELEASEDIR/release/evbarm/installation does not exist, thus mkstemp
cannot create the new temp file misc.inst.* within the directory.

Of course, the exact file name varies from build to build, but the
failure occurs in the same place, every time.  And the failure occurs
regardlesss of j=1 vs j=12, and it occurs whether or not I have
-V MKDEBUG=yes defined.  Yet the releng build succeeds.  The only
remaining (significant) difference I can see is my use of

-O /obj/evbarm64
  vs-M /evbarm-aarch64/201810160500Z-obj

I do not specify the stuff for reproducible builds, but that should not
matter (famous last words?).

-P -B 201810160500Z -V NETBSD_OFFICIAL_RELEASE=no

I can provide the entire log file (with noise-level=3) if it would be
helpful.  Here is the log header, along with a bit more context around
the eventual failure:

===> build.sh command:./build.sh -T 
/build/netbsd-compat/tools/x86_64/evbarm64 -D /build/netbsd-compat/dest/evbarm64 
-O /build/netbsd-compat/obj/evbarm64 -R /build/netbsd-compat/release -V 
RELEASEMACHINEDIR=evbarm64 -V MKDEBUG=yes -V MKKDEBUG=no -U -N3 -m evbarm64 -j1 
release
===> build.sh started:Wed Oct 17 02:04:05 UTC 2018
===> NetBSD version:  8.99.25
===> MACHINE: evbarm
===> MACHINE_ARCH:aarch64
===> Build platform:  NetBSD 8.99.25 amd64
===> HOST_SH: /bin/sh
===> MAKECONF file:   /etc/mk.conf
===> TOOLDIR path:/build/netbsd-compat/tools/x86_64/evbarm64
===> DESTDIR path:/build/netbsd-compat/dest/evbarm64
===> RELEASEDIR path: /build/netbsd-compat/release
===> Updated makewrapper: 
/build/netbsd-compat/tools/x86_64/evbarm64/bin/nbmake-evbarm64
...
release ===> etc/evbarm/cdroms/installcd
cd /build/netbsd-compat/src/sys/stand/efiboot/bootaa64 && 
/build/netbsd-compat/tools/x86_64/evbarm64/bin/nbmake release
rm -f machine &&  ln -s 
/build/netbsd-compat/src/sys/stand/efiboot/bootaa64/../../../arch/evbarm/include machine
if [ -d 
/build/netbsd-compat/src/sys/stand/efiboot/bootaa64/../../../arch/aarch64/include ]; 
then  rm -f aarch64 && ln -s 
/build/netbsd-compat/src/sys/stand/efiboot/bootaa64/../../../arch/aarch64/include 
aarch64;  fi
if [ -d 
/build/netbsd-compat/src/sys/stand/efiboot/bootaa64/../../../arch/evbarm/include ]; 
then  rm -f evbarm && ln -s 
/build/netbsd-compat/src/sys/stand/efiboot/bootaa64/../../../arch/evbarm/include 
evbarm;  fi
if [ -d /build/netbsd-compat/src/sys/stand/efiboot/bootaa64/../../../arch/arm/include 
]; then  rm -f arm && ln -s 
/build/netbsd-compat/src/sys/stand/efiboot/bootaa64/../../../arch/arm/include arm;  fi
true
/build/netbsd-compat/tools/x86_64/evbarm64/bin/aarch64--netbsd-install -c  -r 
-m 444 bootaa64.efi  /build/netbsd-compat/release/evbarm/installation/misc
aarch64--netbsd-install: 
/build/netbsd-compat/release/evbarm/installation/misc.inst.8ySV7u: mkstemp: No 
such file or directory
*** [release] Error code 1

Any help you can provide on identifying why I cannot get a successful 
build.sh release would be appreciated!



+--+--++
| Paul Goyette | PGP Key fingerprint: | E-mail addresses:  |
| (Retired)| FA29 0E3B 35AF E8AE 6651 | paul at whooppee dot com   |
| Kernel Developer | 0786 F758 55DE 53BA 7731 | pgoyette at netbsd dot org |
+--+--++


Re: Strange build error on port evbarm64

2018-10-17 Thread Paul Goyette

I've done some additional experimentation

This does not seem to be a result of using -O vs -M (ie, MAKEOBJDIR vs
MAKEOBJDIRPREFIX).  I get the same failure with each option.

It also does not seem to be a problem on my branch, as I get the same
failure on HEAD.

/build/netbsd-compat/tools/x86_64/evbarm64/bin/aarch64--netbsd-install -c -r 
-m 444 bootaa64.efi /build/netbsd-compat/release/evbarm/installation/misc
aarch64--netbsd-install: 
/build/netbsd-compat/release/evbarm/installation/misc.inst.8ySV7u: mkstemp: 
No such file or directory


The error message would seem to indicate that the directory
$RELEASEDIR/release/evbarm/installation does not exist, thus mkstemp
cannot create the new temp file misc.inst.* within the directory.


Looking closer, I find that the directory /installation DOES exist!
And there is even a /installation/misc subdirectory.  The install
command seems to have some sort of typo - it is trying to create the
temp file in

/installation/misc.inst.xx
   vs   /installation/misc/inst.xx
  ^
__|

I don't understand why this does not cause a problem on the releng
builds.

Anyway, I'm not going to worry about it, as long as it's not a problem
due to my pgoyette-compat branch!



On Wed, 17 Oct 2018, Paul Goyette wrote:


While trying to make sure I haven't broken anything, I've been building
the same 67 builds as the releng cluster, on my pgoyette-compat branch.
I get the same success/failure as the releng cluster, which tells me
that my changes are unlikely to produce a build failure.

With one exception - the evbarm-aarch64 build...

I consistently get the following failure during the "release" target
(this is with build.sh's noise level set to 3, so I can see the actual
command that fails):

/build/netbsd-compat/tools/x86_64/evbarm64/bin/aarch64--netbsd-install -c -r 
-m 444 bootaa64.efi /build/netbsd-compat/release/evbarm/installation/misc
aarch64--netbsd-install: 
/build/netbsd-compat/release/evbarm/installation/misc.inst.8ySV7u: mkstemp: 
No such file or directory


The error message would seem to indicate that the directory
$RELEASEDIR/release/evbarm/installation does not exist, thus mkstemp
cannot create the new temp file misc.inst.* within the directory.

Of course, the exact file name varies from build to build, but the
failure occurs in the same place, every time.  And the failure occurs
regardlesss of j=1 vs j=12, and it occurs whether or not I have
-V MKDEBUG=yes defined.  Yet the releng build succeeds.  The only
remaining (significant) difference I can see is my use of

-O /obj/evbarm64
 vs -M /evbarm-aarch64/201810160500Z-obj

I do not specify the stuff for reproducible builds, but that should not
matter (famous last words?).

-P -B 201810160500Z -V NETBSD_OFFICIAL_RELEASE=no

I can provide the entire log file (with noise-level=3) if it would be
helpful.  Here is the log header, along with a bit more context around
the eventual failure:

===> build.sh command:./build.sh -T 
/build/netbsd-compat/tools/x86_64/evbarm64 -D 
/build/netbsd-compat/dest/evbarm64 -O /build/netbsd-compat/obj/evbarm64 -R 
/build/netbsd-compat/release -V RELEASEMACHINEDIR=evbarm64 -V MKDEBUG=yes -V 
MKKDEBUG=no -U -N3 -m evbarm64 -j1 release

===> build.sh started:Wed Oct 17 02:04:05 UTC 2018
===> NetBSD version:  8.99.25
===> MACHINE: evbarm
===> MACHINE_ARCH:aarch64
===> Build platform:  NetBSD 8.99.25 amd64
===> HOST_SH: /bin/sh
===> MAKECONF file:   /etc/mk.conf
===> TOOLDIR path:/build/netbsd-compat/tools/x86_64/evbarm64
===> DESTDIR path:/build/netbsd-compat/dest/evbarm64
===> RELEASEDIR path: /build/netbsd-compat/release
===> Updated makewrapper: 
/build/netbsd-compat/tools/x86_64/evbarm64/bin/nbmake-evbarm64

...
release ===> etc/evbarm/cdroms/installcd
cd /build/netbsd-compat/src/sys/stand/efiboot/bootaa64 && 
/build/netbsd-compat/tools/x86_64/evbarm64/bin/nbmake release
rm -f machine &&  ln -s 
/build/netbsd-compat/src/sys/stand/efiboot/bootaa64/../../../arch/evbarm/include 
machine
if [ -d 
/build/netbsd-compat/src/sys/stand/efiboot/bootaa64/../../../arch/aarch64/include 
]; then  rm -f aarch64 && ln -s 
/build/netbsd-compat/src/sys/stand/efiboot/bootaa64/../../../arch/aarch64/include 
aarch64;  fi
if [ -d 
/build/netbsd-compat/src/sys/stand/efiboot/bootaa64/../../../arch/evbarm/include 
]; then  rm -f evbarm && ln -s 
/build/netbsd-compat/src/sys/stand/efiboot/bootaa64/../../../arch/evbarm/include 
evbarm;  fi
if [ -d 
/build/netbsd-compat/src/sys/stand/efiboot/bootaa64/../../../arch/arm/include 
]; then  rm -f arm && ln -s 
/build/netbsd-compat/src/sys/stand/efiboot/bootaa64/../../../arch/arm/include 
arm;  fi

true
/build/netbsd-compat/tools/x86_64/evbarm64/bin/aarch64--netbsd-install -c  -r 
-m 444 bootaa64.ef

Re: flist issue for current build

2018-10-08 Thread Paul Goyette

On Mon, 8 Oct 2018, Adam Ciarci?~Dski wrote:


They come from:
- src/crypto/external/bsd/openssl/lib/libcrypto/man/Makefile (upper-case 
man-pages)
- src/crypto/external/bsd/openssl/lib/libdes/Makefile (lower-case man-pages)


But why are you getting .3.gz files?
If it were plain .3 files I could understand the issue, but .3.gz ?


By using MKMANZ=yes :)


I knew I had seen this option before!  The option is, however, not
documented in the src/BUILDING file.  (It is mentioned in the
src/share/mk/bsd.README file, along with a number of other MKxxx
variables.)

Would it be a good idea to add a cross-reference from BUILDING to
the bsd.README file?  Or consolidate the information in one place
or the other?


+--+--++
| Paul Goyette | PGP Key fingerprint: | E-mail addresses:  |
| (Retired)| FA29 0E3B 35AF E8AE 6651 | paul at whooppee dot com   |
| Kernel Developer | 0786 F758 55DE 53BA 7731 | pgoyette at netbsd dot org |
+--+--++

Problem building a release of aarch64

2018-10-02 Thread Paul Goyette

I've managed to get clean builds on my [pgoyette-compat] branch from 66
out of the 67 architectures that are currently being built by the releng
build cluster.  The only one that isn't working is aarch64.

It seems to get all the way through building the world, but it fails at
this point:

...
release ===> etc/evbarm/instkernel/sshramdisk
release ===> etc/evbarm/instkernel/instkernel
test -z "" ||  /build/test/tools/x86_64/evbarm64/bin/aarch64--netbsd-install -r 
-p -c -m 444/build/test/release/evbarm64/installation/instkernel
release ===> etc/evbarm/cdroms
release ===> etc/evbarm/cdroms/installcd
cd /build/test/src/sys/stand/efiboot/bootaa64 && 
/build/test/tools/x86_64/evbarm64/bin/nbmake release
/build/test/tools/x86_64/evbarm64/bin/aarch64--netbsd-install -c -p -r 
-m 444 bootaa64.efi  /build/test/release/evbarm/installation/misc

aarch64--netbsd-install: 
/build/test/release/evbarm/installation/misc.inst.sJgZbU: mkstemp: No such file 
or directory
*** [release] Error code 1

Thinking that I might have broken something, I checked out sources from
MAIN, at the point of my most recent sync-with-head.  (This corresponds
to tag pgoyette-compat-0930, very close to 2018-09-30 at 00:00 UTC.)
And sure enough, a build of the MAIN branch at that time-stamp fails in
the same manner.

I had seen similar failures earlier, for some of the evbearm's, and I
traced those down to an accidental use of MKKDEBUG (which turns on -g
for gcc compiles, which makes things a lot bigger).  But I've checked
my logs and confirmed that MKKDEBUG is no longer enabled, and the only
kernel source being compiled with -g is debugsyms.o

Does anyone have any additional clues as to what might be going wrong?

FWIW, here's my build.sh command:

cd /build/test/src
./build.sh \
-T /build/test/tools/x86_64/evbarm64 \
-D /build/test/dest/evbarm64 \
-O /build/test/obj/evbarm64 \
-R /build/test/release \
-V RELEASEMACHINEDIR=evbarm64 \
-V MKDEBUG=no \
-V MKKDEBUG=no -U -m evbarm64 \
-j1 release


And the /build/test/src sources were checked out from anoncvs using

cvs update -r pgoyette-compat-0930


+--+--+--------+
| Paul Goyette | PGP Key fingerprint: | E-mail addresses:  |
| (Retired)| FA29 0E3B 35AF E8AE 6651 | paul at whooppee dot com   |
| Kernel Developer | 0786 F758 55DE 53BA 7731 | pgoyette at netbsd dot org |
+--+--++


Re: Problem with /etc/security script?

2018-10-06 Thread Paul Goyette

On Sun, 7 Oct 2018, Robert Elz wrote:


   Date:Sun, 7 Oct 2018 05:23:00 +0800 (+08)
   From:Paul Goyette 
   Message-ID:  

 | Lately I've been noticing messages of the following form:
 |
 | Checking mailbox ownership.
 | user paul.lock mailbox is owned by paul
 | user paul.lock mailbox is --, group wheel

You might want to work out what is using .lock file style locking in /var/mail

The current convention is to to use O_EXLOCK normally, not that old
style locking.


I'm just using a normal simple postfix as far as I know.


 | It seems like /etc/security tried to skip over the .lock files, but the
 | test only checks for the filename having a leading '.' rather than
 | matching ${user}.lock

I think it is intending to skip over dot files, rather than lock files.   '.'
is a valid char in user names, even if not often used, so simply
omitting files containing dots would not be a good idea.   If we wanted
to allow for old style user.lock files it would want to be skipping file
names that end in ".lock" not ones that happen to contain a '.'
(and then probably not skip, but validate that the xxx in xxx.lock is
owned by xxx)


Hmmm.



+--+--+----+
| Paul Goyette | PGP Key fingerprint: | E-mail addresses:  |
| (Retired)| FA29 0E3B 35AF E8AE 6651 | paul at whooppee dot com   |
| Kernel Developer | 0786 F758 55DE 53BA 7731 | pgoyette at netbsd dot org |
+--+--++


Re: Problem with /etc/security script?

2018-10-06 Thread Paul Goyette

On Sun, 7 Oct 2018, Robert Elz wrote:


   Date:Sun, 7 Oct 2018 07:39:42 +0800 (+08)
   From:Paul Goyette 
   Message-ID:  

 | I'm just using a normal simple postfix as far as I know.

You could check its config, to see how it delivers mail (but it should be using
mail.local to do that, and invoking it without the -l flag unless your /var/mail
is accessed via NFS.


No config changes in the last few days, and this just starting happening 
a few days ago.


I _think_ there's something updated in pkgsrc which is doing this, but I 
have nearly 600 packages installed and no clue as to which ones were 
recently updated.  (I rebuild and reinstall my entire package-set as a 
unit, not individually...)



But it is more likely that the .lock files are coming from some user agent -
something that is reading mail, and has been configured to use the wrong
kind of locking when fetching mail from /var/mail


Possible - see above.



+--+--++
| Paul Goyette | PGP Key fingerprint: | E-mail addresses:  |
| (Retired)| FA29 0E3B 35AF E8AE 6651 | paul at whooppee dot com   |
| Kernel Developer | 0786 F758 55DE 53BA 7731 | pgoyette at netbsd dot org |
+--+--++


Problem with /etc/security script?

2018-10-06 Thread Paul Goyette

Lately I've been noticing messages of the following form:

Checking mailbox ownership.
user paul.lock mailbox is owned by paul
user paul.lock mailbox is --, group wheel


It seems like /etc/security tried to skip over the .lock files, but the 
test only checks for the filename having a leading '.' rather than 
matching ${user}.lock


if checkyesno check_varmail; then
ls -lA /var/mail | \
awk '   NR == 1 { next; }

$9 ~ /^\./ {next; }

$3 != $9 {
print "user " $9 " mailbox is owned by " $3
}
$1 != "-rw---" {
print "user " $9 " mailbox is " $1 ", group " $4
}' > $OUTPUT
if [ -s $OUTPUT ] ; then
printf "\nChecking mailbox ownership.\n"
cat $OUTPUT
fi
fi


Shouldn't the dot-check line be

$9 ~ /\./ {next; }

?



Re: Merge (was "Collapse") of the pgoyette-compat branch (fwd)

2019-01-16 Thread Paul Goyette

FWIW, it's been pointed out to me that "collapse" of the branch might
be mistakenly taken to indicate that the branch is beyond hope!  :)

Definitely not true - it's just an unfortunate holdover from a former
$DAYJOB.  Of course, the intended meaning was to "merge" the branch
back into mainline.



On Wed, 16 Jan 2019, Paul Goyette wrote:


On Wed, 16 Jan 2019, Kamil Rytarowski wrote:


On 16.01.2019 10:20, Paul Goyette wrote:

Given the dearth of response to the notice posted on current-users I
thought I'd widen the audience a bit, to make sure that all concerned
have a chance to react!

-- Forwarded message --
Date: Mon, 14 Jan 2019 08:21:18 +0800 (PST)
From: Paul Goyette 
To: current-users@netbsd.org
Subject: Collapse of the pgoyette-compat branch

Folks,

Having reached today a significant milestone (conversion of the
rtsock_50 code to use function-pointers for callbacks), I'm ready
to collapse/merge the pgoyette-compat branch into HEAD.?? Current
status of the branch, including a list of open items, can be found
at



What are the benefits of the new branch? All compat code usable as
modules?


Well, I haven't reached 100% yet, but much closer than before.

There's much more granularity than before, with a separate compat_xx
module per NetBSD version, rather than a single monolithic compat
module.  Similarly for those architectures that support it, there are
version-specific compat32_xx modules.  As a result, you can load (or
autoload) only as much compat code as you want - it's no longer an
all-or-nothing operation.

A lot of the #ifdef spaghetti that existed has been removed and
replaced with function pointers that get loaded or initialized when
optional code is built into the kernel.  Just as an example, the
vnd(4) driver is optional, but if it is present there are some
compat functions that the mainline driver needs to call (for some
compatability ioctl's).  The compat module simply provides it own
replacement function pointers, overriding the initial values (which
point at no-op functions).

There's also a mechanism for ensuring that a module doesn't get
unloaded while some other code is following one of these function
pointers.  (A very very VERY big thanks goes out to riastradh@ for
the basic design of this, using a combination of localcount(9) and
pserialize(9) for interlocking.)

One of the things that triggered my work here was the problem with
the rtsock code.  Most people know that I use modules whenever I
can, and that includes the compat stuff.  Yet when the rtsock code
was modified, the compat functions could only be reached if they
were built-in.  They could not be part of any compat module.  (Well,
they could be included in the compat module, but there was no
mechanism for the main rtsock code to call non-built-in compat
routines.)

There were also some module infrastructure changes made, mostly a
removal of some limitations.  There's no longer a maximum number of
"required" prerequisite modules, and there's also no limit to the
depth of recursion when loading "required" modules.

Please also look at the file

[pgoyette-compat]src/sys/doc/TODO.compat-branch

for additional details.


Hopefully all this makes some sort of sense.  I really am not very
good at explaining or justifying things.  (Don't ever pick me as a
member of your debate team - I can almost guarantee you would lose
any competition in which I particpate!)





[pgoyette-compat]src/doc/TODO.compat-module

The only thing that's holding me back from committing is an inability
to commmit to timely resolution of any fallout that might arise, due
to $PERSONAL_LIFE issues.?? But, given that most people actually don't
use modules anyway, any delays in fixing the fallout should likely
have minimal impact.?? With luck, we'll actually get this into HEAD in
time for netbsd-9 after all!?? (And christos@ has encouraged me to go
ahead and commit sooner rather than later!)

So, I'm planning to commit within the next two weeks (around the 27th
or 28th of January).?? Until then, I'll respond to comments/reviews as
best as I can.


+--+--+----+

| Paul Goyette | PGP Key fingerprint: | E-mail
addresses:?? |
| (Retired)?? | FA29 0E3B 35AF E8AE 6651 | paul at whooppee 
dot

com |
| Kernel Developer | 0786 F758 55DE 53BA 7731 | pgoyette at netbsd dot
org |
+--+--++







+--+------++
| Paul Goyette | PGP Key fingerprint: | E-mail addresses:  |
| (Retired)| FA29 0E3B 35AF E8AE 6651 | paul at whooppee dot com   |
| Kernel Developer | 0786 F758 55D

Re: Collapse of the pgoyette-compat branch (fwd)

2019-01-16 Thread Paul Goyette

On Wed, 16 Jan 2019, Kamil Rytarowski wrote:


On 16.01.2019 10:20, Paul Goyette wrote:

Given the dearth of response to the notice posted on current-users I
thought I'd widen the audience a bit, to make sure that all concerned
have a chance to react!

-- Forwarded message --
Date: Mon, 14 Jan 2019 08:21:18 +0800 (PST)
From: Paul Goyette 
To: current-users@netbsd.org
Subject: Collapse of the pgoyette-compat branch

Folks,

Having reached today a significant milestone (conversion of the
rtsock_50 code to use function-pointers for callbacks), I'm ready
to collapse/merge the pgoyette-compat branch into HEAD.?? Current
status of the branch, including a list of open items, can be found
at



What are the benefits of the new branch? All compat code usable as
modules?


Well, I haven't reached 100% yet, but much closer than before.

There's much more granularity than before, with a separate compat_xx
module per NetBSD version, rather than a single monolithic compat
module.  Similarly for those architectures that support it, there are
version-specific compat32_xx modules.  As a result, you can load (or
autoload) only as much compat code as you want - it's no longer an
all-or-nothing operation.

A lot of the #ifdef spaghetti that existed has been removed and
replaced with function pointers that get loaded or initialized when
optional code is built into the kernel.  Just as an example, the
vnd(4) driver is optional, but if it is present there are some
compat functions that the mainline driver needs to call (for some
compatability ioctl's).  The compat module simply provides it own
replacement function pointers, overriding the initial values (which
point at no-op functions).

There's also a mechanism for ensuring that a module doesn't get
unloaded while some other code is following one of these function
pointers.  (A very very VERY big thanks goes out to riastradh@ for
the basic design of this, using a combination of localcount(9) and
pserialize(9) for interlocking.)

One of the things that triggered my work here was the problem with
the rtsock code.  Most people know that I use modules whenever I
can, and that includes the compat stuff.  Yet when the rtsock code
was modified, the compat functions could only be reached if they
were built-in.  They could not be part of any compat module.  (Well,
they could be included in the compat module, but there was no
mechanism for the main rtsock code to call non-built-in compat
routines.)

There were also some module infrastructure changes made, mostly a
removal of some limitations.  There's no longer a maximum number of
"required" prerequisite modules, and there's also no limit to the
depth of recursion when loading "required" modules.

Please also look at the file

[pgoyette-compat]src/sys/doc/TODO.compat-branch

for additional details.


Hopefully all this makes some sort of sense.  I really am not very
good at explaining or justifying things.  (Don't ever pick me as a
member of your debate team - I can almost guarantee you would lose
any competition in which I particpate!)





[pgoyette-compat]src/doc/TODO.compat-module

The only thing that's holding me back from committing is an inability
to commmit to timely resolution of any fallout that might arise, due
to $PERSONAL_LIFE issues.?? But, given that most people actually don't
use modules anyway, any delays in fixing the fallout should likely
have minimal impact.?? With luck, we'll actually get this into HEAD in
time for netbsd-9 after all!?? (And christos@ has encouraged me to go
ahead and commit sooner rather than later!)

So, I'm planning to commit within the next two weeks (around the 27th
or 28th of January).?? Until then, I'll respond to comments/reviews as
best as I can.


+--+--+--------+

| Paul Goyette | PGP Key fingerprint: | E-mail
addresses:?? |
| (Retired)?? | FA29 0E3B 35AF E8AE 6651 | paul at whooppee dot
com |
| Kernel Developer | 0786 F758 55DE 53BA 7731 | pgoyette at netbsd dot
org |
+--+--++







+--+--+----+
| Paul Goyette | PGP Key fingerprint: | E-mail addresses:  |
| (Retired)| FA29 0E3B 35AF E8AE 6651 | paul at whooppee dot com   |
| Kernel Developer | 0786 F758 55DE 53BA 7731 | pgoyette at netbsd dot org |
+--+--++

Re: Panic on a -current from 13/12/2018

2018-12-15 Thread Paul Goyette

On Sat, 15 Dec 2018, Chavdar Ivanov wrote:


Hi,

On 8.99.27 AMD64 running under VirtualBox I got this morning the panic
in http://ci4ic4.tx0.org/ci4ic4-panic-01.png

I have the  coredump, if it is of interest. I thought it might be
useful, as it is apparently in the wm driver.


Oh, dear.

I just updated my one-and-only machine (real hardware, nothing virual)
to yesterday's -current.   And my one-and-only network interface is
(of course) a wm0!

I hope this panic is not frequently encountered.


+--+--++
| Paul Goyette | PGP Key fingerprint: | E-mail addresses:  |
| (Retired)| FA29 0E3B 35AF E8AE 6651 | paul at whooppee dot com   |
| Kernel Developer | 0786 F758 55DE 53BA 7731 | pgoyette at netbsd dot org |
+--+--++


Re: How to boot in UEFI mode: practically no documentation

2018-05-29 Thread Paul Goyette

I bookmarked this Email a while ago - it might help...

https://mail-index.netbsd.org/current-users/2017/02/28/msg031220.html

On Tue, 29 May 2018, Thomas Mueller wrote:

Where do I find documentation on how to boot NetBSD amd64 or possibly 
i386 in UEFI mode?


I couldn't find any man page and couldn't find anything useful in the 
online NetBSD wiki.


I don't want to be limited to NetBSD, would also want to be able to 
boot FreeBSD and Linux in UEFI mode.


I noticed /usr/mdec/bootx64.efi and bootia32.efi (on an amd64 
installation) but don't really know where these would lead to.


I also saw /usr/src/sys/arch/i386/stand/efiboot .


Tom


!DSPAM:5b0d0c85242761302317190!




+--+--++
| Paul Goyette | PGP Key fingerprint: | E-mail addresses:  |
| (Retired)| FA29 0E3B 35AF E8AE 6651 | paul at whooppee dot com   |
| Kernel Developer | 0786 F758 55DE 53BA 7731 | pgoyette at netbsd dot org |
+--+--++


Re: HEADS UP: drmkms update incoming

2018-08-27 Thread Paul Goyette

On Mon, 27 Aug 2018, Taylor R Campbell wrote:


The drmkms update commit bomb is finished.  I've compile-tested amd64,
i386, macppc, sparc64, and evbarm/TEGRA, and they all build.  I'll be
around this evening US/Eastern to clean up fallout, of which I'm sure
there will be some, but I need to get to $DAYJOB for a while now!


I know that most everyone else already has the context.  But for those
few of us who don't, could you give a quick statement of what this
actually accomplishes, besides importing the current state-of-linux
as of version x.y.z?

Do we get support for new(er) graphics cards (if so,which ones)?  Do
we get specific functionality that was not available in the previous
code?  Is this all (for now), or is there more coming?  (I remember
there being a comment on IRC about "doing it all over again" to come
up to speed with the next go-round...)

Oh, and I guess this info would also belong in src/doc/CHANGES if it
hasn't already been added.  :)

Thanks so much for all the hard work, to you (and maya and everyone
(else who collaborated on this.  It was obviously a herculean task,
and it's nice to know that someone(tm) was up to it!


+--+--+----+
| Paul Goyette | PGP Key fingerprint: | E-mail addresses:  |
| (Retired)| FA29 0E3B 35AF E8AE 6651 | paul at whooppee dot com   |
| Kernel Developer | 0786 F758 55DE 53BA 7731 | pgoyette at netbsd dot org |
+--+--++


Re: ThinkPad - suspend-to-RAM intel-x86 issues and tests

2018-11-20 Thread Paul Goyette

On Tue, 20 Nov 2018, David H. Gutteridge wrote:


FWIW, I'm able to get suspend and resume to work reliably on a Lenovo
T420 with NetBSD-8.0_STABLE. (With 8.99.x, it doesn't work as reliably
because the SATA driver seems to have issues after resumption which
don't occur with 8.0.)


Jaromir Dolocek did a lot of work on ahcisata recently.  You might try 
to coordninate with him to see if his changes have affected the suspend 
and resume stuff.



+--+--++
| Paul Goyette | PGP Key fingerprint: | E-mail addresses:  |
| (Retired)| FA29 0E3B 35AF E8AE 6651 | paul at whooppee dot com   |
| Kernel Developer | 0786 F758 55DE 53BA 7731 | pgoyette at netbsd dot org |
+--+--++


Collapse of the pgoyette-compat branch

2019-01-13 Thread Paul Goyette

Folks,

Having reached today a significant milestone (conversion of the
rtsock_50 code to use function-pointers for callbacks), I'm ready
to collapse/merge the pgoyette-compat branch into HEAD.  Current
status of the branch, including a list of open items, can be found
at

[pgoyette-compat]src/doc/TODO.compat-module

The only thing that's holding me back from committing is an inability
to commmit to timely resolution of any fallout that might arise, due
to $PERSONAL_LIFE issues.  But, given that most people actually don't
use modules anyway, any delays in fixing the fallout should likely
have minimal impact.  With luck, we'll actually get this into HEAD in
time for netbsd-9 after all!  (And christos@ has encouraged me to go
ahead and commit sooner rather than later!)

So, I'm planning to commit within the next two weeks (around the 27th
or 28th of January).  Until then, I'll respond to comments/reviews as
best as I can.


+--+--++
| Paul Goyette | PGP Key fingerprint: | E-mail addresses:  |
| (Retired)| FA29 0E3B 35AF E8AE 6651 | paul at whooppee dot com   |
| Kernel Developer | 0786 F758 55DE 53BA 7731 | pgoyette at netbsd dot org |
+--+--++


Re: Error in vi (or in man page)

2018-09-14 Thread Paul Goyette

On Fri, 14 Sep 2018, Christos Zoulas wrote:


I think it is better to keep the behavior (exit) and fix the man page.
vim exits too. It is usually the case that the user does not want to
edit the file with the same name!


In my case, it was a typo.  I meant -R vs -r, so yes I really did intend 
to edit the file, just read-only.


It doesn't really matter to me which one we fix;  I'm mostly concerned 
about having the code match the docs.



+--+--++
| Paul Goyette | PGP Key fingerprint: | E-mail addresses:  |
| (Retired)| FA29 0E3B 35AF E8AE 6651 | paul at whooppee dot com   |
| Kernel Developer | 0786 F758 55DE 53BA 7731 | pgoyette at netbsd dot org |
+--+--++


Re: Error in vi (or in man page)

2018-09-14 Thread Paul Goyette

On Fri, 14 Sep 2018, Rin Okuyama wrote:


On 2018/09/14 20:37, Christos Zoulas wrote:

In article ,
Rin Okuyama   wrote:

...

It would be better to fix our vi in accordance with POSIX. Thoughts?


I think it is better to keep the behavior (exit) and fix the man page.
vim exits too. It is usually the case that the user does not want to
edit the file with the same name!


Well, I agree that the current behavior is safer for users.
I fixed the manpage.


Thank you!


+--+--++
| Paul Goyette | PGP Key fingerprint: | E-mail addresses:  |
| (Retired)| FA29 0E3B 35AF E8AE 6651 | paul at whooppee dot com   |
| Kernel Developer | 0786 F758 55DE 53BA 7731 | pgoyette at netbsd dot org |
+--+--++


Error in vi (or in man page)

2018-09-13 Thread Paul Goyette

Current vi(1) man page says

-r Recover the specified files, or, if no files are specified,
   list the files that could be recovered.  If no recoverable
   files by the specified name exist, the file is edited as if
   the -r option had not been specified.

However, in actuality, if no recoverable files by the specified name
exist, vi simply reports that fact, and does nothing;  it does not
edit the file "as if the -r option had not been specified."

#  ls kern_time_50.c
kern_time_50.c
# vi -r kern_time_50.c
No files named kern_time_50.c, readable by you, to recover
#

+--+--+----+
| Paul Goyette | PGP Key fingerprint: | E-mail addresses:  |
| (Retired)| FA29 0E3B 35AF E8AE 6651 | paul at whooppee dot com   |
| Kernel Developer | 0786 F758 55DE 53BA 7731 | pgoyette at netbsd dot org |
+--+--++


Problem with -current - video-card related?

2018-09-19 Thread Paul Goyette

While attempting to verify kern/53019 I tried to boot a -current kernel
from today's sources, unsuccessfully.

It seems to load everything, then clears the display, prints out about
two or three lines of text, and then the display goes completely black
(not even a cursor).  It obviously panics, because in a short while it
reboots.

FWIW I have a GTX 1050-Ti (nouveau) video card, running in "vga" mode
(I don't enable vesa at the boot prompt).  And it works just fine on a
8.99.22 kernel (as long as "fine" doesn't imply "video acceleration").

I have attached a copy of my dmesg output (gzipped to avoid any limits
on message length!) in case it helps.


+--+--+--------+
| Paul Goyette | PGP Key fingerprint: | E-mail addresses:  |
| (Retired)| FA29 0E3B 35AF E8AE 6651 | paul at whooppee dot com   |
| Kernel Developer | 0786 F758 55DE 53BA 7731 | pgoyette at netbsd dot org |
+--+--++

dmesg.txt.gz
Description: Binary data


Re: Merge (was "Collapse") of the pgoyette-compat branch (fwd)

2019-01-26 Thread Paul Goyette

Just a HEADS UP, I have completed the merge of [pgoyette-compat] into
-current.

I tried not to break anything, but this is a major change and I'm sure
that something will break.  Please let me know of any fallout from the
merge (preferably via Email) and I will address as soon as $REALLIFE
permits.



On Wed, 16 Jan 2019, Paul Goyette wrote:


FWIW, it's been pointed out to me that "collapse" of the branch might
be mistakenly taken to indicate that the branch is beyond hope!  :)

Definitely not true - it's just an unfortunate holdover from a former
$DAYJOB.  Of course, the intended meaning was to "merge" the branch
back into mainline.



On Wed, 16 Jan 2019, Paul Goyette wrote:


On Wed, 16 Jan 2019, Kamil Rytarowski wrote:


On 16.01.2019 10:20, Paul Goyette wrote:

Given the dearth of response to the notice posted on current-users I
thought I'd widen the audience a bit, to make sure that all concerned
have a chance to react!

-- Forwarded message --
Date: Mon, 14 Jan 2019 08:21:18 +0800 (PST)
From: Paul Goyette 
To: current-users@netbsd.org
Subject: Collapse of the pgoyette-compat branch

Folks,

Having reached today a significant milestone (conversion of the
rtsock_50 code to use function-pointers for callbacks), I'm ready
to collapse/merge the pgoyette-compat branch into HEAD.?? Current
status of the branch, including a list of open items, can be found
at



What are the benefits of the new branch? All compat code usable as
modules?


Well, I haven't reached 100% yet, but much closer than before.

There's much more granularity than before, with a separate compat_xx
module per NetBSD version, rather than a single monolithic compat
module.  Similarly for those architectures that support it, there are
version-specific compat32_xx modules.  As a result, you can load (or
autoload) only as much compat code as you want - it's no longer an
all-or-nothing operation.

A lot of the #ifdef spaghetti that existed has been removed and
replaced with function pointers that get loaded or initialized when
optional code is built into the kernel.  Just as an example, the
vnd(4) driver is optional, but if it is present there are some
compat functions that the mainline driver needs to call (for some
compatability ioctl's).  The compat module simply provides it own
replacement function pointers, overriding the initial values (which
point at no-op functions).

There's also a mechanism for ensuring that a module doesn't get
unloaded while some other code is following one of these function
pointers.  (A very very VERY big thanks goes out to riastradh@ for
the basic design of this, using a combination of localcount(9) and
pserialize(9) for interlocking.)

One of the things that triggered my work here was the problem with
the rtsock code.  Most people know that I use modules whenever I
can, and that includes the compat stuff.  Yet when the rtsock code
was modified, the compat functions could only be reached if they
were built-in.  They could not be part of any compat module.  (Well,
they could be included in the compat module, but there was no
mechanism for the main rtsock code to call non-built-in compat
routines.)

There were also some module infrastructure changes made, mostly a
removal of some limitations.  There's no longer a maximum number of
"required" prerequisite modules, and there's also no limit to the
depth of recursion when loading "required" modules.

Please also look at the file

[pgoyette-compat]src/sys/doc/TODO.compat-branch

for additional details.


Hopefully all this makes some sort of sense.  I really am not very
good at explaining or justifying things.  (Don't ever pick me as a
member of your debate team - I can almost guarantee you would lose
any competition in which I particpate!)





[pgoyette-compat]src/doc/TODO.compat-module

The only thing that's holding me back from committing is an inability
to commmit to timely resolution of any fallout that might arise, due
to $PERSONAL_LIFE issues.?? But, given that most people actually don't
use modules anyway, any delays in fixing the fallout should likely
have minimal impact.?? With luck, we'll actually get this into HEAD in
time for netbsd-9 after all!?? (And christos@ has encouraged me to go
ahead and commit sooner rather than later!)

So, I'm planning to commit within the next two weeks (around the 27th
or 28th of January).?? Until then, I'll respond to comments/reviews as
best as I can.


+--+--+----+

| Paul Goyette | PGP Key fingerprint: | E-mail
addresses:?? |
| (Retired)?? | FA29 0E3B 35AF E8AE 6651 | paul at whooppee 
dot

com |
| Kernel Developer | 0786 F758 55DE 53BA 7731 | pgoyette at netbsd dot
org |
+--+--++







+--+------++

Re: Automated report: NetBSD-current/i386 build failure

2019-01-27 Thread Paul Goyette

I think I just fixed this...

On Sun, 27 Jan 2019, Andreas Gustafsson wrote:


Th eNetBSD Test Fixture wrote:

*** [kern_mod_80.d] Error code 1


To wit:

 --- dependall-compat_80 ---
 
/tmp/bracket/build/2019.01.27.19.13.04-i386/src/sys/compat/common/kern_mod_80.c:52:31:
 fatal error: sys/compat/module.h: No such file or directory
  #include 
^
--
Andreas Gustafsson, g...@gson.org

!DSPAM:5c4e20cf220867463594025!




+--+--++
| Paul Goyette | PGP Key fingerprint: | E-mail addresses:  |
| (Retired)| FA29 0E3B 35AF E8AE 6651 | paul at whooppee dot com   |
| Kernel Developer | 0786 F758 55DE 53BA 7731 | pgoyette at netbsd dot org |
+--+--++


Re: Automated report: NetBSD-current/i386 build failure

2019-01-27 Thread Paul Goyette

This is being worked on...

On Mon, 28 Jan 2019, NetBSD Test Fixture wrote:


This is an automatically generated notice of a NetBSD-current/i386
build failure.

The failure occurred on babylon5.netbsd.org, a NetBSD/amd64 host,
using sources from CVS date 2019.01.27.22.06.07.

An extract from the build.sh output follows:

   /tmp/bracket/build/2019.01.27.22.06.07-i386/src/sys/conf/files:155: syntax 
error
   /tmp/bracket/build/2019.01.27.22.06.07-i386/src/sys/conf/files:167: option 
`COMPAT_LINUX32' dependency `COMPAT_LINUX' is an unknown option
   --- kern-GENERIC ---
   /tmp/bracket/build/2019.01.27.22.06.07-i386/src/sys/conf/files:154: syntax 
error
   /tmp/bracket/build/2019.01.27.22.06.07-i386/src/sys/conf/files:155: syntax 
error
   /tmp/bracket/build/2019.01.27.22.06.07-i386/src/sys/conf/files:167: option 
`COMPAT_LINUX32' dependency `COMPAT_LINUX' is an unknown option
   --- kern-LEGACY ---
   /tmp/bracket/build/2019.01.27.22.06.07-i386/src/sys/conf/files:154: syntax 
error
   /tmp/bracket/build/2019.01.27.22.06.07-i386/src/sys/conf/files:155: syntax 
error
   /tmp/bracket/build/2019.01.27.22.06.07-i386/src/sys/conf/files:167: option 
`COMPAT_LINUX32' dependency `COMPAT_LINUX' is an unknown option
   --- kern-INSTALL ---
   /tmp/bracket/build/2019.01.27.22.06.07-i386/src/sys/conf/files:154: syntax 
error
   /tmp/bracket/build/2019.01.27.22.06.07-i386/src/sys/conf/files:155: syntax 
error
   /tmp/bracket/build/2019.01.27.22.06.07-i386/src/sys/conf/files:167: option 
`COMPAT_LINUX32' dependency `COMPAT_LINUX' is an unknown option
   --- kern-XEN3PAE_DOMU ---
   *** Stop.
   --- distrib-dirs ---
   --- check_DESTDIR ---
   --- NetBSD.dist.tmp ---
   --- kern-XEN3PAE_DOMU ---
   *** [kern-XEN3PAE_DOMU] Error code 1
   nbmake[1]: stopped in /tmp/bracket/build/2019.01.27.22.06.07-i386/src/etc

The following commits were made between the last successful build and
the failed build:

   2019.01.27.22.00.14 pgoyette src/sys/conf/files,v 1.1223
   2019.01.27.22.06.07 pgoyette src/sys/conf/files,v 1.1224

Log files can be found at:

   
http://releng.NetBSD.org/b5reports/i386/commits-2019.01.html#2019.01.27.22.06.07

!DSPAM:5c4e549f287651283720264!




+--+--++
| Paul Goyette | PGP Key fingerprint: | E-mail addresses:  |
| (Retired)| FA29 0E3B 35AF E8AE 6651 | paul at whooppee dot com   |
| Kernel Developer | 0786 F758 55DE 53BA 7731 | pgoyette at netbsd dot org |
+--+--++


RE: swwdoq workqueue (was Re: panic in sysmon_envsys_unregister())

2019-03-27 Thread Paul Goyette

OK, I just committed this.

On Wed, 27 Mar 2019, Paul Goyette wrote:


On Tue, 26 Mar 2019, Michael van Elst wrote:


And while looking for places that work queues get destroyed in
dev/sysmon/ it seems that swwdog's workqueue is created twice when
loaded as a module.


Yep. The modularization is broken.


Do either of you want to fix this?  Or should I pick it up?  (I don't mind, I 
just don't want to duplicate effort.)




++--+---+
| Paul Goyette   | PGP Key fingerprint: | E-mail addresses: |
| (Retired)  | FA29 0E3B 35AF E8AE 6651 | p...@whooppee.com |
| Software Developer | 0786 F758 55DE 53BA 7731 | pgoye...@netbsd.org   |
++--+---+

!DSPAM:5c9b38d0100841793941041!




++--+---+
| Paul Goyette   | PGP Key fingerprint: | E-mail addresses: |
| (Retired)  | FA29 0E3B 35AF E8AE 6651 | p...@whooppee.com |
| Software Developer | 0786 F758 55DE 53BA 7731 | pgoye...@netbsd.org   |
++--+---+


RE: swwdoq workqueue (was Re: panic in sysmon_envsys_unregister())

2019-03-27 Thread Paul Goyette

On Tue, 26 Mar 2019, Michael van Elst wrote:


And while looking for places that work queues get destroyed in
dev/sysmon/ it seems that swwdog's workqueue is created twice when
loaded as a module.


Yep. The modularization is broken.


Do either of you want to fix this?  Or should I pick it up?  (I don't 
mind, I just don't want to duplicate effort.)




++--+---+
| Paul Goyette   | PGP Key fingerprint: | E-mail addresses: |
| (Retired)  | FA29 0E3B 35AF E8AE 6651 | p...@whooppee.com |
| Software Developer | 0786 F758 55DE 53BA 7731 | pgoye...@netbsd.org   |
++--+---+


/usr/tests/modules usage

2019-04-06 Thread Paul Goyette

It seemts to me that the original intent of this directory was for
"things that [help] test the modules system".  But recently we seem
to have acquired at least a couple entries here which are "module-
that-help-test-other-things".

Shouldn't the latter class be in the directory appropriate to what
is being tested?  For example, threadpool and ufetchstore testers
should perhaps belong in /usr/tests/kern ...  (And, of course, the
corresponding comments/changes related to related source code.)


++--+-------+
| Paul Goyette   | PGP Key fingerprint: | E-mail addresses: |
| (Retired)  | FA29 0E3B 35AF E8AE 6651 | p...@whooppee.com |
| Software Developer | 0786 F758 55DE 53BA 7731 | pgoye...@netbsd.org   |
++--+---+


Re: Automated report: NetBSD-current/i386 build failure

2019-04-06 Thread Paul Goyette

My amd64 build failed with

checkflist ===> distrib/sets

==  3 missing files in DESTDIR  
Files in flist but missing from DESTDIR.
File wasn't installed ?
--
./usr/tests/modules/t_ufetchstore
./usr/tests/modules/ufetchstore_tester
./usr/tests/modules/ufetchstore_tester/ufetchstore_tester.kmod
  end of 3 missing files  ==

Looks like there's a missing Makefile in that directory.



On Sat, 6 Apr 2019, Andreas Gustafsson wrote:


The i386 build is still failing, but now in a different place:

--- dependall-sys ---
/tmp/bracket/build/2019.04.06.09.33.07-i386/src/sys/compat/freebsd/freebsd_syscall.c:
 In function 'freebsd_syscall':
/tmp/bracket/build/2019.04.06.09.33.07-i386/src/sys/compat/freebsd/freebsd_syscall.c:103:9:
 error: passing argument 2 of 'ufetch_long' from incompatible pointer type 
[-Werror=incompatible-pointer-types]
--- dependall-external ---
--- dependall ---
--- dependall-sys ---
);
^
In file included from 
/tmp/bracket/build/2019.04.06.09.33.07-i386/src/sys/sys/timevar.h:66:0,
from 
/tmp/bracket/build/2019.04.06.09.33.07-i386/src/sys/sys/time.h:307,
from 
/tmp/bracket/build/2019.04.06.09.33.07-i386/src/sys/sys/param.h:145,
from 
/tmp/bracket/build/2019.04.06.09.33.07-i386/src/sys/compat/freebsd/freebsd_syscall.c:35:
/tmp/bracket/build/2019.04.06.09.33.07-i386/src/sys/sys/systm.h:341:5: note: 
expected 'long unsigned int *' but argument is of type 'register_t * {aka int 
*}'
int ufetch_long(const unsigned long *uaddr, unsigned long *valp);
^~~

--
Andreas Gustafsson, g...@gson.org

!DSPAM:5ca8931b139052003219678!




++--+---+
| Paul Goyette   | PGP Key fingerprint: | E-mail addresses: |
| (Retired)  | FA29 0E3B 35AF E8AE 6651 | p...@whooppee.com |
| Software Developer | 0786 F758 55DE 53BA 7731 | pgoye...@netbsd.org   |
++--+---+


Re: Automated report: NetBSD-current/i386 build failure

2019-04-06 Thread Paul Goyette

On Sat, 6 Apr 2019, Paul Goyette wrote:


My amd64 build failed with

checkflist ===> distrib/sets

==  3 missing files in DESTDIR  
Files in flist but missing from DESTDIR.
File wasn't installed ?
--
./usr/tests/modules/t_ufetchstore
./usr/tests/modules/ufetchstore_tester
./usr/tests/modules/ufetchstore_tester/ufetchstore_tester.kmod
  end of 3 missing files  ==

Looks like there's a missing Makefile in that directory.


maya@ fetched the missing file and committed it.  The amd64 build
is now working.

Back to i386 failures  :)  The current issue looks like another
fallout from ufetch/ustore ...


On Sat, 6 Apr 2019, Andreas Gustafsson wrote:


The i386 build is still failing, but now in a different place:

--- dependall-sys ---
/tmp/bracket/build/2019.04.06.09.33.07-i386/src/sys/compat/freebsd/freebsd_syscall.c: 
In function 'freebsd_syscall':
/tmp/bracket/build/2019.04.06.09.33.07-i386/src/sys/compat/freebsd/freebsd_syscall.c:103:9: 
error: passing argument 2 of 'ufetch_long' from incompatible pointer type 
[-Werror=incompatible-pointer-types]

--- dependall-external ---
--- dependall ---
--- dependall-sys ---
);
^
In file included from 
/tmp/bracket/build/2019.04.06.09.33.07-i386/src/sys/sys/timevar.h:66:0,
from 
/tmp/bracket/build/2019.04.06.09.33.07-i386/src/sys/sys/time.h:307,
from 
/tmp/bracket/build/2019.04.06.09.33.07-i386/src/sys/sys/param.h:145,
from 
/tmp/bracket/build/2019.04.06.09.33.07-i386/src/sys/compat/freebsd/freebsd_syscall.c:35:
/tmp/bracket/build/2019.04.06.09.33.07-i386/src/sys/sys/systm.h:341:5: 
note: expected 'long unsigned int *' but argument is of type 'register_t * 
{aka int *}'

int ufetch_long(const unsigned long *uaddr, unsigned long *valp);
^~~

--
Andreas Gustafsson, g...@gson.org

!DSPAM:5ca8931b139052003219678!




++--+---+
| Paul Goyette   | PGP Key fingerprint: | E-mail addresses: |
| (Retired)  | FA29 0E3B 35AF E8AE 6651 | p...@whooppee.com |
| Software Developer | 0786 F758 55DE 53BA 7731 | pgoye...@netbsd.org   |
++--+---+



++--+---+
| Paul Goyette   | PGP Key fingerprint: | E-mail addresses: |
| (Retired)  | FA29 0E3B 35AF E8AE 6651 | p...@whooppee.com |
| Software Developer | 0786 F758 55DE 53BA 7731 | pgoye...@netbsd.org   |
++--+---+


npf configuration for blacklistd

2019-03-29 Thread Paul Goyette

With all the discussion going on (re removal of pf), I revisited my
attempts to implement blacklistd.  But I'm still having some issues
getting npf configured.

I have two external-facing interfaces, both of which should be handled
identically by blacklistd.  I tried using the npf examples, with an
interface group containug both wm0 and tun0, but npf won't deal with
it - it complains about having multiple members in the $ext_if group.
(See PR kern/51818)

So, I tried creating two groups, one for each interface, but both
having the same blacklistd ruleset.  Now npf complains "some table
has a duplicate entry" and still doesn't start.

So, any suggestions on how to make this work?

(FWIW, I have no real opinion on the greater question(s) regarding the
possible demise of pf and/or ipf.)

++--+---+
| Paul Goyette   | PGP Key fingerprint: | E-mail addresses: |
| (Retired)  | FA29 0E3B 35AF E8AE 6651 | p...@whooppee.com |
| Software Developer | 0786 F758 55DE 53BA 7731 | pgoye...@netbsd.org   |
++--+---+


atc(6) - display current score when game exits?

2019-03-16 Thread Paul Goyette
I've been a long-time fan of atc(6), but one thing has always bothered 
me!


It seems that when you lose a game (and you always eventually will 
lose!), the playing field is erased before the high-scores list is 
printed.  Since the playing field is the only place where your current 
game's score is displayed, you end up with a screen that (most often) 
says only "You didn't beat your previous score".  Your _previous_ score 
is displayed as part of the high-score list, but your current score is 
NOT displayed.  Thus you can't see how close you came to your previous 
score.


The following simple patch seems to solve this:

Index: log.c
===
RCS file: /cvsroot/src/games/atc/log.c,v
retrieving revision 1.23
diff -u -p -r1.23 log.c
--- log.c   10 Jan 2017 20:40:53 -  1.23
+++ log.c   16 Mar 2019 22:52:52 -
@@ -161,6 +161,9 @@ log_score(int list_em)
struct utsname  lname;
longoffset;

+   printf("You directed %d planes safely to their destination.\n\n",
+   thisscore.planes);
+
if (score_fp == NULL) {
warnx("no score file available");
return (-1);

Any objections?

++--+-------+
| Paul Goyette   | PGP Key fingerprint: | E-mail addresses: |
| (Retired)  | FA29 0E3B 35AF E8AE 6651 | p...@whooppee.com |
| Software Developer | 0786 F758 55DE 53BA 7731 | pgoye...@netbsd.org   |
++--+---+


Strange usb crash during shutdown

2019-02-09 Thread Paul Goyette
I was shutting down my 8.99.32 system (kernel & userland both from just 
a few days ago), and got a strange crash.  It happened after all the 
"special" filesystems were dismounted (/proc, /kern, etc.) but before 
the "regular" dismounts for /var etc.


Here's the back-trace (manually transcribed)

mutex_oncpu + 1e
mutex_vector_enter + d9
uhub_explore + 3bc
uhub_explore + cc
usb_discover.isra.2 + 68
usb_Event_thread + 77

It had started to write a crash dump but appeared to be doing a full 
non-sparse dump and it would have taken a very long time to dump my 
entire 128GB...



+--+--+----+
| Paul Goyette | PGP Key fingerprint: | E-mail addresses:  |
| (Retired)| FA29 0E3B 35AF E8AE 6651 | paul at whooppee dot com   |
| Kernel Developer | 0786 F758 55DE 53BA 7731 | pgoyette at netbsd dot org |
+--+--++


Re: SIOCS80211 ioctl and 8.99.32

2019-01-28 Thread Paul Goyette

On Mon, 28 Jan 2019, bch wrote:


On Mon, Jan 28, 2019 at 1:51 PM Paul Goyette  wrote:


I notice that christos made some changes here overnight while I was
(gasp) sleeping!  Can you check to see if his changes fix your
problem?




It didn???t appear fixed w/i the last 0.5h or so, if that helps...


OK, this is item #2 on my To-Do list.  I'll try to get to it before 
leaving for my dentist appointment in a couple hours.




+--+--++
| Paul Goyette | PGP Key fingerprint: | E-mail addresses:  |
| (Retired)| FA29 0E3B 35AF E8AE 6651 | paul at whooppee dot com   |
| Kernel Developer | 0786 F758 55DE 53BA 7731 | pgoyette at netbsd dot org |
+--+--++

Re: SIOCS80211 ioctl and 8.99.32

2019-01-28 Thread Paul Goyette

I notice that christos made some changes here overnight while I was
(gasp) sleeping!  Can you check to see if his changes fix your
problem?


On Mon, 28 Jan 2019, Ryo ONODERA wrote:


Hi,

From: Patrick Welche , Date: Mon, 28 Jan 2019 10:58:54 +


Just tried a 8.99.32 kernel and get

Successfully initialized wpa_supplicant
ioctl[SIOCS80211, op=16, arg_len=0]: Inappropriate ioctl for device
iwn0: Failed to initialize driver interface

when running wpa_supplicant. (8.99.30 userland from 22 Jan - works with
22 Jan kernel)


I have the same problem with iwm(4).
And wpa_supplicant's error message seems wrong.
It should be
ioctl[SIOCG80211, op=16, arg_len=0]: Inappropriate ioctl for device


Cheers,

Patrick


--
Ryo ONODERA // r...@tetera.org
PGP fingerprint = 82A2 DC91 76E0 A10A 8ABB  FD1B F404 27FA C7D1 15F3

!DSPAM:5c4ee142213832910019183!




+--+--++
| Paul Goyette | PGP Key fingerprint: | E-mail addresses:  |
| (Retired)| FA29 0E3B 35AF E8AE 6651 | paul at whooppee dot com   |
| Kernel Developer | 0786 F758 55DE 53BA 7731 | pgoyette at netbsd dot org |
+--+--++


Re: SIOCS80211 ioctl and 8.99.32

2019-01-28 Thread Paul Goyette

On Mon, 28 Jan 2019, bch wrote:


On Mon, Jan 28, 2019 at 1:51 PM Paul Goyette  wrote:


I notice that christos made some changes here overnight while I was
(gasp) sleeping!  Can you check to see if his changes fix your
problem?




It didn???t appear fixed w/i the last 0.5h or so, if that helps...


OK, I think I figured it out.  It's actually the same problem as the
one I was chasing for msaitoh@

I've got a tentative fix, testing it now.


+--+--++
| Paul Goyette | PGP Key fingerprint: | E-mail addresses:  |
| (Retired)| FA29 0E3B 35AF E8AE 6651 | paul at whooppee dot com   |
| Kernel Developer | 0786 F758 55DE 53BA 7731 | pgoyette at netbsd dot org |
+--+--++

Re: SIOCS80211 ioctl and 8.99.32

2019-01-28 Thread Paul Goyette

This should be fixed now.

An additional commit is pending to change the name of the hook, but
no functional change is intended there!



On Mon, 28 Jan 2019, Ryo ONODERA wrote:


Hi,

From: Patrick Welche , Date: Mon, 28 Jan 2019 10:58:54 +


Just tried a 8.99.32 kernel and get

Successfully initialized wpa_supplicant
ioctl[SIOCS80211, op=16, arg_len=0]: Inappropriate ioctl for device
iwn0: Failed to initialize driver interface

when running wpa_supplicant. (8.99.30 userland from 22 Jan - works with
22 Jan kernel)


I have the same problem with iwm(4).
And wpa_supplicant's error message seems wrong.
It should be
ioctl[SIOCG80211, op=16, arg_len=0]: Inappropriate ioctl for device


Cheers,

Patrick


--
Ryo ONODERA // r...@tetera.org
PGP fingerprint = 82A2 DC91 76E0 A10A 8ABB  FD1B F404 27FA C7D1 15F3

!DSPAM:5c4ee142213832910019183!




+--+--++
| Paul Goyette | PGP Key fingerprint: | E-mail addresses:  |
| (Retired)| FA29 0E3B 35AF E8AE 6651 | paul at whooppee dot com   |
| Kernel Developer | 0786 F758 55DE 53BA 7731 | pgoyette at netbsd dot org |
+--+--++


Re: wifi SIOCS80211 error in 8.99.32 and iwn0

2019-01-30 Thread Paul Goyette

That should have been fixed already, but the build cluster might not
have picked up the fix.  If you can't build your own kernel, wait for
the next build cycle.  Other folks who had the same problem have
confirmed the fix, so I'm pretty sure your problem will be fixed too.

:)



On Wed, 30 Jan 2019, David Brownlee wrote:


I'm running a netbsd-8 userland with the latest nyftp current kernel on
amd64 and just saw this on boot:

iwn0: starting wpa_supplicant
iwn0: failed to start wpa_supplicant
iwn0: Sucessfully initialized wpa_supplicant
ioctl[SIOCS80211, op=16, arg_len=0]: Inappropriate ioctl for device
iwn0: Failed to initialize driver interface

wifi appears to work fine... just an FYI in case its relevant to anyone...

Thanks

David


!DSPAM:5c51735988201261034429!



+--+--++
| Paul Goyette | PGP Key fingerprint: | E-mail addresses:  |
| (Retired)| FA29 0E3B 35AF E8AE 6651 | paul at whooppee dot com   |
| Kernel Developer | 0786 F758 55DE 53BA 7731 | pgoyette at netbsd dot org |
+--+--++


Re: SIOCS80211 ioctl and 8.99.32

2019-01-28 Thread Paul Goyette

I will investigate as soon as possible.  But it may take some time.


On Mon, 28 Jan 2019, Ryo ONODERA wrote:


Hi,

From: Patrick Welche , Date: Mon, 28 Jan 2019 10:58:54 +


Just tried a 8.99.32 kernel and get

Successfully initialized wpa_supplicant
ioctl[SIOCS80211, op=16, arg_len=0]: Inappropriate ioctl for device
iwn0: Failed to initialize driver interface

when running wpa_supplicant. (8.99.30 userland from 22 Jan - works with
22 Jan kernel)


I have the same problem with iwm(4).
And wpa_supplicant's error message seems wrong.
It should be
ioctl[SIOCG80211, op=16, arg_len=0]: Inappropriate ioctl for device


Cheers,

Patrick


--
Ryo ONODERA // r...@tetera.org
PGP fingerprint = 82A2 DC91 76E0 A10A 8ABB  FD1B F404 27FA C7D1 15F3

!DSPAM:5c4ee142213832910019183!




+--+--++
| Paul Goyette | PGP Key fingerprint: | E-mail addresses:  |
| (Retired)| FA29 0E3B 35AF E8AE 6651 | paul at whooppee dot com   |
| Kernel Developer | 0786 F758 55DE 53BA 7731 | pgoyette at netbsd dot org |
+--+--++


Re: no options COMPAT_43

2019-04-15 Thread Paul Goyette

There may be some other option you have enabled which requires COMPAT_43

Try to boot your new kernel and use modstat(8) to see what other modules 
might require the compat_43 module.



On Mon, 15 Apr 2019, Masanobu SAITOH wrote:


Hi.

I tried to make a kernel without COMPAT_43 from conf/GENERIC.
I added "no options COMPAT_43" at the end of conf/GENERIC or
conf/GENERIC.local, but compile/GENERIC/opt_compat_43.h had:

#define COMPAT_43   1

Is this behavior intended? When I added "no options COMPAT_43"
twice, config(8) said:

GENERIC:1219: warning: options `COMPAT_43' is not defined

so my "no options COMPAT_43" lines are at after loading of
sys/conf/compat_netbsd.config.

--
---
   SAITOH Masanobu (msai...@execsw.org
msai...@netbsd.org)

!DSPAM:5cb45ac1290301372017733!




++--+-------+
| Paul Goyette   | PGP Key fingerprint: | E-mail addresses: |
| (Retired)  | FA29 0E3B 35AF E8AE 6651 | p...@whooppee.com |
| Software Developer | 0786 F758 55DE 53BA 7731 | pgoye...@netbsd.org   |
++--+---+


Re: no options COMPAT_43

2019-04-15 Thread Paul Goyette

On Mon, 15 Apr 2019, Paul Goyette wrote:


There may be some other option you have enabled which requires COMPAT_43

Try to boot your new kernel and use modstat(8) to see what other modules 
might require the compat_43 module.


A quick check on my recent amd64 build shows that compat_linux requires
compat_43.



On Mon, 15 Apr 2019, Masanobu SAITOH wrote:


Hi.

I tried to make a kernel without COMPAT_43 from conf/GENERIC.
I added "no options COMPAT_43" at the end of conf/GENERIC or
conf/GENERIC.local, but compile/GENERIC/opt_compat_43.h had:

#define COMPAT_43   1

Is this behavior intended? When I added "no options COMPAT_43"
twice, config(8) said:

GENERIC:1219: warning: options `COMPAT_43' is not defined

so my "no options COMPAT_43" lines are at after loading of
sys/conf/compat_netbsd.config.

--
---
   SAITOH Masanobu (msai...@execsw.org
msai...@netbsd.org)

!DSPAM:5cb45ac1290301372017733!




++--+-------+
| Paul Goyette   | PGP Key fingerprint: | E-mail addresses: |
| (Retired)  | FA29 0E3B 35AF E8AE 6651 | p...@whooppee.com |
| Software Developer | 0786 F758 55DE 53BA 7731 | pgoye...@netbsd.org   |
++--+---+



++--+-------+
| Paul Goyette   | PGP Key fingerprint: | E-mail addresses: |
| (Retired)  | FA29 0E3B 35AF E8AE 6651 | p...@whooppee.com |
| Software Developer | 0786 F758 55DE 53BA 7731 | pgoye...@netbsd.org   |
++--+---+


Re: date/strftime() returning wrong timezone name

2019-04-17 Thread Paul Goyette

On my 8.99/35 system from last month I get

$  TZ=Australia/Melbourne date; TZ=NZ date
Wed Apr 17 16:21:31 AEST 2019
Wed Apr 17 18:21:31 NZST 2019
$

On an 8.99.37 within qemu (built only a couple days ago), I get

# TZ=Australia/Melbourne date; TZ=NZ date
Wed Apr 17 16:24:10 LMT 2019
Wed Apr 17 18:24:10 LMT 2019
#


I'd guess that a recent import of the latest tzcode has gone awry.




On Wed, 17 Apr 2019, Geoff Wing wrote:


Hi,
running /sbin/dmesg and /bin/date I am seeing a timezone name of "LMT" instead
of my normal "AEST"

Copying "date" and my zoneinfo file from a working computer, I still see bad
info.


From -current (compiled myself and from nyftp snapshot):

% TZ=Australia/Melbourne date; TZ=NZ date
Wed Apr 17 16:04:16 LMT 2019
Wed Apr 17 16:04:16 LMT 2019


From a month ago:

% TZ=Australia/Melbourne date; TZ=NZ date
Wed Apr 17 16:03:54 AEST 2019
Wed Apr 17 16:03:54 NZST 2019

"LMT" was the correct timezone name 125-150 years ago for those zones.

Is this reproducible for anyone else or something unique to me?

I see there have been some recent changes to strftime() so perhaps
those are the cause of my problem.

Regards,
Geoff

!DSPAM:5cb6c3a8134038455320440!




++------+---+
| Paul Goyette   | PGP Key fingerprint: | E-mail addresses: |
| (Retired)  | FA29 0E3B 35AF E8AE 6651 | p...@whooppee.com |
| Software Developer | 0786 F758 55DE 53BA 7731 | pgoye...@netbsd.org   |
++--+---+


Minor annoyance - keyboard LED not restored

2019-06-08 Thread Paul Goyette
This is on a amd64 8.99.37 system built from sources dated 2019-04-24 
23:45:06 UTC, and with in-tree (native) Xorg...


When switching from the X session on VT05 to any other VT, the state of
the "NumLock" LED is retained.  However, when switching to the X session
the LED is always turned off, regardless of previous state.

The internal keyboard state appears to be maintained correctly, as the
numeric keypad still generates numbers (rather than cursor movement
escape sequences), and pressing the NumLock key _twice_ turns the LED
back on.

I've noticed discrepancies with the NumLock LED before, but never
bothered to look into it in any detail until just now.  So I don't have
the slightest clue as to when this might have started.




++--+---+
| Paul Goyette   | PGP Key fingerprint: | E-mail addresses: |
| (Retired)  | FA29 0E3B 35AF E8AE 6651 | p...@whooppee.com |
| Software Developer | 0786 F758 55DE 53BA 7731 | pgoye...@netbsd.org   |
++--+---+


Re: What to do with base X11 for netbsd-9 ?

2019-05-29 Thread Paul Goyette

Well, my problems are not with X but with in-kernel nouveau driver.  I
have noted my issues several times in the past:

  * GTX 1050-Ti not supported, so I have to use vga kernel driver.
  * Shutting down xdm(8) normally results in a totally black screen
with no control, and no ability even for blind typing.
  * Therefore, a system shutdown or reboot requires manually using
``kill -9'' on xdm (from the console), followed by shutdown(9).
  * Once I've killed xdm, I cannot restart it - I _must_ reboot!
  * Attempts to use the vesa driver (rather than vga) have been
unsuccessful.

I have a (rather "sub-optimal") workaround for my situation, so it's not
critical, and I would not use my issues as a reason for delaying the
up-coming branch of NetBSD-9.

I am curious why we need to apparently build all of the LLVM stuff for a
release?  If LLVM is needed as part of the build, it should be built as
host tools;  I don't understand why we _also_ need to build LLVM for the
target machine.  (And, if the answer to this is YES, we should probably
find a way to build-and-install a minimal subset, not the whole thing!)



On Wed, 29 May 2019, Martin Husemann wrote:


Hey folks,

I would like to get your feedback about the current state of the base X11,
as there have been lots of threads about various issues and I have lost
the overview of what works in -current and what does not.

We *really* need to branch netbsd-9 very soon now, and it is unclear (at
least to me) what should happen to the in-tree X11 version.

- we have very obvious display bugs at first sight in xdm
- self hosting builds on 4 GB amd64 machines are not really possible
  any more (due to LLVM runtime dependencies, which can not even be
  disabled at build time right now)
- reports here (and on other lists) about display problems and success
  stories have no clear winner (but probably due to me losing track)

So what is you story/opinion: is base X11 usable in -current? If not,
what needs to be done or what hardware needs fixes?

Martin

!DSPAM:5cee313a49191121140670!




++--+---+
| Paul Goyette   | PGP Key fingerprint: | E-mail addresses: |
| (Retired)  | FA29 0E3B 35AF E8AE 6651 | p...@whooppee.com |
| Software Developer | 0786 F758 55DE 53BA 7731 | pgoye...@netbsd.org   |
++--+---+


Re: Automated report: NetBSD-current/i386 build failure

2019-06-17 Thread Paul Goyette

Hmmm...

With sources from 2019-06-17 at 17:43:48 UTC I have just completed a 
amd64 'build.sh release' successfully.


Does i386 include the chfs file system, but not include flash(4) device?




On Mon, 17 Jun 2019, Andreas Gustafsson wrote:


The build is still failing as of source date 2019.06.17.16.34.02:

 -- kern-INSTALL ---
 /tmp/bracket/build/2019.06.17.16.34.02-i386/tools/bin/i486--netbsdelf-ld: 
chfs_vfsops.o: in function `chfs_mountfs':
 chfs_vfsops.c:(.text+0xaef): undefined reference to `flash_cdevsw'

--
Andreas Gustafsson, g...@gson.org

!DSPAM:5d07dfe481556811919384!




++--+---+
| Paul Goyette   | PGP Key fingerprint: | E-mail addresses: |
| (Retired)  | FA29 0E3B 35AF E8AE 6651 | p...@whooppee.com |
| Software Developer | 0786 F758 55DE 53BA 7731 | pgoye...@netbsd.org   |
++--+---+


Re: Automated report: NetBSD-current/i386 build failure

2019-04-27 Thread Paul Goyette
 2
   nbmake[6]: stopped in 
/tmp/bracket/build/2019.04.27.06.18.15-i386/src/sys/rump/net/lib
   --- dependall-libnetcan ---

The following commits were made between the last successful build and
the failed build:

   2019.04.27.06.18.15 pgoyette src/sys/net/if_faith.c,v 1.60
   2019.04.27.06.18.15 pgoyette src/sys/net/if_mpls.c,v 1.35
   2019.04.27.06.18.15 pgoyette src/sys/net/if_srt.c,v 1.30
   2019.04.27.06.18.15 pgoyette src/sys/netcan/if_canloop.c,v 1.6

Log files can be found at:

   
http://releng.NetBSD.org/b5reports/i386/commits-2019.04.html#2019.04.27.06.18.15

!DSPAM:5cc416b7207041263116732!




++--+-------+
| Paul Goyette   | PGP Key fingerprint: | E-mail addresses: |
| (Retired)  | FA29 0E3B 35AF E8AE 6651 | p...@whooppee.com |
| Software Developer | 0786 F758 55DE 53BA 7731 | pgoye...@netbsd.org   |
++--+---+


Re: Strange apropos(1) behavior?

2019-04-19 Thread Paul Goyette

On Fri, 19 Apr 2019, Paul Goyette wrote:

Should it be possible to use both the -l (legacy mode) option and specify a 
specific section -[1-9] ?


# apropos -l -8 specific
apropos: no such table: mandb
apropos: No relevant results obtained.
Please make sure that you spelled all the terms correctly or try using 
different keywords.


And another anomaly:

# apropos -p specific
apropos: Cannot allocate 140187716927593 bytes: Cannot allocate memory

And while we're at it, the man page for apropos(1) says that the -p 
option _defaults_ to using more(1) as the pager, implying that it should 
be possible to use a different pager.  Yet there is no option to tell 
apropos to which other pager the output should be piped!


And finally, the -P option is documented in the apropos(1) man page as 
also "Turn on pager formatting".  Is this correct?  Or should one of 
these two options be a "Turn _off_ pager formatting" ?  :)




++--+-------+
| Paul Goyette   | PGP Key fingerprint: | E-mail addresses: |
| (Retired)  | FA29 0E3B 35AF E8AE 6651 | p...@whooppee.com |
| Software Developer | 0786 F758 55DE 53BA 7731 | pgoye...@netbsd.org   |
++--+---+


Strange apropos(1) behavior?

2019-04-19 Thread Paul Goyette
Should it be possible to use both the -l (legacy mode) option and 
specify a specific section -[1-9] ?


# apropos -l -8 specific
apropos: no such table: mandb
apropos: No relevant results obtained.
Please make sure that you spelled all the terms correctly or try using 
different keywords.



++--+---+
| Paul Goyette   | PGP Key fingerprint: | E-mail addresses: |
| (Retired)  | FA29 0E3B 35AF E8AE 6651 | p...@whooppee.com |
| Software Developer | 0786 F758 55DE 53BA 7731 | pgoye...@netbsd.org   |
++--+---+


Re: current long build time

2019-04-28 Thread Paul Goyette

On Sun, 28 Apr 2019, Riccardo Mottola wrote:


Hi,

after a long build time I see

 end of 37 extra files 

*** Failed target: checkflist

I did build using "./build.sh -U -x distribution" so it was not an update 
build.



What can I do? just delete the files? if there is a command to get this list 
I can try...


The list of extra files should immediately preceed the above messages.
(You did log your build command output in a file, right?)  This usually
indicated that there are files in the $DESTDIR left over from a previous
build.

Remove them, and then rerun the build.sh with -u (and save the output in
a log file!)


++--+---+
| Paul Goyette   | PGP Key fingerprint: | E-mail addresses: |
| (Retired)  | FA29 0E3B 35AF E8AE 6651 | p...@whooppee.com |
| Software Developer | 0786 F758 55DE 53BA 7731 | pgoye...@netbsd.org   |
++--+---+


Error building kernel-only

2019-06-26 Thread Paul Goyette

With sources up-to-date, a ``build.sh release'' succeeds.

HOWEVER, a ``build.sh kernel=XXX'' fails with the following errors
(the entire log is attached...)

In file included from 
/build/netbsd-local/src_ro/sys/compat/netbsd32/netbsd32.h:107:0,
 from 
/build/netbsd-local/src_ro/sys/arch/amd64/amd64/process_machdep.c:89:
./machine/netbsd32_machdep.h:108:2: error: unknown type name 'siginfo32_t'
  siginfo32_t sf_si;
  ^~~
./machine/netbsd32_machdep.h:109:2: error: unknown type name 'ucontext32_t'
  ucontext32_t sf_uc;
  ^~~~
In file included from 
/build/netbsd-local/src_ro/sys/arch/amd64/amd64/process_machdep.c:89:0:
/build/netbsd-local/src_ro/sys/compat/netbsd32/netbsd32.h:324:2: error: unknown 
type name 'siginfo32_t'
  siginfo32_t psi_siginfo; /* signal information structure */
  ^~~
/build/netbsd-local/src_ro/sys/compat/netbsd32/netbsd32.h:1178:26: error: 
unknown type name 'siginfo32_t'; did you mean 'siginfo_t'?
 void netbsd32_si_to_si32(siginfo32_t *, const siginfo_t *);
  ^~~
  siginfo_t
/build/netbsd-local/src_ro/sys/compat/netbsd32/netbsd32.h:1179:45: error: 
unknown type name 'siginfo32_t'
 void netbsd32_si32_to_si(siginfo_t *, const siginfo32_t *);
 ^~~
/build/netbsd-local/src_ro/sys/compat/netbsd32/netbsd32.h:1181:63: error: 
'struct __ksiginfo32' declared inside parameter list will not be visible 
outside of this definition or declaration [-Werror]
 void netbsd32_ksi32_to_ksi(struct _ksiginfo *si, const struct __ksiginfo32 
*si32);
   ^~~~
cc1: all warnings being treated as errors
*** [process_machdep.o] Error code 1

++--+---+
| Paul Goyette   | PGP Key fingerprint: | E-mail addresses: |
| (Retired)  | FA29 0E3B 35AF E8AE 6651 | p...@whooppee.com |
| Software Developer | 0786 F758 55DE 53BA 7731 | pgoye...@netbsd.org   |
++--+---+===> build.sh command:./build.sh -T /build/netbsd-local/tools/x86_64/amd64 
-D /build/netbsd-local/dest/amd64 -O /build/netbsd-local/obj/amd64 -R 
/build/netbsd-local/release -V RELEASEMACHINEDIR=amd64 -V MKDEBUG=yes -V 
MKKDEBUG=yes -U -x -N0 -m amd64 -j1 kernel=SPEEDY releasekernel=SPEEDY
===> build.sh started:Thu Jun 27 00:34:12 UTC 2019
===> NetBSD version:  8.99.48
===> MACHINE: amd64
===> MACHINE_ARCH:x86_64
===> Build platform:  NetBSD 8.99.45 amd64
===> HOST_SH: /bin/sh
===> MAKECONF file:   /etc/mk.conf
===> TOOLDIR path:/build/netbsd-local/tools/x86_64/amd64
===> DESTDIR path:/build/netbsd-local/dest/amd64
===> RELEASEDIR path: /build/netbsd-local/release
===> Updated makewrapper: 
/build/netbsd-local/tools/x86_64/amd64/bin/nbmake-amd64
===> Building kernel without building new tools
===> Building kernel: SPEEDY
===> Build directory: 
/build/netbsd-local/obj/amd64/sys/arch/amd64/compile/SPEEDY
Build directory is /build/netbsd-local/obj/amd64/sys/arch/amd64/compile/SPEEDY
Don't forget to run "make depend"
depending the kern library objects
making sure the kern library is up to date...
building standard kern library
/build/netbsd-local/src_ro/sys/external/bsd/drm2/dist/drm/nouveau/nouveau_bo.c: 
In function 'nouveau_ttm_io_mem_reserve':
/build/netbsd-local/src_ro/sys/external/bsd/drm2/dist/drm/nouveau/nouveau_bo.c:1469:57:
 warning: this statement may fall through [-Wimplicit-fallthrough=]
   if (drm->device.info.family < NV_DEVICE_INFO_V0_TESLA || !node->memtype)
   ~~^
/build/netbsd-local/src_ro/sys/external/bsd/drm2/dist/drm/nouveau/nouveau_bo.c:1473:2:
 note: here
  case TTM_PL_VRAM:
  ^~~~
/build/netbsd-local/src_ro/sys/external/bsd/drm2/dist/drm/nouveau/nvkm/engine/dma/nouveau_nvkm_engine_dma_usernv04.c:
 In function 'nv04_dmaobj_new':
/build/netbsd-local/src_ro/sys/external/bsd/drm2/dist/drm/nouveau/nvkm/engine/dma/nouveau_nvkm_engine_dma_usernv04.c:129:18:
 warning: this statement may fall through [-Wimplicit-fallthrough=]
   dmaobj->flags0 |= 0x8000;
   ~~~^
/build/netbsd-local/src_ro/sys/external/bsd/drm2/dist/drm/nouveau/nvkm/engine/dma/nouveau_nvkm_engine_dma_usernv04.c:130:2:
 note: here
  case NV_MEM_ACCESS_RW:
  ^~~~
/build/netbsd-local/src_ro/sys/external/bsd/drm2/dist/drm/nouveau/nvkm/engine/fifo/nouveau_nvkm_engine_fifo_nv04.c:
 In function 'nv04_fifo_swmthd':
/build/netbsd-local/src_ro/sys/external/bsd/drm2/dist/drm/nouveau/nvkm/engine/fifo/nouveau_nvkm_engine_fifo_nv04.c:124:3:
 warning: this statement may fall through [-Wimplicit-fallthrough=]
   nvkm_wr32(device, 0x003280, (engine &= ~mask));
   ^

Re: current long build time

2019-04-25 Thread Paul Goyette

On Thu, 25 Apr 2019, Riccardo Mottola wrote:


Hi all,

I am in the process updating (thus to test) my laptop (HP620 - dual core 
amd64) to current.


My first attempt failed, in userland because I used "-u" flag and clearly 
something broke.


Now I went the "Long" route

1) build tools, build kernel

2) reboot new kernel (works!)

3) rebuild tools to match kernel

4) build & install modules

5) build userland (-U -x)


I notice that 5) is running since yesterday evening, almost 24hrs now - I was 
accustomed to it to finish in a couple of hours, usually I was able to update 
during business hours. Is this expected, like are we compiling lots of new 
stuff... or is something going wrong that considerably slows down the 
process.


If you are building in-tree X then we are now building a bunch of LLVM 
stuff (needed for the new mesa).  It takes a LONG time to build.




++--+-------+
| Paul Goyette   | PGP Key fingerprint: | E-mail addresses: |
| (Retired)  | FA29 0E3B 35AF E8AE 6651 | p...@whooppee.com |
| Software Developer | 0786 F758 55DE 53BA 7731 | pgoye...@netbsd.org   |
++--+---+


Build error

2019-08-25 Thread Paul Goyette
With sources updated to just a few minutes ago, I'm getting the 
following when building on a 9.99.4 amd64 host:


dependall ===> tests/crypto/libcrypto/srp
/build/netbsd-local/tools/x86_64/amd64/lib/gcc/x86_64--netbsd/7.4.0/../../../../x86_64--netbsd/bin/ld:
 /build/netbsd-local/dest/amd64/lib/libnpf.so: undefined reference to 
`nvlist_exists'
/build/netbsd-local/tools/x86_64/amd64/lib/gcc/x86_64--netbsd/7.4.0/../../../../x86_64--netbsd/bin/ld:
 /build/netbsd-local/dest/amd64/lib/libnpf.so: undefined reference to 
`dnvlist_take_number'

(several more undefined references follow)


++--+---+
| Paul Goyette   | PGP Key fingerprint: | E-mail addresses: |
| (Retired)  | FA29 0E3B 35AF E8AE 6651 | p...@whooppee.com |
| Software Developer | 0786 F758 55DE 53BA 7731 | pgoye...@netbsd.org   |
++--+---+


Re: Build error

2019-08-25 Thread Paul Goyette

OK, I did another ``cvs up'' and the problem appears to have been fixed!

Sorry for the noise.


On Sun, 25 Aug 2019, Paul Goyette wrote:

With sources updated to just a few minutes ago, I'm getting the following 
when building on a 9.99.4 amd64 host:


dependall ===> tests/crypto/libcrypto/srp
/build/netbsd-local/tools/x86_64/amd64/lib/gcc/x86_64--netbsd/7.4.0/../../../../x86_64--netbsd/bin/ld: 
/build/netbsd-local/dest/amd64/lib/libnpf.so: undefined reference to 
`nvlist_exists'
/build/netbsd-local/tools/x86_64/amd64/lib/gcc/x86_64--netbsd/7.4.0/../../../../x86_64--netbsd/bin/ld: 
/build/netbsd-local/dest/amd64/lib/libnpf.so: undefined reference to 
`dnvlist_take_number'


(several more undefined references follow)


++--+---+
| Paul Goyette   | PGP Key fingerprint: | E-mail addresses: |
| (Retired)  | FA29 0E3B 35AF E8AE 6651 | p...@whooppee.com |
| Software Developer | 0786 F758 55DE 53BA 7731 | pgoye...@netbsd.org   |
++--+---+



++--+---+
| Paul Goyette   | PGP Key fingerprint: | E-mail addresses: |
| (Retired)  | FA29 0E3B 35AF E8AE 6651 | p...@whooppee.com |
| Software Developer | 0786 F758 55DE 53BA 7731 | pgoye...@netbsd.org   |
++--+---+


Re: NetBSD 9.99.1 Quick Question

2019-07-31 Thread Paul Goyette

On Wed, 31 Jul 2019, Thomas Mueller wrote:

How do users of 8.99.51 and earlier 8.99.x decide whether to go with 
9.99.1 or 9.0_BETA?


I am on

NetBSD amelia2 8.99.51 NetBSD 8.99.51 (NetBSD-HEAD amd64.nb899-20190723) #1: 
Tue Jul 23 13:35:29 GMT 2019  
root@amelia2:/usr/obj/usr/src/sys/arch/amd64/compile/SANDY7 amd64

as I type this; also built i386 from the same source tree.


If you were previously on 8.99.x then you were tracking -current and
not any particular release.  IMHO you should now track the 9.99.x
series, as that is the new -current!

If you intend to deploy a stable 9.0 release (rather than tracking
-current) I would recommend installing 9.0-BETA to help us make the
release as "solid" as possible.




++--+---+
| Paul Goyette   | PGP Key fingerprint: | E-mail addresses: |
| (Retired)  | FA29 0E3B 35AF E8AE 6651 | p...@whooppee.com |
| Software Developer | 0786 F758 55DE 53BA 7731 | pgoye...@netbsd.org   |
++--+---+


Re: NetBSD 9.99.1 Quick Question

2019-08-05 Thread Paul Goyette

On Mon, 5 Aug 2019, Rhialto wrote:


On Thu 01 Aug 2019 at 22:52:56 +, Thomas Mueller wrote:

Is there any preference on which Xorg I should use on a new
installation: modular or native?


I personally prefer native, since I consider X to be part of the
operating system (or at least not part of the extra software that is
installed according to the taste of the user).


Same here - I've always used the in-tree native X, and never even looked 
at the pkgsrc stuff.




++--+---+
| Paul Goyette   | PGP Key fingerprint: | E-mail addresses: |
| (Retired)  | FA29 0E3B 35AF E8AE 6651 | p...@whooppee.com |
| Software Developer | 0786 F758 55DE 53BA 7731 | pgoye...@netbsd.org   |
++--+---+


Re: Automated report: NetBSD-current/i386 test failure

2019-08-08 Thread Paul Goyette

On Thu, 8 Aug 2019, Paul Goyette wrote:


I am investigating...


This should be fixed by sys/kern/kern_module.c rev 1.138






On Thu, 8 Aug 2019, NetBSD Test Fixture wrote:


This is an automatically generated notice of a new failure of the
NetBSD test suite.

The newly failing test case is:

   modules/t_builtin:busydisable

The above test failed in each of the last 3 test runs, and passed in
at least 27 consecutive runs before that.

The following commits were made between the last successful test and
the failed test:

   2019.08.07.00.38.01 pgoyette src/sys/dev/ccd.c,v 1.180
   2019.08.07.00.38.02 pgoyette src/sys/dev/iscsi/iscsi_main.c,v 1.31
   2019.08.07.00.38.02 pgoyette src/sys/dev/usb/usbnet.c,v 1.7
   2019.08.07.00.38.02 pgoyette src/sys/kern/kern_module.c,v 1.137
   2019.08.07.00.38.02 pgoyette src/sys/kern/sysv_ipc.c,v 1.40
   2019.08.07.00.38.02 pgoyette src/sys/kern/sysv_msg.c,v 1.75
   2019.08.07.00.38.02 pgoyette src/sys/kern/sysv_sem.c,v 1.98
   2019.08.07.00.38.02 pgoyette src/sys/kern/sysv_shm.c,v 1.137
   2019.08.07.00.38.02 pgoyette src/sys/miscfs/genfs/layer_vfsops.c,v 1.52
   2019.08.07.00.38.02 pgoyette src/sys/sys/module.h,v 1.47
   2019.08.07.00.38.02 pgoyette src/sys/sys/msg.h,v 1.28
   2019.08.07.00.38.02 pgoyette src/sys/sys/sem.h,v 1.34
   2019.08.07.00.38.02 pgoyette src/sys/sys/shm.h,v 1.53
   2019.08.07.00.39.23 pgoyette src/sys/sys/param.h,v 1.603
   2019.08.07.01.09.49 roy src/lib/libc/sys/read.2,v 1.37

Log files can be found at:

   
http://releng.NetBSD.org/b5reports/i386/commits-2019.08.html#2019.08.07.01.09.49

!DSPAM:5d4bb60467481808415646!




++--+---+
| Paul Goyette   | PGP Key fingerprint: | E-mail addresses: |
| (Retired)  | FA29 0E3B 35AF E8AE 6651 | p...@whooppee.com |
| Software Developer | 0786 F758 55DE 53BA 7731 | pgoye...@netbsd.org   |
++--+---+



++--+---+
| Paul Goyette   | PGP Key fingerprint: | E-mail addresses: |
| (Retired)  | FA29 0E3B 35AF E8AE 6651 | p...@whooppee.com |
| Software Developer | 0786 F758 55DE 53BA 7731 | pgoye...@netbsd.org   |
++--+---+


Re: Automated report: NetBSD-current/i386 test failure

2019-08-08 Thread Paul Goyette

I am investigating...


On Thu, 8 Aug 2019, NetBSD Test Fixture wrote:


This is an automatically generated notice of a new failure of the
NetBSD test suite.

The newly failing test case is:

   modules/t_builtin:busydisable

The above test failed in each of the last 3 test runs, and passed in
at least 27 consecutive runs before that.

The following commits were made between the last successful test and
the failed test:

   2019.08.07.00.38.01 pgoyette src/sys/dev/ccd.c,v 1.180
   2019.08.07.00.38.02 pgoyette src/sys/dev/iscsi/iscsi_main.c,v 1.31
   2019.08.07.00.38.02 pgoyette src/sys/dev/usb/usbnet.c,v 1.7
   2019.08.07.00.38.02 pgoyette src/sys/kern/kern_module.c,v 1.137
   2019.08.07.00.38.02 pgoyette src/sys/kern/sysv_ipc.c,v 1.40
   2019.08.07.00.38.02 pgoyette src/sys/kern/sysv_msg.c,v 1.75
   2019.08.07.00.38.02 pgoyette src/sys/kern/sysv_sem.c,v 1.98
   2019.08.07.00.38.02 pgoyette src/sys/kern/sysv_shm.c,v 1.137
   2019.08.07.00.38.02 pgoyette src/sys/miscfs/genfs/layer_vfsops.c,v 1.52
   2019.08.07.00.38.02 pgoyette src/sys/sys/module.h,v 1.47
   2019.08.07.00.38.02 pgoyette src/sys/sys/msg.h,v 1.28
   2019.08.07.00.38.02 pgoyette src/sys/sys/sem.h,v 1.34
   2019.08.07.00.38.02 pgoyette src/sys/sys/shm.h,v 1.53
   2019.08.07.00.39.23 pgoyette src/sys/sys/param.h,v 1.603
   2019.08.07.01.09.49 roy src/lib/libc/sys/read.2,v 1.37

Log files can be found at:

   
http://releng.NetBSD.org/b5reports/i386/commits-2019.08.html#2019.08.07.01.09.49

!DSPAM:5d4bb60467481808415646!




++--+---+
| Paul Goyette   | PGP Key fingerprint: | E-mail addresses: |
| (Retired)  | FA29 0E3B 35AF E8AE 6651 | p...@whooppee.com |
| Software Developer | 0786 F758 55DE 53BA 7731 | pgoye...@netbsd.org   |
++--+---+


Re: Build failed on fresh sources

2019-09-29 Thread Paul Goyette

On Sun, 29 Sep 2019, Christos Zoulas wrote:


Sure, but the question is why are you getting errors in the first place.
They should have been downgraded to warnings by -Wno-error-foo.


Something in /etc/mk.conf perhaps?



christos


On Sep 29, 2019, at 3:18 AM, Greywolf  wrote:

On Sat, Sep 28, 2019 at 3:48 AM Christos Zoulas  wrote:


We've chosen not to fix the fallthrough warnings in 3rd party code, but
we instead have changed the default setting to not produce errors. Have
you made any changes to the compilation environment that caused those to
become errors again?

christos



No, sir, I have not -- fresh sources, no environment variables set (I
only set them for full release builds).

Can this be overridden with a subsequent -Wno-{something}, or something similar?


--
--*greywolf;



!DSPAM:5d90be4e77492061013659!




++--+---+
| Paul Goyette   | PGP Key fingerprint: | E-mail addresses: |
| (Retired)  | FA29 0E3B 35AF E8AE 6651 | p...@whooppee.com |
| Software Developer | 0786 F758 55DE 53BA 7731 | pgoye...@netbsd.org   |
++--+---+


Re: heads up: follow correct upgrade procedures

2019-09-30 Thread Paul Goyette

On Mon, 30 Sep 2019, Jonathan A. Kollasch wrote:


Due to some recent changes to some file system syscalls, it is necessary
to follow correct upgrade procedures to avoid trouble.

Be sure to install new kernel and matching modules and boot the new
kernel before upgrading userland.  Remember that once userland is
upgraded you will not be able to run an older kernel.  Remember that
/rescue will not save you if it has been upgraded beyond the old kernel
too.


If not already done, this should probably be added to src/UPDATING



++--+---+
| Paul Goyette   | PGP Key fingerprint: | E-mail addresses: |
| (Retired)  | FA29 0E3B 35AF E8AE 6651 | p...@whooppee.com |
| Software Developer | 0786 F758 55DE 53BA 7731 | pgoye...@netbsd.org   |
++--+---+


Re: Automated report: NetBSD-current/i386 test failure

2019-11-12 Thread Paul Goyette

Hmmm, this might be my fault.  Checking/bisecting now...

On Tue, 12 Nov 2019, NetBSD Test Fixture wrote:


This is an automatically generated notice of new failures of the
NetBSD test suite.

The newly failing test cases are:

   lib/libc/sys/t_ptrace_wait3:thread_concurrent_signals
   lib/libc/sys/t_ptrace_wait3:trace_thread_lwpcreate_and_exit
   lib/libc/sys/t_ptrace_wait4:thread_concurrent_signals
   lib/libc/sys/t_ptrace_wait4:trace_thread_lwpcreate_and_exit
   lib/libc/sys/t_ptrace_wait6:thread_concurrent_signals
   lib/libc/sys/t_ptrace_wait6:trace_thread_lwpcreate_and_exit
   lib/libc/sys/t_ptrace_wait:thread_concurrent_signals
   lib/libc/sys/t_ptrace_wait:trace_thread_lwpcreate_and_exit
   lib/libc/sys/t_ptrace_waitid:trace_thread_lwpcreate_and_exit
   lib/libc/sys/t_ptrace_waitpid:thread_concurrent_signals
   lib/libc/sys/t_ptrace_waitpid:trace_thread_lwpcreate_and_exit

The above tests failed in each of the last 3 test runs, and passed in
at least 27 consecutive runs before that.

Between the last successful test and the failed test, a total of 10064
revisions were committed, by the following developers:

   christos
   chs
   gutteridge
   hannken
   jdolecek
   jmcneill
   joerg
   kamil
   maxv
   mlelstv
   mrg
   msaitoh
   nisimura
   pgoyette
   roy
   sevan
   tnn
   yamaguchi

Log files can be found at:

   
http://releng.NetBSD.org/b5reports/i386/commits-2019.11.html#2019.11.11.02.40.48

!DSPAM:5dca6d0b256033292947347!




++--+---+
| Paul Goyette   | PGP Key fingerprint: | E-mail addresses: |
| (Retired)  | FA29 0E3B 35AF E8AE 6651 | p...@whooppee.com |
| Software Developer | 0786 F758 55DE 53BA 7731 | pgoye...@netbsd.org   |
++--+---+


Re: Automated report: NetBSD-current/i386 test failure

2019-11-12 Thread Paul Goyette

On Tue, 12 Nov 2019, Kamil Rytarowski wrote:


The problem is in ATF tests with assumptions that are no longer valid.

We are working on this. The right fix is to improve the tests.


Oh, good - not my fault after all!

Thanks!






On 12.11.2019 13:53, Paul Goyette wrote:

Hmmm, this might be my fault.?? Checking/bisecting now...

On Tue, 12 Nov 2019, NetBSD Test Fixture wrote:


This is an automatically generated notice of new failures of the
NetBSD test suite.

The newly failing test cases are:

 lib/libc/sys/t_ptrace_wait3:thread_concurrent_signals
 lib/libc/sys/t_ptrace_wait3:trace_thread_lwpcreate_and_exit
 lib/libc/sys/t_ptrace_wait4:thread_concurrent_signals
 lib/libc/sys/t_ptrace_wait4:trace_thread_lwpcreate_and_exit
 lib/libc/sys/t_ptrace_wait6:thread_concurrent_signals
 lib/libc/sys/t_ptrace_wait6:trace_thread_lwpcreate_and_exit
 lib/libc/sys/t_ptrace_wait:thread_concurrent_signals
 lib/libc/sys/t_ptrace_wait:trace_thread_lwpcreate_and_exit
 lib/libc/sys/t_ptrace_waitid:trace_thread_lwpcreate_and_exit
 lib/libc/sys/t_ptrace_waitpid:thread_concurrent_signals
 lib/libc/sys/t_ptrace_waitpid:trace_thread_lwpcreate_and_exit

The above tests failed in each of the last 3 test runs, and passed in
at least 27 consecutive runs before that.

Between the last successful test and the failed test, a total of 10064
revisions were committed, by the following developers:

 christos
 chs
 gutteridge
 hannken
 jdolecek
 jmcneill
 joerg
 kamil
 maxv
 mlelstv
 mrg
 msaitoh
 nisimura
 pgoyette
 roy
 sevan
 tnn
 yamaguchi

Log files can be found at:


http://releng.NetBSD.org/b5reports/i386/commits-2019.11.html#2019.11.11.02.40.48


!DSPAM:5dca6d0b256033292947347!




++--+---+
| Paul Goyette | PGP Key fingerprint: | E-mail 
addresses: |
| (Retired)?? | FA29 0E3B 35AF E8AE 6651 | 
p...@whooppee.com |
| Software Developer | 0786 F758 55DE 53BA 7731 | pgoye...@netbsd.org |
++--+---+






++--+---+
| Paul Goyette   | PGP Key fingerprint: | E-mail addresses: |
| (Retired)  | FA29 0E3B 35AF E8AE 6651 | p...@whooppee.com |
| Software Developer | 0786 F758 55DE 53BA 7731 | pgoye...@netbsd.org   |
++--+---+

Re: Automated report: NetBSD-current/i386 test failure

2019-11-12 Thread Paul Goyette

I've confirmed that these failures occur with builds from before
my most recent changes.  So, as Kamil indicated (and Joerg on
IRC), it ain't my fault!

On Tue, 12 Nov 2019, Paul Goyette wrote:


Hmmm, this might be my fault.  Checking/bisecting now...

On Tue, 12 Nov 2019, NetBSD Test Fixture wrote:


This is an automatically generated notice of new failures of the
NetBSD test suite.

The newly failing test cases are:

   lib/libc/sys/t_ptrace_wait3:thread_concurrent_signals
   lib/libc/sys/t_ptrace_wait3:trace_thread_lwpcreate_and_exit
   lib/libc/sys/t_ptrace_wait4:thread_concurrent_signals
   lib/libc/sys/t_ptrace_wait4:trace_thread_lwpcreate_and_exit
   lib/libc/sys/t_ptrace_wait6:thread_concurrent_signals
   lib/libc/sys/t_ptrace_wait6:trace_thread_lwpcreate_and_exit
   lib/libc/sys/t_ptrace_wait:thread_concurrent_signals
   lib/libc/sys/t_ptrace_wait:trace_thread_lwpcreate_and_exit
   lib/libc/sys/t_ptrace_waitid:trace_thread_lwpcreate_and_exit
   lib/libc/sys/t_ptrace_waitpid:thread_concurrent_signals
   lib/libc/sys/t_ptrace_waitpid:trace_thread_lwpcreate_and_exit

The above tests failed in each of the last 3 test runs, and passed in
at least 27 consecutive runs before that.

Between the last successful test and the failed test, a total of 10064
revisions were committed, by the following developers:

   christos
   chs
   gutteridge
   hannken
   jdolecek
   jmcneill
   joerg
   kamil
   maxv
   mlelstv
   mrg
   msaitoh
   nisimura
   pgoyette
   roy
   sevan
   tnn
   yamaguchi

Log files can be found at:

   
http://releng.NetBSD.org/b5reports/i386/commits-2019.11.html#2019.11.11.02.40.48

!DSPAM:5dca6d0b256033292947347!




++--+---+
| Paul Goyette   | PGP Key fingerprint: | E-mail addresses: |
| (Retired)  | FA29 0E3B 35AF E8AE 6651 | p...@whooppee.com |
| Software Developer | 0786 F758 55DE 53BA 7731 | pgoye...@netbsd.org   |
++--+---+



++--+---+
| Paul Goyette   | PGP Key fingerprint: | E-mail addresses: |
| (Retired)  | FA29 0E3B 35AF E8AE 6651 | p...@whooppee.com |
| Software Developer | 0786 F758 55DE 53BA 7731 | pgoye...@netbsd.org   |
++--+---+


Re: New ATF test failures

2019-12-15 Thread Paul Goyette

Working on it.  As far as I can tell, it just needs to initialize the
newly-created module_hook synchronization variables.

I'll commit the fix as soon as I can test it.


Index: rump.c
===
RCS file: /cvsroot/src/sys/rump/librump/rumpkern/rump.c,v
retrieving revision 1.337
diff -u -p -r1.337 rump.c
--- rump.c  7 Dec 2019 14:55:58 -   1.337
+++ rump.c  15 Dec 2019 13:43:06 -
@@ -52,6 +52,7 @@ __KERNEL_RCSID(0, "$NetBSD: rump.c,v 1.3
 #include 
 #include 
 #include 
+#include 
 #include 
 #include 
 #include 
@@ -412,6 +413,7 @@ rump_init(void)
iostat_init();
fd_sys_init();
module_init();
+   module_hook_init();
devsw_init();
pipe_init();
resource_init();




On Sun, 15 Dec 2019, Andreas Gustafsson wrote:


The following test cases are now failing on multiple testbeds:

 dev/sysmon/t_swsensor/alarm_sensor
 dev/sysmon/t_swsensor/entropy_interrupt_sensor
 dev/sysmon/t_swsensor/entropy_polled_sensor
 dev/sysmon/t_swsensor/limit_sensor
 dev/sysmon/t_swsensor/simple_sensor
 net/if_vlan/t_vlan/vlan_auto_follow_mtu
 net/if_vlan/t_vlan/vlan_auto_follow_mtu6
 net/if_vlan/t_vlan/vlan_basic
 net/if_vlan/t_vlan/vlan_basic6
 net/if_vlan/t_vlan/vlan_bridge
 net/if_vlan/t_vlan/vlan_bridge6
 net/if_vlan/t_vlan/vlan_configs
 net/if_vlan/t_vlan/vlan_configs6
 net/if_vlan/t_vlan/vlan_create_destroy
 net/if_vlan/t_vlan/vlan_create_destroy6
 net/if_vlan/t_vlan/vlan_multicast
 net/if_vlan/t_vlan/vlan_multicast6
 net/if_vlan/t_vlan/vlan_vlanid
 net/if_vlan/t_vlan/vlan_vlanid6

since these commits:

 2019.12.12.22.55.20 pgoyette src/sys/kern/files.kern 1.39
 2019.12.12.22.55.20 pgoyette src/sys/kern/init_main.c 1.509
 2019.12.12.22.55.20 pgoyette src/sys/kern/kern_module.c 1.141
 2019.12.12.22.55.20 pgoyette src/sys/kern/kern_module_hook.c 1.1
 2019.12.12.22.55.20 pgoyette src/sys/rump/librump/rumpkern/Makefile.rumpkern 
1.178
 2019.12.12.22.55.20 pgoyette src/sys/sys/module_hook.h 1.6
 2019.12.12.22.55.20 pgoyette src/sys/sys/param.h 1.624

For logs, see:

 
http://www.gson.org/netbsd/bugs/build/amd64-baremetal/commits-2019.12.html#2019.12.12.22.55.20

--
Andreas Gustafsson, g...@gson.org

!DSPAM:5df6011f137371101510858!




++--+---+
| Paul Goyette   | PGP Key fingerprint: | E-mail addresses: |
| (Retired)  | FA29 0E3B 35AF E8AE 6651 | p...@whooppee.com |
| Software Developer | 0786 F758 55DE 53BA 7731 | pgoye...@netbsd.org   |
++--+---+


Re: New ATF test failures

2019-12-15 Thread Paul Goyette

OK, looks like this is fixed.  With the fix I just committed, these
tests now pass:

# cd /usr/tests/dev/sysmon
# atf-run t_swsensor | atf-report
Tests root: /usr/tests/dev/sysmon

t_swsensor (1/1): 5 test cases
alarm_sensor: [57.065200s] Passed.
entropy_interrupt_sensor: [36.167103s] Passed.
entropy_polled_sensor: [68.102301s] Passed.
limit_sensor: [56.831116s] Passed.
simple_sensor: [35.712674s] Passed.
[253.936348s]

Summary for 1 test programs:
5 passed test cases.
0 failed test cases.
0 expected failed test cases.
0 skipped test cases.
# atf-run t_vlan | atf-report
Tests root: /usr/tests/net/if_vlan

t_vlan (1/1): 14 test cases
vlan_auto_follow_mtu: [16.895262s] Passed.
vlan_auto_follow_mtu6: [9.539698s] Passed.
vlan_basic: [31.873899s] Passed.
vlan_basic6: [31.160412s] Passed.
vlan_bridge: [5.104153s] Passed.
vlan_bridge6: [5.430393s] Passed.
vlan_configs: [4.959282s] Passed.
vlan_configs6: [5.565353s] Passed.
vlan_create_destroy: [5.446405s] Passed.
vlan_create_destroy6: [5.899840s] Passed.
vlan_multicast: [12.461551s] Passed.
vlan_multicast6: [8.807345s] Passed.
vlan_vlanid: [14.141529s] Passed.
vlan_vlanid6: [14.487024s] Passed.
[171.900011s]

Summary for 1 test programs:
14 passed test cases.
0 failed test cases.
0 expected failed test cases.
0 skipped test cases.
#

On Sun, 15 Dec 2019, Andreas Gustafsson wrote:


The following test cases are now failing on multiple testbeds:

 dev/sysmon/t_swsensor/alarm_sensor
 dev/sysmon/t_swsensor/entropy_interrupt_sensor
 dev/sysmon/t_swsensor/entropy_polled_sensor
 dev/sysmon/t_swsensor/limit_sensor
 dev/sysmon/t_swsensor/simple_sensor
 net/if_vlan/t_vlan/vlan_auto_follow_mtu
 net/if_vlan/t_vlan/vlan_auto_follow_mtu6
 net/if_vlan/t_vlan/vlan_basic
 net/if_vlan/t_vlan/vlan_basic6
 net/if_vlan/t_vlan/vlan_bridge
 net/if_vlan/t_vlan/vlan_bridge6
 net/if_vlan/t_vlan/vlan_configs
 net/if_vlan/t_vlan/vlan_configs6
 net/if_vlan/t_vlan/vlan_create_destroy
 net/if_vlan/t_vlan/vlan_create_destroy6
 net/if_vlan/t_vlan/vlan_multicast
 net/if_vlan/t_vlan/vlan_multicast6
 net/if_vlan/t_vlan/vlan_vlanid
 net/if_vlan/t_vlan/vlan_vlanid6

since these commits:

 2019.12.12.22.55.20 pgoyette src/sys/kern/files.kern 1.39
 2019.12.12.22.55.20 pgoyette src/sys/kern/init_main.c 1.509
 2019.12.12.22.55.20 pgoyette src/sys/kern/kern_module.c 1.141
 2019.12.12.22.55.20 pgoyette src/sys/kern/kern_module_hook.c 1.1
 2019.12.12.22.55.20 pgoyette src/sys/rump/librump/rumpkern/Makefile.rumpkern 
1.178
 2019.12.12.22.55.20 pgoyette src/sys/sys/module_hook.h 1.6
 2019.12.12.22.55.20 pgoyette src/sys/sys/param.h 1.624

For logs, see:

 
http://www.gson.org/netbsd/bugs/build/amd64-baremetal/commits-2019.12.html#2019.12.12.22.55.20

--
Andreas Gustafsson, g...@gson.org

!DSPAM:5df6011f137371101510858!




++--+---+
| Paul Goyette   | PGP Key fingerprint: | E-mail addresses: |
| (Retired)  | FA29 0E3B 35AF E8AE 6651 | p...@whooppee.com |
| Software Developer | 0786 F758 55DE 53BA 7731 | pgoye...@netbsd.org   |
++--+---+


Re: amd64 -current build failure

2019-12-17 Thread Paul Goyette

Christos just fixed the lzma stuff...

On Tue, 17 Dec 2019, Chavdar Ivanov wrote:


Looks like it; the lzma/bz2 undefined references still there though,
10 minutes ago.

On Tue, 17 Dec 2019 at 13:18, Andrew Doran  wrote:


Hi,

On Tue, Dec 17, 2019 at 09:49:58AM +, Chavdar Ivanov wrote:


Last two days I haven't been able to build amd64 -current:
...
/home/sysbuild/amd64/tools/lib/gcc/x86_64--netbsd/8.3.0/../../../../x86_64--netbsd/bin/ld:
/home/sysbuild/amd64/destdir/usr/lib/librump.so: undefined reference
to `rumpns_cpuctl_ioctl'
collect2: error: ld returned 1 exit status


I think I fixed that one late last night - your followup message seems to
confirm.

Andrew




--


!DSPAM:5df8dacf79529213120201!




++--+---+
| Paul Goyette   | PGP Key fingerprint: | E-mail addresses: |
| (Retired)  | FA29 0E3B 35AF E8AE 6651 | p...@whooppee.com |
| Software Developer | 0786 F758 55DE 53BA 7731 | pgoye...@netbsd.org   |
++--+---+


Re: amd64 -current build failure

2019-12-17 Thread Paul Goyette

On Tue, 17 Dec 2019, Chavdar Ivanov wrote:


I am still getting the same errors, my CVS log is empty...


You might have to run a full build.  I was trying an update build
which failed, but a full build seems to work.






On Tue, 17 Dec 2019 at 15:12, Paul Goyette  wrote:


Christos just fixed the lzma stuff...

On Tue, 17 Dec 2019, Chavdar Ivanov wrote:


Looks like it; the lzma/bz2 undefined references still there though,
10 minutes ago.

On Tue, 17 Dec 2019 at 13:18, Andrew Doran  wrote:


Hi,

On Tue, Dec 17, 2019 at 09:49:58AM +, Chavdar Ivanov wrote:


Last two days I haven't been able to build amd64 -current:
...
/home/sysbuild/amd64/tools/lib/gcc/x86_64--netbsd/8.3.0/../../../../x86_64--netbsd/bin/ld:
/home/sysbuild/amd64/destdir/usr/lib/librump.so: undefined reference
to `rumpns_cpuctl_ioctl'
collect2: error: ld returned 1 exit status


I think I fixed that one late last night - your followup message seems to
confirm.

Andrew




--







++--+---+
| Paul Goyette   | PGP Key fingerprint: | E-mail addresses: |
| (Retired)  | FA29 0E3B 35AF E8AE 6651 | p...@whooppee.com |
| Software Developer | 0786 F758 55DE 53BA 7731 | pgoye...@netbsd.org   |
++--+---+




--


!DSPAM:5df8fd1c112731575111740!




++--+---+
| Paul Goyette   | PGP Key fingerprint: | E-mail addresses: |
| (Retired)  | FA29 0E3B 35AF E8AE 6651 | p...@whooppee.com |
| Software Developer | 0786 F758 55DE 53BA 7731 | pgoye...@netbsd.org   |
++--+---+


Re: Build breakage

2019-12-12 Thread Paul Goyette

Working on it - expect a commit soon, as soon as my build comlpetes.


On Thu, 12 Dec 2019, Andreas Gustafsson wrote:


Hi all,

As of source date 2019.12.12.05.00.33, the evbarm-earmv7hf build is
failing with:

 --- kern_module.o ---
 
/tmp/bracket/build/2019.12.12.11.47.30-evbarm-earmv7hf/src/lib/librump/../../sys/rump/../kern/kern_module.c:
 In function 'module_init':
 
/tmp/bracket/build/2019.12.12.11.47.30-evbarm-earmv7hf/src/lib/librump/../../sys/rump/../kern/kern_module.c:456:20:
 error: implicit declaration of function 'pserialize_create'; did you mean 
'sysctl_create'? [-Werror=implicit-function-declaration]
   module_hook_psz = pserialize_create();
 ^
 sysctl_create
 cc1: all warnings being treated as errors
 *** [kern_module.o] Error code 1

and the sparc build is also failing with a similar error.
--
Andreas Gustafsson, g...@gson.org

!DSPAM:5df265f354781187315487!




++--+---+
| Paul Goyette   | PGP Key fingerprint: | E-mail addresses: |
| (Retired)  | FA29 0E3B 35AF E8AE 6651 | p...@whooppee.com |
| Software Developer | 0786 F758 55DE 53BA 7731 | pgoye...@netbsd.org   |
++--+---+


heads up - amd64 booting is broken

2019-12-10 Thread Paul Goyette

It seems that amd64 booting is broken, likely as a result of the
recent addition of support for multiboot2.

See [1] and [2]

[1] http://mail-index.netbsd.org/source-changes-d/2019/12/10/msg011875.html
[2] http://mail-index.netbsd.org/source-changes/2019/12/10/msg111777.html


++--+---+
| Paul Goyette   | PGP Key fingerprint: | E-mail addresses: |
| (Retired)  | FA29 0E3B 35AF E8AE 6651 | p...@whooppee.com |
| Software Developer | 0786 F758 55DE 53BA 7731 | pgoye...@netbsd.org   |
++--+---+


nvmmctl debug file

2019-10-28 Thread Paul Goyette

With the new nvmmctl utility, when building a release with MKDEBUG=yes
I get an unexpected file in

$DESTDIR/amd64/usr/libdata/debug/usr/sbin/nvmmctl.debug

Do you maybe need to add something to $SRC/distrib/sets/lists/debug/*


++--+---+
| Paul Goyette   | PGP Key fingerprint: | E-mail addresses: |
| (Retired)  | FA29 0E3B 35AF E8AE 6651 | p...@whooppee.com |
| Software Developer | 0786 F758 55DE 53BA 7731 | pgoye...@netbsd.org   |
++--+---+


Re: Tar extract behaviour changed

2019-10-23 Thread Paul Goyette

On Wed, 23 Oct 2019, Christos Zoulas wrote:


I am not advocating for either, perhaps we should just add -P to the
extraction and get over it :-)


This one gets my vote!   :)



++--+---+
| Paul Goyette   | PGP Key fingerprint: | E-mail addresses: |
| (Retired)  | FA29 0E3B 35AF E8AE 6651 | p...@whooppee.com |
| Software Developer | 0786 F758 55DE 53BA 7731 | pgoye...@netbsd.org   |
++--+---+


Re: built-in vs loadable modules and userconf

2019-10-19 Thread Paul Goyette

On Sat, 19 Oct 2019, Rhialto wrote:


On Sat 19 Oct 2019 at 08:16:02 -0700, Paul Goyette wrote:

If the module is built-in, it will never be loaded from the *.kmod file.
But as I pointed out above, disabling in userconf does NOT disable the
module, and does not prevent the module's initialization code from
executing.


Side-thought: maybe userconf can benefit from a way to disable modules?


Yeah, that might be nice to have...  :)


++--+---+
| Paul Goyette   | PGP Key fingerprint: | E-mail addresses: |
| (Retired)  | FA29 0E3B 35AF E8AE 6651 | p...@whooppee.com |
| Software Developer | 0786 F758 55DE 53BA 7731 | pgoye...@netbsd.org   |
++--+---+


Re: built-in vs loadable modules and userconf

2019-10-19 Thread Paul Goyette

On Sat, 19 Oct 2019, John D. Baker wrote:


I'm curious to see if changes to ahci_sata code in -current fixes the
problem in PR kern/54289.  On the primary test machine I have it does
not.

The only other machine to exhibit the problem is my file server using
RAIDframe.  I nearly lost the RAID booting a netbsd-9 kernel when half
of the disks failed to identify due the problem in the PR.

If I boot a -current kernel in userconf mode (boot -c) and disable the
built-in "raid" device, this should prevent the raid from trying to
configure and failing if the problem is still present.

I want to be sure that a disabled built-in module won't cause a loadable
module to be loaded and defeat the disabling of the built-in "raid" device.


Disabling the raid device in userconf doesn't disable the raidframe 
module.  If the raidframe module is in your kernel, it can only be 
disabled after booting, using modunload(8).


Also note that disabling a built-in module does not allow you to load a 
different instance of that module.  modload(8) will find the built-in 
module, and if you specify the -f option for modload it will re-enable 
the built-in module.  (Without -f, modload will report an error.)




++--+-------+
| Paul Goyette   | PGP Key fingerprint: | E-mail addresses: |
| (Retired)  | FA29 0E3B 35AF E8AE 6651 | p...@whooppee.com |
| Software Developer | 0786 F758 55DE 53BA 7731 | pgoye...@netbsd.org   |
++--+---+


Re: built-in vs loadable modules and userconf

2019-10-19 Thread Paul Goyette

On Sat, 19 Oct 2019, John D. Baker wrote:


On Sat, 19 Oct 2019, Paul Goyette wrote:


I want to be sure that a disabled built-in module won't cause a loadable
module to be loaded and defeat the disabling of the built-in "raid" device.


Disabling the raid device in userconf doesn't disable the raidframe module.
If the raidframe module is in your kernel, it can only be disabled after
booting, using modunload(8).


All raidframe support is built-in on my kernel (includes GENERIC and
elides unnecessary/unwanted items).  If I disable all the raid* stuff,
won't that disable raidframe as well?



From the module(9) perspective, no, the module is not disabled as a

result of userconf activity.

Using userconf to disable the device will prevent all accesses to the
device.  BUT, the module will still be active, and the module's
initialization code will be called.  Any code in the kernel that looks
for auto-configured raid devices will still run.  I would expect it to
fail to create any actual raid instances, but the code should still
execute.


It's important that everything raidframe/raid related be disabled before
the kernel boots, or my RAID will be hosed again and I don't need that
level of stress.  I was lucky to recover it the first time (have
encountered a few damaged files since then).


Given that you've had "issues" with raid's auto-configure before, I
would be reluctant to try this with your current kernel.  I would
suggest that you build a new kernel with no raid config(1)ured.


Also note that disabling a built-in module does not allow you to load a
different instance of that module.  modload(8) will find the built-in module,


I think that means if I disable something via userconf, the on-disk
loadable module won't be loaded.


If the module is built-in, it will never be loaded from the *.kmod file.
But as I pointed out above, disabling in userconf does NOT disable the
module, and does not prevent the module's initialization code from
executing.



I suppose I could whip up a custom kernel without the raidframe/raid
support in it, but it's crucial to keep the raidframe module from being
loaded if any disk is found with RAID autoconf support.


If you build a custom kernel without any raid support, then there will
be no code to look for any auto-configured-raid-sets, and therefore no
code to instantiate any raid devices, and therefore no reason to ever
attempt to autoload the module.

If you're really paranoid, you can also configure your custom kernel
with MODULAR_DEFAULT_AUTOLOAD disabled (this is on by default).


I suppose also I can turn off autoconfig on the RAID before booting the
test kernel (the config file is renamed so it won't be found by the
'rc' scripts).

I have another machine with a local raidframe raid that I can test the
procedure with when current tasks are completed.



Hope this helps!



++--+-------+
| Paul Goyette   | PGP Key fingerprint: | E-mail addresses: |
| (Retired)  | FA29 0E3B 35AF E8AE 6651 | p...@whooppee.com |
| Software Developer | 0786 F758 55DE 53BA 7731 | pgoye...@netbsd.org   |
++--+---+


Build failure

2019-11-29 Thread Paul Goyette

With up-to-date sources, I'm getting

checkflist ===> distrib/sets

===  2 extra files in DESTDIR  =
Files in DESTDIR but missing from flist.
File is obsolete or flist is out of date ?
--
./usr/tests/usr.bin/make/unit-tests/varmod-edge.exp
./usr/tests/usr.bin/make/unit-tests/varmod-edge.mk
=  end of 2 extra files  ===


Looks like the sets lists need to be updated.


++--+---+
| Paul Goyette   | PGP Key fingerprint: | E-mail addresses: |
| (Retired)  | FA29 0E3B 35AF E8AE 6651 | p...@whooppee.com |
| Software Developer | 0786 F758 55DE 53BA 7731 | pgoye...@netbsd.org   |
++--+---+


<    1   2   3   4   5   6   7   8   9   >