Re: iwm: Fatal firmware error (could not add sta (error 35))

2021-05-14 Thread Stefan Sperling
On Fri, May 14, 2021 at 02:39:15PM +0200, Matthias Schmidt wrote:
> I am now running 
> 
> OpenBSD 6.9-current (GENERIC.MP) #17: Wed May 12 11:14:50 MDT 2021
> dera...@amd64.openbsd.org:/usr/src/sys/arch/amd64/compile/GENERIC.MP
> 
> which contains your described fix.  Since then I occasionally see a new
> firmare error I haven't seen before.  Most of the time the interface
> recovers but sometimes I have to bring it down and up again.

This looks like your AP is disappearing for some reason.
The AP may be switching channels or you may have moved out of range.
And the driver doesn't handle the resulting state transitions correctly.

Thanks for reporting this! I will look into it.

> May 14 13:59:59 sigma /bsd: iwm0: received msg 1/2 of the group key handshake 
> from cc:ce:1e:8b:cf:d1
> May 14 13:59:59 sigma /bsd: iwm0: sending msg 2/2 of the group key handshake 
> to cc:ce:1e:8b:cf:d1
> May 14 14:01:20 sigma /bsd: iwm0: RUN -> ASSOC
> May 14 14:01:20 sigma /bsd: iwm0: sending action to cc:ce:1e:8b:cf:d1 on 
> channel 100 mode 11n
> May 14 14:01:20 sigma /bsd: iwm0: sending assoc_req to cc:ce:1e:8b:cf:d1 on 
> channel 100 mode 11n
> May 14 14:01:24 sigma /bsd: iwm0: association timed out for cc:ce:1e:8b:cf:d1
> May 14 14:01:24 sigma /bsd: iwm0: dumping device error log
> May 14 14:01:24 sigma /bsd: iwm0: Start Error Log Dump:
> May 14 14:01:24 sigma /bsd: iwm0: Status: 0x9, count: 6
> May 14 14:01:24 sigma /bsd: iwm0: 0x3421 | ADVANCED_SYSASSERT  
> May 14 14:01:24 sigma /bsd: iwm0: 0220 | trm_hw_status0
> May 14 14:01:24 sigma /bsd: iwm0:  | trm_hw_status1
> May 14 14:01:24 sigma /bsd: iwm0: 00023FDC | branchlink2
> May 14 14:01:24 sigma /bsd: iwm0: 0003915A | interruptlink1
> May 14 14:01:24 sigma /bsd: iwm0:  | interruptlink2
> May 14 14:01:24 sigma /bsd: iwm0:  | data1
> May 14 14:01:24 sigma /bsd: iwm0: 0001 | data2
> May 14 14:01:24 sigma /bsd: iwm0: DEADBEEF | data3
> May 14 14:01:24 sigma /bsd: iwm0:  | beacon time
> May 14 14:01:24 sigma /bsd: iwm0: E8F0FA81 | tsf low
> May 14 14:01:24 sigma /bsd: iwm0: 0024 | tsf hi
> May 14 14:01:24 sigma /bsd: iwm0:  | time gp1
> May 14 14:01:24 sigma /bsd: iwm0: 20010BB2 | time gp2
> May 14 14:01:24 sigma /bsd: iwm0: 0001 | uCode revision type
> May 14 14:01:24 sigma /bsd: iwm0: 0022 | uCode version major
> May 14 14:01:24 sigma /bsd: iwm0:  | uCode version minor
> May 14 14:01:24 sigma /bsd: iwm0: 0230 | hw version
> May 14 14:01:24 sigma /bsd: iwm0: 18089000 | board version
> May 14 14:01:24 sigma /bsd: iwm0: 007C0028 | hcmd
> May 14 14:01:24 sigma /bsd: iwm0: 24022082 | isr0
> May 14 14:01:24 sigma /bsd: iwm0: 0100 | isr1
> May 14 14:01:24 sigma /bsd: iwm0: 08201802 | isr2
> May 14 14:01:24 sigma /bsd: iwm0: 004140C0 | isr3
> May 14 14:01:24 sigma /bsd: iwm0:  | isr4
> May 14 14:01:24 sigma /bsd: iwm0: 007B002B | last cmd Id
> May 14 14:01:24 sigma /bsd: iwm0:  | wait_event
> May 14 14:01:24 sigma /bsd: iwm0: 0080 | l2p_control
> May 14 14:01:24 sigma /bsd: iwm0: 00018010 | l2p_duration
> May 14 14:01:24 sigma /bsd: iwm0: 003F | l2p_mhvalid
> May 14 14:01:24 sigma /bsd: iwm0:  | l2p_addr_match
> May 14 14:01:24 sigma /bsd: iwm0: 000D | lmpm_pmg_sel
> May 14 14:01:24 sigma /bsd: iwm0: 30101345 | timestamp
> May 14 14:01:24 sigma /bsd: iwm0: A8B8 | flow_handler
> May 14 14:01:24 sigma /bsd: iwm0: Start UMAC Error Log Dump:
> May 14 14:01:24 sigma /bsd: iwm0: Status: 0x9, count: 7
> May 14 14:01:24 sigma /bsd: iwm0: 0x0070 | NMI_INTERRUPT_LMAC_FATAL
> May 14 14:01:24 sigma /bsd: iwm0: 0x | umac branchlink1
> May 14 14:01:24 sigma /bsd: iwm0: 0xC0086964 | umac branchlink2
> May 14 14:01:24 sigma /bsd: iwm0: 0xC0083A94 | umac interruptlink1
> May 14 14:01:24 sigma /bsd: iwm0: 0xC0083A94 | umac interruptlink2
> May 14 14:01:24 sigma /bsd: iwm0: 0x0800 | umac data1
> May 14 14:01:24 sigma /bsd: iwm0: 0xC0083A94 | umac data2
> May 14 14:01:24 sigma /bsd: iwm0: 0xDEADBEEF | umac data3
> May 14 14:01:24 sigma /bsd: iwm0: 0x0022 | umac major
> May 14 14:01:24 sigma /bsd: iwm0: 0x | umac minor
> May 14 14:01:24 sigma /bsd: iwm0: 0xC088628C | frame pointer
> May 14 14:01:24 sigma /bsd: iwm0: 0xC088628C | stack pointer
> May 14 14:01:24 sigma /bsd: iwm0: 0x007C0028 | last host cmd
> May 14 14:01:24 sigma /bsd: iwm0: 0x | isr status reg
> May 14 14:01:24 sigma /bsd: driver status:
> May 14 14:01:24 sigma /bsd:   tx ring  0: qid=0  cur=125 queued=1  
> May 14 14:01:24 sigma /bsd:   tx ring  1: qid=1  cur=0   queued=0  
> May 14 14:01:24 sigma /bsd:   tx ring  2: qid=2  cur=0   queued=0  
> May 14 14:01:24 sigma /bsd:   tx ring  3: qid=3  cur=0   queued=0  
> May 14 14:01:24 sigma /bsd:   tx ring  4: qid=4  cur=0   queued=0  
> May 14 14:01:24 sigma /bsd:   tx ring  5: qid=5  cur=119 queued=2  
> May 14 14:01:24 sigma /bsd:   tx ring  6: qid=6  cur=0   queued=0  
> May 14 14:01:24 sigma /bsd:   tx ring  7: 

Re: iwm: Fatal firmware error (could not add sta (error 35))

2021-05-14 Thread Matthias Schmidt
Hi Stefan,

* Stefan Sperling wrote:
> On Tue, May 11, 2021 at 11:44:17AM +0200, Stefan Sperling wrote:
> > Can you please run with this and let me know if it changes anything?
> 
> I have finally managed to reproduce the problem locally by playing around
> with forced background scans and roaming. This patch is a superset of the
> previous patch. It should fix the 'add sta' problem and also fixes a couple
> of small bugs I found along the way.

I am now running 

OpenBSD 6.9-current (GENERIC.MP) #17: Wed May 12 11:14:50 MDT 2021
dera...@amd64.openbsd.org:/usr/src/sys/arch/amd64/compile/GENERIC.MP

which contains your described fix.  Since then I occasionally see a new
firmare error I haven't seen before.  Most of the time the interface
recovers but sometimes I have to bring it down and up again.

Cheers

Matthias

May 14 13:59:59 sigma /bsd: iwm0: received msg 1/2 of the group key handshake 
from cc:ce:1e:8b:cf:d1
May 14 13:59:59 sigma /bsd: iwm0: sending msg 2/2 of the group key handshake to 
cc:ce:1e:8b:cf:d1
May 14 14:01:20 sigma /bsd: iwm0: RUN -> ASSOC
May 14 14:01:20 sigma /bsd: iwm0: sending action to cc:ce:1e:8b:cf:d1 on 
channel 100 mode 11n
May 14 14:01:20 sigma /bsd: iwm0: sending assoc_req to cc:ce:1e:8b:cf:d1 on 
channel 100 mode 11n
May 14 14:01:24 sigma /bsd: iwm0: association timed out for cc:ce:1e:8b:cf:d1
May 14 14:01:24 sigma /bsd: iwm0: dumping device error log
May 14 14:01:24 sigma /bsd: iwm0: Start Error Log Dump:
May 14 14:01:24 sigma /bsd: iwm0: Status: 0x9, count: 6
May 14 14:01:24 sigma /bsd: iwm0: 0x3421 | ADVANCED_SYSASSERT  
May 14 14:01:24 sigma /bsd: iwm0: 0220 | trm_hw_status0
May 14 14:01:24 sigma /bsd: iwm0:  | trm_hw_status1
May 14 14:01:24 sigma /bsd: iwm0: 00023FDC | branchlink2
May 14 14:01:24 sigma /bsd: iwm0: 0003915A | interruptlink1
May 14 14:01:24 sigma /bsd: iwm0:  | interruptlink2
May 14 14:01:24 sigma /bsd: iwm0:  | data1
May 14 14:01:24 sigma /bsd: iwm0: 0001 | data2
May 14 14:01:24 sigma /bsd: iwm0: DEADBEEF | data3
May 14 14:01:24 sigma /bsd: iwm0:  | beacon time
May 14 14:01:24 sigma /bsd: iwm0: E8F0FA81 | tsf low
May 14 14:01:24 sigma /bsd: iwm0: 0024 | tsf hi
May 14 14:01:24 sigma /bsd: iwm0:  | time gp1
May 14 14:01:24 sigma /bsd: iwm0: 20010BB2 | time gp2
May 14 14:01:24 sigma /bsd: iwm0: 0001 | uCode revision type
May 14 14:01:24 sigma /bsd: iwm0: 0022 | uCode version major
May 14 14:01:24 sigma /bsd: iwm0:  | uCode version minor
May 14 14:01:24 sigma /bsd: iwm0: 0230 | hw version
May 14 14:01:24 sigma /bsd: iwm0: 18089000 | board version
May 14 14:01:24 sigma /bsd: iwm0: 007C0028 | hcmd
May 14 14:01:24 sigma /bsd: iwm0: 24022082 | isr0
May 14 14:01:24 sigma /bsd: iwm0: 0100 | isr1
May 14 14:01:24 sigma /bsd: iwm0: 08201802 | isr2
May 14 14:01:24 sigma /bsd: iwm0: 004140C0 | isr3
May 14 14:01:24 sigma /bsd: iwm0:  | isr4
May 14 14:01:24 sigma /bsd: iwm0: 007B002B | last cmd Id
May 14 14:01:24 sigma /bsd: iwm0:  | wait_event
May 14 14:01:24 sigma /bsd: iwm0: 0080 | l2p_control
May 14 14:01:24 sigma /bsd: iwm0: 00018010 | l2p_duration
May 14 14:01:24 sigma /bsd: iwm0: 003F | l2p_mhvalid
May 14 14:01:24 sigma /bsd: iwm0:  | l2p_addr_match
May 14 14:01:24 sigma /bsd: iwm0: 000D | lmpm_pmg_sel
May 14 14:01:24 sigma /bsd: iwm0: 30101345 | timestamp
May 14 14:01:24 sigma /bsd: iwm0: A8B8 | flow_handler
May 14 14:01:24 sigma /bsd: iwm0: Start UMAC Error Log Dump:
May 14 14:01:24 sigma /bsd: iwm0: Status: 0x9, count: 7
May 14 14:01:24 sigma /bsd: iwm0: 0x0070 | NMI_INTERRUPT_LMAC_FATAL
May 14 14:01:24 sigma /bsd: iwm0: 0x | umac branchlink1
May 14 14:01:24 sigma /bsd: iwm0: 0xC0086964 | umac branchlink2
May 14 14:01:24 sigma /bsd: iwm0: 0xC0083A94 | umac interruptlink1
May 14 14:01:24 sigma /bsd: iwm0: 0xC0083A94 | umac interruptlink2
May 14 14:01:24 sigma /bsd: iwm0: 0x0800 | umac data1
May 14 14:01:24 sigma /bsd: iwm0: 0xC0083A94 | umac data2
May 14 14:01:24 sigma /bsd: iwm0: 0xDEADBEEF | umac data3
May 14 14:01:24 sigma /bsd: iwm0: 0x0022 | umac major
May 14 14:01:24 sigma /bsd: iwm0: 0x | umac minor
May 14 14:01:24 sigma /bsd: iwm0: 0xC088628C | frame pointer
May 14 14:01:24 sigma /bsd: iwm0: 0xC088628C | stack pointer
May 14 14:01:24 sigma /bsd: iwm0: 0x007C0028 | last host cmd
May 14 14:01:24 sigma /bsd: iwm0: 0x | isr status reg
May 14 14:01:24 sigma /bsd: driver status:
May 14 14:01:24 sigma /bsd:   tx ring  0: qid=0  cur=125 queued=1  
May 14 14:01:24 sigma /bsd:   tx ring  1: qid=1  cur=0   queued=0  
May 14 14:01:24 sigma /bsd:   tx ring  2: qid=2  cur=0   queued=0  
May 14 14:01:24 sigma /bsd:   tx ring  3: qid=3  cur=0   queued=0  
May 14 14:01:24 sigma /bsd:   tx ring  4: qid=4  cur=0   queued=0  
May 14 14:01:24 sigma /bsd:   tx ring  5: qid=5  cur=119 queued=2  
May 14 14:01:24 sigma /bsd:   tx ring  6: qid=6  cur=0   queued=0  
May 14 14:01:24 sigma /bsd:   tx ring  7: 

DDB Prompt - Stopped at i915_ggtt_pin+0x29:

2021-05-14 Thread Antonino Sidoti
Hello,

This is fresh install of 6.9 release, no upgrade. After installing firmware 
updates on first boot and then doing a reboot the system gets a DDB prompt. I 
have tried a snapshot and it works fine. I will go back to 6.8 for now.

Thanks
Nino

OpenBSD 6.9 (GENERIC.MP) #473: Mon Apr 19 10:40:28 MDT 2021 
  dera...@amd64.openbsd.org:/usr/src/sys/arch/amd64/compile/GENERIC.MP
real mem = 4183560192 (3989MB)  
>> OpenBSD/amd64 BOOT 3.53
avail mem = 4041379840 (3854MB)
random: good seed from bootblocks
mpath0 at root
scsibus0 at mpath0: 256 targets
mainbus0 at root
bios0 at mainbus0: SMBIOS rev. 3.0 @ 0xec120 (49 entries)
bios0: vendor American Megatrends Inc. version "5.11" date 10/22/2019
bios0: Protectli FW2B
acpi0 at bios0: ACPI 5.0
acpi0: sleep states S0 S5
acpi0: tables DSDT FACP APIC FPDT FIDT MCFG SSDT SSDT SSDT UEFI LPIT CSRT
acpi0: wakeup devices SIO1(S0) BRC1(S0) XHC1(S4) HDEF(S4) RP01(S4) PXSX(S4) 
RP02(S4) PXSX(S4) RP03(S4) PXSX(S4) RP04(S4) PXSX(S4)
acpitimer0 at acpi0: 3579545 Hz, 24 bits
acpimadt0 at acpi0 addr 0xfee0: PC-AT compat
cpu0 at mainbus0: apid 0 (boot processor)
cpu0: Intel(R) Celeron(R) CPU J3060 @ 1.60GHz, 1600.41 MHz, 06-4c-04
cpu0: 
FPU,VME,DE,PSE,TSC,MSR,PAE,MCE,CX8,APIC,SEP,MTRR,PGE,MCA,CMOV,PAT,PSE36,CFLUSH,DS,ACPI,MMX,FXSR,SSE,SSE2,SS,HTT,TM,PBE,SSE3,PCLMUL,DTES64,MWAIT,DS-CPL,VMX,EST,TM2,SSSE3,CX16,xTPR,PDCM,SSE4.1,SSE4.2,MOVBE,POPCNT,DEADLINE,AES,RDRAND,NXE,RDTSCP,LONG,LAHF,3DNOWP,PERF,ITSC,TSC_ADJUST,SMEP,ERMS,MD_CLEAR,IBRS,IBPB,STIBP,SENSOR,ARAT,MELTDOWN
cpu0: 1MB 64b/line 16-way L2 cache
cpu0: smt 0, core 0, package 0
mtrr: Pentium Pro MTRR support, 8 var ranges, 88 fixed ranges
cpu0: apic clock running at 80MHz
cpu0: mwait min=64, max=64, C-substates=0.2.0.0.0.0.3.3, IBE
cpu1 at mainbus0: apid 4 (application processor)
cpu1: Intel(R) Celeron(R) CPU J3060 @ 1.60GHz, 1599.97 MHz, 06-4c-04
cpu1: 
FPU,VME,DE,PSE,TSC,MSR,PAE,MCE,CX8,APIC,SEP,MTRR,PGE,MCA,CMOV,PAT,PSE36,CFLUSH,DS,ACPI,MMX,FXSR,SSE,SSE2,SS,HTT,TM,PBE,SSE3,PCLMUL,DTES64,MWAIT,DS-CPL,VMX,EST,TM2,SSSE3,CX16,xTPR,PDCM,SSE4.1,SSE4.2,MOVBE,POPCNT,DEADLINE,AES,RDRAND,NXE,RDTSCP,LONG,LAHF,3DNOWP,PERF,ITSC,TSC_ADJUST,SMEP,ERMS,MD_CLEAR,IBRS,IBPB,STIBP,SENSOR,ARAT,MELTDOWN
cpu1: 1MB 64b/line 16-way L2 cache
cpu1: smt 0, core 2, package 0
ioapic0 at mainbus0: apid 1 pa 0xfec0, version 20, 115 pins
acpimcfg0 at acpi0
acpimcfg0: addr 0xe000, bus 0-255
acpiprt0 at acpi0: bus 0 (PCI0)
acpiprt1 at acpi0: bus 1 (RP01)
acpiprt2 at acpi0: bus 2 (RP02)
acpiprt3 at acpi0: bus -1 (RP03)
acpiprt4 at acpi0: bus -1 (RP04)
acpiec0 at acpi0: not present
acpicmos0 at acpi0
acpipci0 at acpi0 PCI0: 0x0004 0x0011 0x0001
"BCM2E64" at acpi0 not configured
"BCM4752" at acpi0 not configured
"SMO91D0" at acpi0 not configured
"INTCF1C" at acpi0 not configured
acpibtn0 at acpi0: SLPB
"PNP0C0B" at acpi0 not configured
acpicpu0 at acpi0: C3(10@1000 mwait.1@0x64), C2(10@500 mwait.1@0x58), C1(1000@1 
mwait.1)
acpicpu1 at acpi0: C3(10@1000 mwait.1@0x64), C2(10@500 mwait.1@0x58), C1(1000@1 
mwait.1)
acpipwrres0 at acpi0: ID3C, resource for ISP3
acpipwrres1 at acpi0: CLK0, resource for CAMD
acpipwrres2 at acpi0: CLK0, resource for CAM1
acpipwrres3 at acpi0: CLK1, resource for CAM2, CAM3
acpipwrres4 at acpi0: USBC, resource for XHC1
acpipwrres5 at acpi0: FN00, resource for FAN0
acpitz0 at acpi0: critical temperature is 95 degC
acpivideo0 at acpi0: GFX0
acpivout0 at acpivideo0: DD1F
cpu0: using VERW MDS workaround
pci0 at mainbus0 bus 0
pchb0 at pci0 dev 0 function 0 "Intel Braswell Host" rev 0x35
inteldrm0 at pci0 dev 2 function 0 "Intel HD Graphics" rev 0x35
drm0 at inteldrm0
inteldrm0: msi, CHERRYVIEW, gen 8
ahci0 at pci0 dev 19 function 0 "Intel Braswell AHCI" rev 0x35: msi, AHCI 1.3.1
ahci0: port 0: 6.0Gb/s
ahci0: PHY offline on port 1
scsibus1 at ahci0: 32 targets
sd0 at scsibus1 targ 0 lun 0:  naa.50026b7782ae0b59
sd0: 114473MB, 512 bytes/sector, 234441648 sectors, thin
xhci0 at pci0 dev 20 function 0 "Intel Braswell xHCI" rev 0x35: msi, xHCI 1.0
usb0 at xhci0: USB revision 3.0
uhub0 at usb0 configuration 1 interface 0 "Intel xHCI root hub" rev 3.00/1.00 
addr 1
"Intel Braswell TXE" rev 0x35 at pci0 dev 26 function 0 not configured
azalia0 at pci0 dev 27 function 0 "Intel Braswell HD Audio" rev 0x35: msi
azalia0: codecs: Realtek ALC662, Intel/0x2883, using Realtek ALC662
audio0 at azalia0
ppb0 at pci0 dev 28 function 0 "Intel Braswell PCIE" rev 0x35: msi
pci1 at ppb0 bus 1
em0 at pci1 dev 0 function 0 "Intel I211" rev 0x03: msi, address 
00:e0:67:15:e7:82
ppb1 at pci0 dev 28 function 1 "Intel Braswell PCIE" rev 0x35: msi
pci2 at ppb1 bus 2
em1 at pci2 dev 0 function 0 "Intel I211" rev 0x03: msi, address 
00:e0:67:15:e7:83
pcib0 at pci0 dev 31 function 0 "Intel Braswell PCU LPC" rev 0x35
ichiic0 at pci0 dev 31 function 3 "Intel Braswell SMBus" rev 0x35: apic 1 int 18
iic0 at ichiic0
spdmem0 at iic0 addr 0x50: 4GB DDR3 SDRAM PC3-12800 SO-DIMM
isa0