Re: USB ure RTL8153 panics on release 6.7 + 001_wscons

2020-05-22 Thread Jonathon Fletcher



> On May 21, 2020, at 7:04 PM, Jonathon Fletcher wrote:
> 
> 
>> On May 21, 2020, at 12:39 AM, Kevin Lo  wrote:
>> 
>> On Tue, May 19, 2020 at 09:52:57AM -0700, Jonathon Fletcher wrote:
>>> 
>>> 
>>>> Synopsis:  ure RTL8153 panics on 6.7 - was stable on 6.6
>>> 
>>>> Environment:
>>> System  : OpenBSD 6.7
>>> Details : OpenBSD 6.7 (GENERIC.MP) #1: Sat May 16 16:33:02 MDT 2020
>>>  
>>> r...@syspatch-67-amd64.openbsd.org:/usr/src/sys/arch/amd64/compile/GENERIC.MP
>>> 
>>> Architecture: OpenBSD.amd64
>>> Machine : amd64
>>> 
>>>> Description:
>>> ** PRELIMINARY BUG REPORT - INCOMPLETE INFO **
>>> 
>>> USB ure with RTL8153 on 6.7 panics. Same hardware stable on 6.6.
>>> 
>>> ure0 at uhub0 port 14 configuration 1 interface 0 "Realtek USB 10/100/1000 
>>> LAN" rev 3.00/30.00 addr 5
>>> ure0: RTL8153 (0x5c30), address a0:ce:c8:cd:ba:d1
>>> rgephy0 at ure0 phy 0: RTL8251 PHY, rev. 0
>>> 
>>> Under load (estimated 50MB/s for approximately 5 minutes), kernel panics:
> 
> ...
> 
>>> Aside:
>>> 
>>> Same panic occurred with RTL8156 - first 6.7 panic. After that I reverted 
>>> to the RTL8153 above in case panic was limited to the newly-supported 
>>> hardware.
>>> 
>>> ure0 at uhub0 port 14 configuration 1 interface 0 "Realtek USB 
>>> 10/100/1G/2.5G LAN" rev 3.20/30.00 addr 5
>>> ure0: RTL8156 (0x7030), address 00:e0:4c:ab:64:5a
>> 
>> Thanks for the report.  Could you test this diff?  Thanks.
> 
> Thank you for the quick patch.
> 
> I have run this at 40-50MB/s for ~20mins for both RTL8153 and RTL8156.
> 
> No panic with your patch.
> 
> I am going to leave it running with the RTL8156 and will send an update if I 
> see any problems.


Kevin,

The panic below occurred with the patch applied, using the RTL8156.

Same dmesg as original report.

assertwaitok: non-zero mutex count: 1
Stopped at db_enter+0x10: popq %rbp
TID PID UID PRFLAGS PFLAGS CPU COMMAND
db_enter() at db_enter+0x10
panic(81c8e578) at panic+0x128
assertwaitok() at assertwaitok+0xc7
mi_switch() at mi_switch+0x40
sleep_finish(800022e81148,1) at sleep_finish+0x84
sleep_finish_all(800022e81148,1) at sleep_finish_all+0x21
tsleep(fd84465a24b0,10,81c9bd4f,0) at tsleep+0xd6
usbd_transfer(fd84465a24b0) at usbd_transfer+0x204
usbd_do_request_flags(8052d500,800022e812a0,800022e8129c,0,0,1388)
 at usbd_do_request_flags+0x139
ure_reset(80533000) at ure_reset+0x5e
ure_stop(80533000) at ure_stop+0x21
ure_encap(80533000,fd808ecf9800,5) at ure_encap+0xf6
ure_start(805330d0) at ure_start+0x98
if_qstart_compat(80533348) at if_qstart_compat+0x2e
end trace frame: 0x800022e81450, count: 0
https://www.openbsd.org/ddb.html describes the minimum info required in bug 
reports. Insufficient info makes it difficult to find and fix bugs.
ddb{1}>

Thanks,
Jonathon



Re: USB ure RTL8153 panics on release 6.7 + 001_wscons

2020-05-21 Thread Jonathon Fletcher


> On May 21, 2020, at 12:39 AM, Kevin Lo  wrote:
> 
> On Tue, May 19, 2020 at 09:52:57AM -0700, Jonathon Fletcher wrote:
>> 
>> 
>>> Synopsis:   ure RTL8153 panics on 6.7 - was stable on 6.6
>> 
>>> Environment:
>>  System  : OpenBSD 6.7
>>  Details : OpenBSD 6.7 (GENERIC.MP) #1: Sat May 16 16:33:02 MDT 2020
>>   
>> r...@syspatch-67-amd64.openbsd.org:/usr/src/sys/arch/amd64/compile/GENERIC.MP
>> 
>>  Architecture: OpenBSD.amd64
>>  Machine : amd64
>> 
>>> Description:
>> ** PRELIMINARY BUG REPORT - INCOMPLETE INFO **
>> 
>> USB ure with RTL8153 on 6.7 panics. Same hardware stable on 6.6.
>> 
>> ure0 at uhub0 port 14 configuration 1 interface 0 "Realtek USB 10/100/1000 
>> LAN" rev 3.00/30.00 addr 5
>> ure0: RTL8153 (0x5c30), address a0:ce:c8:cd:ba:d1
>> rgephy0 at ure0 phy 0: RTL8251 PHY, rev. 0
>> 
>> Under load (estimated 50MB/s for approximately 5 minutes), kernel panics:

...

>> Aside:
>> 
>> Same panic occurred with RTL8156 - first 6.7 panic. After that I reverted to 
>> the RTL8153 above in case panic was limited to the newly-supported hardware.
>> 
>> ure0 at uhub0 port 14 configuration 1 interface 0 "Realtek USB 
>> 10/100/1G/2.5G LAN" rev 3.20/30.00 addr 5
>> ure0: RTL8156 (0x7030), address 00:e0:4c:ab:64:5a
> 
> Thanks for the report.  Could you test this diff?  Thanks.

Kevin,

Thank you for the quick patch.

I have run this at 40-50MB/s for ~20mins for both RTL8153 and RTL8156.

No panic with your patch.

I am going to leave it running with the RTL8156 and will send an update if I 
see any problems.

Thanks,
Jonathon



USB ure RTL8153 panics on release 6.7 + 001_wscons

2020-05-19 Thread Jonathon Fletcher


>Synopsis:  ure RTL8153 panics on 6.7 - was stable on 6.6

>Category:  

>Environment:
System  : OpenBSD 6.7
Details : OpenBSD 6.7 (GENERIC.MP) #1: Sat May 16 16:33:02 MDT 2020
 
r...@syspatch-67-amd64.openbsd.org:/usr/src/sys/arch/amd64/compile/GENERIC.MP

Architecture: OpenBSD.amd64
Machine : amd64

>Description:
** PRELIMINARY BUG REPORT - INCOMPLETE INFO **


USB ure with RTL8153 on 6.7 panics. Same hardware stable on 6.6.

ure0 at uhub0 port 14 configuration 1 interface 0 "Realtek USB 10/100/1000 LAN" 
rev 3.00/30.00 addr 5
ure0: RTL8153 (0x5c30), address a0:ce:c8:cd:ba:d1
rgephy0 at ure0 phy 0: RTL8251 PHY, rev. 0


Under load (estimated 50MB/s for approximately 5 minutes), kernel panics:

panic: assertwaitok: non-zero mutex count: 1
Stopped at db_enter+0x10: popq %rbp
TID PID UID PRFLAGS PFLAGS CPU COMMAND
db_enter() at db_enter+0x10
panic(81c8a198) at panic+0x128
assertwaitok() at assertwaitok+0xc7
mi_switch() at mi_switch+0x40
sleep_finish(800022e816b8,1) at sleep_finish+0x84
sleep_finish_all(800022e816b8,1) at sleep_finish_all+0x21
tsleep(fd84465a24b0,10,81c9c412,0) at tsleep+0xd6
usbd_transfer(fd84465a24b0) at usbd_transfer+0x204
usbd_do_request_flags(8052e600,800022e81810,800022e8180c,0,0,1388)
 at usbd_do_request_flags+0x139
ure_reset(80538000) at ure_reset+0x5e
ure_stop(80538000) at ure_stop+0x21
ure_encap(80538000,fd809e390f00) at ure_encap+0xf2
ure_start(805380d0) at ure_start+0x8b
if_qstart_compat(80538348) at if_qstart_compat+0x2e
end trace frame: 0x800022e819b0, count: 0
https://www.openbsd.org/ddb.html describes the minimum info required in bug 
reports. Insufficient info makes it difficult to find and fix bugs.
ddb{0}> 


The panic above was OCR'd from a photo and may contain errors. I hope to 
provide more info and a full trace later.
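
ddb prints the innermost frame first, so the trace above reads from the bottom up: if_qstart_compat is the outermost caller and tsleep is where the thread tried to sleep. As a reading aid, here is a minimal Python sketch that reverses the frames into call order. It assumes each frame has the form `name(args) at name+0xoffset`; the `call_chain` helper and the regular expression are illustrative and not part of any OpenBSD tool, and the wrapped usbd_do_request_flags frame has been joined onto one line.

```python
import re

# Frames taken from the trace above, innermost first (the order ddb prints them).
# The wrapped usbd_do_request_flags frame is joined onto a single line here.
TRACE = """\
tsleep(fd84465a24b0,10,81c9c412,0) at tsleep+0xd6
usbd_transfer(fd84465a24b0) at usbd_transfer+0x204
usbd_do_request_flags(8052e600,800022e81810,800022e8180c,0,0,1388) at usbd_do_request_flags+0x139
ure_reset(80538000) at ure_reset+0x5e
ure_stop(80538000) at ure_stop+0x21
ure_encap(80538000,fd809e390f00) at ure_encap+0xf2
ure_start(805380d0) at ure_start+0x8b
if_qstart_compat(80538348) at if_qstart_compat+0x2e
"""

# Assumed frame shape: name(args) at name+0xOFFSET
FRAME = re.compile(r"^(\w+)\(.*\) at (\w+)\+0x([0-9a-f]+)$")


def call_chain(trace):
    """Return function names in call order (outermost caller first)."""
    frames = []
    for line in trace.splitlines():
        match = FRAME.match(line.strip())
        if match:
            frames.append(match.group(2))
    # ddb lists the innermost frame first, so reverse to get call order.
    return list(reversed(frames))


if __name__ == "__main__":
    print(" -> ".join(call_chain(TRACE)))
```

Read this way, the sleep in tsleep is reached from the transmit path (ure_start, then ure_encap, ure_stop and ure_reset), which is consistent with the assertwaitok message about a mutex still being held.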


Aside:

Same panic occurred with RTL8156 - first 6.7 panic. After that I reverted to 
the RTL8153 above in case panic was limited to the newly-supported hardware.

ure0 at uhub0 port 14 configuration 1 interface 0 "Realtek USB 10/100/1G/2.5G 
LAN" rev 3.20/30.00 addr 5
ure0: RTL8156 (0x7030), address 00:e0:4c:ab:64:5a


>How-To-Repeat:
Standard 6.7 amd64 install with 001_wscons syspatch and either RTL8153 
or RTL8156.
Run under network load for a few minutes.

>Fix:
Workaround only: I removed the ure USB adapter and reverted to an old axe device.



dmesg:
OpenBSD 6.7 (GENERIC.MP) #1: Sat May 16 16:33:02 MDT 2020

r...@syspatch-67-amd64.openbsd.org:/usr/src/sys/arch/amd64/compile/GENERIC.MP
real mem = 17076674560 (16285MB)
avail mem = 16546508800 (15779MB)
mpath0 at root
scsibus0 at mpath0: 256 targets
mainbus0 at root
bios0 at mainbus0: SMBIOS rev. 2.8 @ 0xb4eda000 (53 entries)
bios0: vendor Intel Corporation version "RYBDWi35.86A.0383.2019.1030.1528" date 
10/30/2019
bios0: Intel Corporation NUC5i7RYB
acpi0 at bios0: ACPI 5.0
acpi0: sleep states S0 S3 S4 S5
acpi0: tables DSDT FACP APIC FPDT FIDT MCFG HPET SSDT UEFI LPIT SSDT ASF! SSDT 
SSDT SSDT DMAR BGRT
acpi0: wakeup devices PEG0(S4) PEGP(S4) PEG1(S4) PEGP(S4) PEG2(S4) PEGP(S4) 
RP01(S4) PXSX(S4) RP02(S4) PXSX(S4) RP03(S4) PXSX(S4) RP04(S4) PXSX(S4) 
RP05(S4) PXSX(S4) [...]
acpitimer0 at acpi0: 3579545 Hz, 24 bits
acpimadt0 at acpi0 addr 0xfee0: PC-AT compat
cpu0 at mainbus0: apid 0 (boot processor)
cpu0: Intel(R) Core(TM) i7-5557U CPU @ 3.10GHz, 3392.63 MHz, 06-3d-04
cpu0: 
FPU,VME,DE,PSE,TSC,MSR,PAE,MCE,CX8,APIC,SEP,MTRR,PGE,MCA,CMOV,PAT,PSE36,CFLUSH,DS,ACPI,MMX,FXSR,SSE,SSE2,SS,HTT,TM,PBE,SSE3,PCLMUL,DTES64,MWAIT,DS-CPL,VMX,EST,TM2,SSSE3,SDBG,FMA3,CX16,xTPR,PDCM,PCID,SSE4.1,SSE4.2,x2APIC,MOVBE,POPCNT,DEADLINE,AES,XSAVE,AVX,F16C,RDRAND,NXE,PAGE1GB,RDTSCP,LONG,LAHF,ABM,3DNOWP,PERF,ITSC,FSGSBASE,TSC_ADJUST,BMI1,AVX2,SMEP,BMI2,ERMS,INVPCID,RDSEED,ADX,SMAP,PT,MD_CLEAR,IBRS,IBPB,STIBP,L1DF,SSBD,SENSOR,ARAT,XSAVEOPT,MELTDOWN
cpu0: 256KB 64b/line 8-way L2 cache
cpu0: smt 0, core 0, package 0
mtrr: Pentium Pro MTRR support, 10 var ranges, 88 fixed ranges
cpu0: apic clock running at 99MHz
cpu0: mwait min=64, max=64, C-substates=0.2.1.2.4.1.1.1, IBE
cpu1 at mainbus0: apid 2 (application processor)
cpu1: Intel(R) Core(TM) i7-5557U CPU @ 3.10GHz, 3392.17 MHz, 06-3d-04
cpu1: 
FPU,VME,DE,PSE,TSC,MSR,PAE,MCE,CX8,APIC,SEP,MTRR,PGE,MCA,CMOV,PAT,PSE36,CFLUSH,DS,ACPI,MMX,FXSR,SSE,SSE2,SS,HTT,TM,PBE,SSE3,PCLMUL,DTES64,MWAIT,DS-CPL,VMX,EST,TM2,SSSE3,SDBG,FMA3,CX16,xTPR,PDCM,PCID,SSE4.1,SSE4.2,x2APIC,MOVBE,POPCNT,DEADLINE,AES,XSAVE,AVX,F16C,RDRAND,NXE,PAGE1GB,RDTSCP,LONG,LAHF,ABM,3DNOWP,PERF,ITSC,FSGSBASE,TSC_ADJUST,BMI1,AVX2,SMEP,BMI2,ERMS,INVPCID,RDSEED,ADX,SMAP,PT,MD_CLEAR,IBRS,IBPB,STIBP,L1DF,SSBD,SENSOR,ARAT,XSAVEOPT,MELTDOWN
cpu1: 256KB 64b/line 8-way L2 cache
cpu1: smt 0, core 1, package 0
cpu2 at mainbus0: apid 1 (application processor)
cpu2: Intel(R) Core(TM) i7-5557U CPU @ 3.10GHz, 3392.16 MHz, 06-3d-04
cpu2: