Re: Kernel Panic on high bandwidth transfer over wifi

2018-08-29 Thread Eric Dumazet



On 08/29/2018 04:42 AM, Nathaniel Munk wrote:
> Hi all,
> I'm running Arch Linux on kernel 4.18.5 (same issue on both arch-provided 
> kernel and mainline built-from-source). There is an issue whereby the kernel 
> crashes when transferring at high bandwidths (approx 6mB/s) over a specific 
> wifi connection. I can only reproduce the issue when using the Personal 
> Hotspot on my iPhone 6S+, but can reproduce it very consistently on that 
> connection.
> 
> More often than not, any download reaching this speed will cause a panic, but 
> if the download is immediately terminated at the first error the system can 
> recover (and doing this I have obtained the attached logs). Unfortunately, I 
> have not had access to a second machine to obtain the netconsole printout of 
> the panic.
> 
> As above, high-bandwidth transfers on other wifi networks do not cause the 
> issue (nor on ethernet connections).
> 
> As you can see from the attached log, the issue appears at tcp_recvmsg+0x579 
> and net_tx_action+0x1fe. At both these positions (net/ipv4/tcp.c:2000 and 
> net/core/dev.c:4279 in mainline 4.18.5), a member of the skb struct is called.
> 
> Thank you for your time (and I apologize if this is spurious or badly worded, 
> this is my first bug report), and please don't hesitate to let me know if 
> there's anything else I can do to help work this out.
> 
> Regards,
> ---
> Nathaniel Munk
> nathan...@munk.com.au
> 

Unfortunately there is no attached log ;)



Re: Kernel Panic on high bandwidth transfer over wifi

2018-08-29 Thread Nathaniel Munk
Of course I did, sorry.

---
Nathaniel Munk
nathan...@munk.com.au
0435 726 099

‐‐‐ Original Message ‐‐‐
On August 29, 2018 10:09 PM, Willy Tarreau  wrote:

> On Wed, Aug 29, 2018 at 11:42:44AM +, Nathaniel Munk wrote:
>
> > As you can see from the attached log
>
> You apparently forgot to attach the log.
>
> Willy


[ 1242.620637] TCP recvmsg seq # bug: copied 93359823, seq 1, rcvnxt 93359D8F, fl 8000
[ 1242.620700] WARNING: CPU: 0 PID: 10255 at net/ipv4/tcp.c:2003 tcp_recvmsg+0x579/0xc70
[ 1242.620704] Modules linked in: ccm nf_tables_set nft_fib_inet nft_fib_ipv4 nft_fib_ipv6 nft_fib nft_reject_inet nf_reject_ipv4 nf_reject_ipv6 nft_reject nft_ct nft_chain_nat_ipv6 nft_chain_nat_ipv4 nf_tables ebtable_nat ebtable_broute bridge stp llc ip6table_nat nf_conntrack_ipv6 nf_defrag_ipv6 nf_nat_ipv6 ip6table_mangle ip6table_raw ip6table_security iptable_nat nf_conntrack_ipv4 nf_defrag_ipv4 nf_nat_ipv4 nf_nat iptable_mangle iptable_raw iptable_security nf_conntrack libcrc32c ip_set nfnetlink ebtable_filter ebtables ip6table_filter ip6_tables iptable_filter bpfilter 8021q mrp ipheth btusb btrtl btbcm btintel uvcvideo bluetooth videobuf2_vmalloc videobuf2_memops videobuf2_v4l2 videobuf2_common videodev media ecdh_generic joydev mousedev nls_iso8859_1 nls_cp437 vfat fat arc4 snd_hda_codec_hdmi
[ 1242.620797]  snd_soc_skl snd_soc_skl_ipc iwlmvm snd_soc_sst_ipc snd_soc_sst_dsp snd_hda_ext_core snd_soc_acpi snd_soc_core mei_wdt mac80211 snd_hda_codec_realtek snd_compress iTCO_wdt ac97_bus snd_hda_codec_generic snd_pcm_dmaengine iTCO_vendor_support intel_rapl snd_hda_intel x86_pkg_temp_thermal intel_powerclamp snd_hda_codec coretemp kvm_intel snd_hda_core wmi_bmof iwlwifi kvm cfg80211 snd_hwdep input_leds snd_pcm thinkpad_acpi snd_timer psmouse irqbypass nvram mei_me intel_cstate rfkill intel_uncore e1000e intel_rapl_perf mei i2c_i801 snd intel_pch_thermal soundcore ac tpm_tis tpm_tis_core tpm rng_core led_class evdev battery wmi rtc_cmos mac_hid pcc_cpufreq ip_tables x_tables ext4 crc32c_generic crc16 mbcache jbd2 fscrypto algif_skcipher af_alg dm_crypt dm_mod sd_mod crct10dif_pclmul crc32_pclmul
[ 1242.620885]  crc32c_intel ghash_clmulni_intel pcbc serio_raw atkbd libps2 ahci libahci aesni_intel libata aes_x86_64 crypto_simd xhci_pci cryptd glue_helper scsi_mod xhci_hcd i8042 serio hid_generic usbhid usbcore usb_common hid intel_agp i915 i2c_algo_bit drm_kms_helper syscopyarea sysfillrect sysimgblt fb_sys_fops drm intel_gtt agpgart
[ 1242.620933] CPU: 0 PID: 10255 Comm: IOCP Thread 0 Tainted: G U  W 4.18.5-arch1-1-ARCH #1
[ 1242.620937] Hardware name: LENOVO 20FN001CAU/20FN001CAU, BIOS R06ET59W (1.33 ) 02/27/2018
[ 1242.620949] RIP: 0010:tcp_recvmsg+0x579/0xc70
[ 1242.620951] Code: fb ff ff 4c 89 e0 41 8b 8d 38 05 00 00 44 8b 44 24 2c 89 de 48 c7 c7 28 f5 6f 9a 4c 89 54 24 08 48 89 44 24 10 e8 11 f8 9f ff <0f> 0b 48 8b 44 24 10 4c 8b 54 24 08 8b 4c 24 3c 39 4c 24 38 0f 8c 
[ 1242.621003] RSP: 0018:b0de41223bb0 EFLAGS: 00010282
[ 1242.621007] RAX:  RBX: 93359823 RCX: 0001
[ 1242.621010] RDX: 8001 RSI: 9a6811ce RDI: 
[ 1242.621013] RBP: b0de41223c70 R08: 99cddf10 R09: 03c4
[ 1242.621016] R10: 0008 R11: 9ae04f2d R12: 8a6a70e76a00
[ 1242.621018] R13: 8a6a39ea18c0 R14:  R15: 8a6a39ea1dfc
[ 1242.621022] FS:  () GS:8a6a8140(0063) knlGS:ed455b40
[ 1242.621026] CS:  0010 DS: 002b ES: 002b CR0: 80050033
[ 1242.621028] CR2: e9f68000 CR3: 0002347aa004 CR4: 003606f0
[ 1242.621032] DR0:  DR1:  DR2: 
[ 1242.621035] DR3:  DR6: fffe0ff0 DR7: 0400
[ 1242.621038] Call Trace:
[ 1242.621049]  ? update_rq_clock+0x33/0x120
[ 1242.621053]  ? compat_import_iovec+0x37/0xcd
[ 1242.621058]  ? __switch_to_asm+0x40/0x70
[ 1242.621061]  inet_recvmsg+0x5b/0x100
[ 1242.621066]  ___sys_recvmsg+0xdd/0x1e0
[ 1242.621070]  ? __switch_to_asm+0x34/0x70
[ 1242.621074]  ? _raw_spin_unlock_irq+0x1d/0x30
[ 1242.621077]  ? finish_task_switch+0x83/0x2c0
[ 1242.621080]  ? tcp_keepalive_timer.cold.3+0x19/0x19
[ 1242.621084]  ? tcp_poll+0x12e/0x260
[ 1242.621086]  ? sock_poll+0x61/0xb0
[ 1242.621091]  ? ep_item_poll.isra.1+0x40/0xc0
[ 1242.621095]  ? ep_send_events_proc+0x7b/0x1a0
[ 1242.621098]  ? __ia32_sys_epoll_ctl+0x20/0x20
[ 1242.621101]  ? preempt_count_add+0x68/0xa0
[ 1242.621106]  ? _raw_spin_lock_irqsave+0x25/0x50
[ 1242.621112]  ? __fget+0x6e/0xa0
[ 1242.621116]  __sys_recvmsg+0x54/0xa0
[ 1242.621124]  __ia32_compat_sys_socketcall+0x174/0x300
[ 1242.621129]  ? do_epoll_wait+0x8b/0xd0
[ 1242.621135]  do_fast_syscall_32+0xa7/0x2a0
[ 1242.621139]  entry_SYSENTER_compat+0x7f/0x91
[ 1242.621144] ---[ end trace dc996496c7568a8f ]---
[ 1242.621232] BUG: unable to handle kernel paging request at 00020024cb72
[ 1242.621238] 

Re: Kernel Panic on high bandwidth transfer over wifi

2018-08-29 Thread Willy Tarreau
On Wed, Aug 29, 2018 at 11:42:44AM +, Nathaniel Munk wrote:
> As you can see from the attached log

You apparently forgot to attach the log.

Willy


Kernel Panic on high bandwidth transfer over wifi

2018-08-29 Thread Nathaniel Munk
Hi all,
I'm running Arch Linux on kernel 4.18.5 (same issue on both arch-provided 
kernel and mainline built-from-source). There is an issue whereby the kernel 
crashes when transferring at high bandwidths (approx 6mB/s) over a specific 
wifi connection. I can only reproduce the issue when using the Personal Hotspot 
on my iPhone 6S+, but can reproduce it very consistently on that connection.

More often than not, any download reaching this speed will cause a panic, but 
if the download is immediately terminated at the first error the system can 
recover (and doing this I have obtained the attached logs). Unfortunately, I 
have not had access to a second machine to obtain the netconsole printout of 
the panic.

As above, high-bandwidth transfers on other wifi networks do not cause the 
issue (nor on ethernet connections).

As you can see from the attached log, the issue appears at tcp_recvmsg+0x579 
and net_tx_action+0x1fe. At both these positions (net/ipv4/tcp.c:2000 and 
net/core/dev.c:4279 in mainline 4.18.5), a member of the skb struct is called.

Thank you for your time (and I apologize if this is spurious or badly worded, 
this is my first bug report), and please don't hesitate to let me know if 
there's anything else I can do to help work this out.

Regards,
---
Nathaniel Munk
nathan...@munk.com.au