I'm testing some TIPC code between a Debian host and a Fedora host and am 
getting some nasty-looking kernel messages on both. Could someone take a quick 
look and tell me whether these are known issues and whether there are workarounds?

Description:    Debian GNU/Linux 10 (buster)
Linux post 4.19.0-14-amd64 #1 SMP Debian 4.19.171-2 (2021-01-30) x86_64 
GNU/Linux

[Mon Apr  5 11:54:13 2021] list_del corruption, ffffa5fd89417d80->next is 
LIST_POISON1 (dead000000000100)
[Mon Apr  5 11:54:13 2021] ------------[ cut here ]------------
[Mon Apr  5 11:54:13 2021] kernel BUG at lib/list_debug.c:47!
[Mon Apr  5 11:54:13 2021] invalid opcode: 0000 [#1] SMP PTI
[Mon Apr  5 11:54:13 2021] CPU: 42 PID: 254384 Comm: mupip Not tainted 
4.19.0-14-amd64 #1 Debian 4.19.171-2
[Mon Apr  5 11:54:13 2021] Hardware name: Dell Inc. PowerEdge R730xd/072T6D, 
BIOS 2.1.7 06/16/2016
[Mon Apr  5 11:54:13 2021] RIP: 0010:__list_del_entry_valid.cold.1+0x12/0x47
[Mon Apr  5 11:54:13 2021] Code: ff ff 0f 0b 48 89 c1 4c 89 c6 48 c7 c7 90 ae 
09 89 e8 90 24 ff ff 0f 0b 48 89 fe 48 89 c2 48 c7 c7 20 af 09 89 e8 7c 24 ff 
ff <0f> 0b 48 c7 c7 d0 af 09 89 e8 6e 24 ff ff 0f 0b 48 89 f2 48 89 fe
[Mon Apr  5 11:54:13 2021] RSP: 0018:ffffa5fd89417d10 EFLAGS: 00010046
[Mon Apr  5 11:54:13 2021] RAX: 000000000000004e RBX: ffffa5fd89417d68 RCX: 
0000000000000000
[Mon Apr  5 11:54:13 2021] RDX: 0000000000000000 RSI: ffff89b23f9566b8 RDI: 
ffff89b23f9566b8
[Mon Apr  5 11:54:13 2021] RBP: ffff89a5d3099800 R08: 00000000000006eb R09: 
0000000000aaaaaa
[Mon Apr  5 11:54:13 2021] R10: 0000000000000000 R11: 0000000000000001 R12: 
0000000000000286
[Mon Apr  5 11:54:13 2021] R13: ffff89a8f9d370c0 R14: ffffa5fd89417e28 R15: 
0000000000000008
[Mon Apr  5 11:54:13 2021] FS:  00007f9abfcaa3c0(0000) 
GS:ffff89b23f940000(0000) knlGS:0000000000000000
[Mon Apr  5 11:54:13 2021] CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
[Mon Apr  5 11:54:13 2021] list_del corruption, ffffa5fd8834bd80->next is 
LIST_POISON1 (dead000000000100)
[Mon Apr  5 11:54:13 2021] CR2: 00007fbf0ebcd000 CR3: 00000006d95a0005 CR4: 
00000000003606e0
[Mon Apr  5 11:54:13 2021] DR0: 0000000000000000 DR1: 0000000000000000 DR2: 
0000000000000000
[Mon Apr  5 11:54:13 2021] ------------[ cut here ]------------
[Mon Apr  5 11:54:13 2021] DR3: 0000000000000000 DR6: 00000000fffe0ff0 DR7: 
0000000000000400
[Mon Apr  5 11:54:13 2021] kernel BUG at lib/list_debug.c:47!
[Mon Apr  5 11:54:13 2021] Call Trace:
[Mon Apr  5 11:54:13 2021]  remove_wait_queue+0x20/0x60
[Mon Apr  5 11:54:13 2021]  __tipc_sendmsg+0x2bb/0x570 [tipc]
[Mon Apr  5 11:54:13 2021]  ? do_wait_intr_irq+0xb0/0xb0
[Mon Apr  5 11:54:13 2021]  tipc_sendmsg+0x30/0x50 [tipc]
[Mon Apr  5 11:54:13 2021]  sock_sendmsg+0x36/0x40
[Mon Apr  5 11:54:13 2021]  __sys_sendto+0xee/0x160
[Mon Apr  5 11:54:13 2021] list_del corruption, ffffa5fd88823d80->next is 
LIST_POISON1 (dead000000000100)
[Mon Apr  5 11:54:13 2021]  ? vfs_write+0x16d/0x1a0
[Mon Apr  5 11:54:13 2021] ------------[ cut here ]------------
[Mon Apr  5 11:54:13 2021]  ? vfs_write+0xe9/0x1a0
[Mon Apr  5 11:54:13 2021] kernel BUG at lib/list_debug.c:47!
[Mon Apr  5 11:54:13 2021]  __x64_sys_sendto+0x24/0x30
[Mon Apr  5 11:54:13 2021]  do_syscall_64+0x53/0x110
[Mon Apr  5 11:54:13 2021]  entry_SYSCALL_64_after_hwframe+0x44/0xa9
[Mon Apr  5 11:54:13 2021] RIP: 0033:0x7f9abffe49b7
[Mon Apr  5 11:54:13 2021] Code: 64 89 02 48 c7 c0 ff ff ff ff eb b6 0f 1f 80 
00 00 00 00 48 8d 05 49 5d 0c 00 41 89 ca 8b 00 85 c0 75 10 b8 2c 00 00 00 0f 
05 <48> 3d 00 f0 ff ff 77 71 c3 41 57 4d 89 c7 41 56 41 89 ce 41 55 49
[Mon Apr  5 11:54:13 2021] RSP: 002b:00007ffd979ddb18 EFLAGS: 00000246 
ORIG_RAX: 000000000000002c
[Mon Apr  5 11:54:13 2021] RAX: ffffffffffffffda RBX: 0000000000000000 RCX: 
00007f9abffe49b7
[Mon Apr  5 11:54:13 2021] RDX: 0000000000000008 RSI: 00007ffd979ddb28 RDI: 
0000000000000003
[Mon Apr  5 11:54:13 2021] RBP: 00007ffd979ddb40 R08: 00007f9ac02c2120 R09: 
0000000000000010
[Mon Apr  5 11:54:13 2021] R10: 0000000000000000 R11: 0000000000000246 R12: 
000055d11e4c8250
[Mon Apr  5 11:54:13 2021] R13: 00007ffd979efa40 R14: 0000000000000000 R15: 
0000000000000000
[Mon Apr  5 11:54:13 2021] Modules linked in: btrfs zstd_compress 
zstd_decompress xxhash fuse vfat fat loop rpcsec_gss_krb5 nfsv4 dns_resolver 
nfs lockd grace fscache binfmt_misc ext4 crc16 mbcache jbd2 f2fs fscrypto ecb 
intel_rapl sb_edac x86_pkg_temp_thermal intel_powerclamp coretemp kvm_intel kvm 
irqbypass crct10dif_pclmul ipmi_ssif crc32_pclmul mgag200 ttm 
ghash_clmulni_intel drm_kms_helper drm intel_cstate iTCO_wdt sg dcdbas 
i2c_algo_bit mei_me iTCO_vendor_support intel_uncore intel_rapl_perf evdev 
mxm_wmi mei pcspkr pcc_cpufreq ipmi_si ipmi_devintf wmi ipmi_msghandler button 
acpi_power_meter dm_mod auth_rpcgss tipc sunrpc ip6_udp_tunnel udp_tunnel 
ip_tables x_tables autofs4 xfs raid10 raid456 async_raid6_recov async_memcpy 
async_pq async_xor async_tx xor sd_mod raid6_pq libcrc32c crc32c_generic raid1 
raid0
[Mon Apr  5 11:54:13 2021]  multipath linear md_mod crc32c_intel ahci libahci 
aesni_intel nvme ehci_pci aes_x86_64 crypto_simd libata ehci_hcd megaraid_sas 
cryptd glue_helper nvme_core usbcore tg3 lpc_ich scsi_mod libphy mfd_core 
usb_common
[Mon Apr  5 11:54:13 2021] ---[ end trace bb3c37742d449893 ]---

Description:    Fedora release 33 (Thirty Three)
Linux lltcgtmfed01.fisdev.local 5.10.23-200.fc33.x86_64 #1 SMP Thu Mar 11 
22:18:30 UTC 2021 x86_64 x86_64 x86_64 GNU/Linux

[Mon Apr  5 14:29:30 2021] tipc: Gacks on 1866da4db38c:eno1-1866da4d75f5:eno1: 
64, ql: 339!
[Mon Apr  5 14:29:32 2021] tipc: Gacks on 1866da4db38c:eno1-1866da4d75f5:eno1: 
64, ql: 339!

   I suspect I lost at least one datagram because of one of these: I know that 
it was sent, and the recv()ing process (now stuck) was expecting only this one 
datagram reply, so buffer issues seem unlikely.
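
   Independent of the kernel issue, one possible stopgap for the stuck 
receiver is a receive timeout, so that a lost reply surfaces as an error 
instead of a hang. The sketch below is illustrative only and is not the code 
under test: recv_reply is a hypothetical helper, and an AF_UNIX datagram 
socketpair stands in for the TIPC sockets (an assumption) so that it runs 
without the tipc module loaded.

```python
import socket


def recv_reply(sock, timeout_s=2.0, bufsize=65536):
    """Wait up to timeout_s seconds for one datagram.

    Returns the datagram payload, or None if nothing arrived in time.
    (Hypothetical helper for illustration.)
    """
    sock.settimeout(timeout_s)
    try:
        return sock.recv(bufsize)
    except socket.timeout:
        # The reply never arrived; the caller can retry or report the loss.
        return None


if __name__ == "__main__":
    # Stand-in for the two TIPC endpoints (assumption): a local datagram pair.
    sender, receiver = socket.socketpair(socket.AF_UNIX, socket.SOCK_DGRAM)

    # No reply sent yet: the receiver gives up instead of blocking forever.
    print("first wait:", recv_reply(receiver, timeout_s=0.5))

    # Once the reply is sent, the same call returns it.
    sender.send(b"reply")
    print("second wait:", recv_reply(receiver, timeout_s=0.5))

    sender.close()
    receiver.close()
```

   This does not address the list_del BUG itself; it only keeps the receiving 
side from blocking indefinitely when a datagram goes missing.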

   Thanks.

Gary Duzan
FIS - GT.M Core Team

_______________________________________________
tipc-discussion mailing list
tipc-discussion@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/tipc-discussion
