I'm testing some TIPC code between a Debian host and a Fedora host and getting some nasty looking kernel messages. Could someone take a quick look and see if these are known issues and whether there are known workarounds?
Description: Debian GNU/Linux 10 (buster) Linux post 4.19.0-14-amd64 #1 SMP Debian 4.19.171-2 (2021-01-30) x86_64 GNU/Linux [Mon Apr 5 11:54:13 2021] list_del corruption, ffffa5fd89417d80->next is LIST_POISON1 (dead000000000100) [Mon Apr 5 11:54:13 2021] ------------[ cut here ]------------ [Mon Apr 5 11:54:13 2021] kernel BUG at lib/list_debug.c:47! [Mon Apr 5 11:54:13 2021] invalid opcode: 0000 [#1] SMP PTI [Mon Apr 5 11:54:13 2021] CPU: 42 PID: 254384 Comm: mupip Not tainted 4.19.0-14-amd64 #1 Debian 4.19.171-2 [Mon Apr 5 11:54:13 2021] Hardware name: Dell Inc. PowerEdge R730xd/072T6D, BIOS 2.1.7 06/16/2016 [Mon Apr 5 11:54:13 2021] RIP: 0010:__list_del_entry_valid.cold.1+0x12/0x47 [Mon Apr 5 11:54:13 2021] Code: ff ff 0f 0b 48 89 c1 4c 89 c6 48 c7 c7 90 ae 09 89 e8 90 24 ff ff 0f 0b 48 89 fe 48 89 c2 48 c7 c7 20 af 09 89 e8 7c 24 ff ff <0f> 0b 48 c7 c7 d0 af 09 89 e8 6e 24 ff ff 0f 0b 48 89 f2 48 89 fe [Mon Apr 5 11:54:13 2021] RSP: 0018:ffffa5fd89417d10 EFLAGS: 00010046 [Mon Apr 5 11:54:13 2021] RAX: 000000000000004e RBX: ffffa5fd89417d68 RCX: 0000000000000000 [Mon Apr 5 11:54:13 2021] RDX: 0000000000000000 RSI: ffff89b23f9566b8 RDI: ffff89b23f9566b8 [Mon Apr 5 11:54:13 2021] RBP: ffff89a5d3099800 R08: 00000000000006eb R09: 0000000000aaaaaa [Mon Apr 5 11:54:13 2021] R10: 0000000000000000 R11: 0000000000000001 R12: 0000000000000286 [Mon Apr 5 11:54:13 2021] R13: ffff89a8f9d370c0 R14: ffffa5fd89417e28 R15: 0000000000000008 [Mon Apr 5 11:54:13 2021] FS: 00007f9abfcaa3c0(0000) GS:ffff89b23f940000(0000) knlGS:0000000000000000 [Mon Apr 5 11:54:13 2021] CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033 [Mon Apr 5 11:54:13 2021] list_del corruption, ffffa5fd8834bd80->next is LIST_POISON1 (dead000000000100) [Mon Apr 5 11:54:13 2021] CR2: 00007fbf0ebcd000 CR3: 00000006d95a0005 CR4: 00000000003606e0 [Mon Apr 5 11:54:13 2021] DR0: 0000000000000000 DR1: 0000000000000000 DR2: 0000000000000000 [Mon Apr 5 11:54:13 2021] ------------[ cut here ]------------ [Mon Apr 5 11:54:13 2021] DR3: 0000000000000000 DR6: 00000000fffe0ff0 DR7: 0000000000000400 [Mon Apr 5 11:54:13 2021] kernel BUG at lib/list_debug.c:47! [Mon Apr 5 11:54:13 2021] Call Trace: [Mon Apr 5 11:54:13 2021] remove_wait_queue+0x20/0x60 [Mon Apr 5 11:54:13 2021] __tipc_sendmsg+0x2bb/0x570 [tipc] [Mon Apr 5 11:54:13 2021] ? do_wait_intr_irq+0xb0/0xb0 [Mon Apr 5 11:54:13 2021] tipc_sendmsg+0x30/0x50 [tipc] [Mon Apr 5 11:54:13 2021] sock_sendmsg+0x36/0x40 [Mon Apr 5 11:54:13 2021] __sys_sendto+0xee/0x160 [Mon Apr 5 11:54:13 2021] list_del corruption, ffffa5fd88823d80->next is LIST_POISON1 (dead000000000100) [Mon Apr 5 11:54:13 2021] ? vfs_write+0x16d/0x1a0 [Mon Apr 5 11:54:13 2021] ------------[ cut here ]------------ [Mon Apr 5 11:54:13 2021] ? vfs_write+0xe9/0x1a0 [Mon Apr 5 11:54:13 2021] kernel BUG at lib/list_debug.c:47! [Mon Apr 5 11:54:13 2021] __x64_sys_sendto+0x24/0x30 [Mon Apr 5 11:54:13 2021] do_syscall_64+0x53/0x110 [Mon Apr 5 11:54:13 2021] entry_SYSCALL_64_after_hwframe+0x44/0xa9 [Mon Apr 5 11:54:13 2021] RIP: 0033:0x7f9abffe49b7 [Mon Apr 5 11:54:13 2021] Code: 64 89 02 48 c7 c0 ff ff ff ff eb b6 0f 1f 80 00 00 00 00 48 8d 05 49 5d 0c 00 41 89 ca 8b 00 85 c0 75 10 b8 2c 00 00 00 0f 05 <48> 3d 00 f0 ff ff 77 71 c3 41 57 4d 89 c7 41 56 41 89 ce 41 55 49 [Mon Apr 5 11:54:13 2021] RSP: 002b:00007ffd979ddb18 EFLAGS: 00000246 ORIG_RAX: 000000000000002c [Mon Apr 5 11:54:13 2021] RAX: ffffffffffffffda RBX: 0000000000000000 RCX: 00007f9abffe49b7 [Mon Apr 5 11:54:13 2021] RDX: 0000000000000008 RSI: 00007ffd979ddb28 RDI: 0000000000000003 [Mon Apr 5 11:54:13 2021] RBP: 00007ffd979ddb40 R08: 00007f9ac02c2120 R09: 0000000000000010 [Mon Apr 5 11:54:13 2021] R10: 0000000000000000 R11: 0000000000000246 R12: 000055d11e4c8250 [Mon Apr 5 11:54:13 2021] R13: 00007ffd979efa40 R14: 0000000000000000 R15: 0000000000000000 [Mon Apr 5 11:54:13 2021] Modules linked in: btrfs zstd_compress zstd_decompress xxhash fuse vfat fat loop rpcsec_gss_krb5 nfsv4 dns_resolver nfs lockd grace fscache binfmt_misc ext4 crc16 mbcache jbd2 f2fs fscrypto ecb intel_rapl sb_edac x86_pkg_temp_thermal intel_powerclamp coretemp kvm_intel kvm irqbypass crct10dif_pclmul ipmi_ssif crc32_pclmul mgag200 ttm ghash_clmulni_intel drm_kms_helper drm intel_cstate iTCO_wdt sg dcdbas i2c_algo_bit mei_me iTCO_vendor_support intel_uncore intel_rapl_perf evdev mxm_wmi mei pcspkr pcc_cpufreq ipmi_si ipmi_devintf wmi ipmi_msghandler button acpi_power_meter dm_mod auth_rpcgss tipc sunrpc ip6_udp_tunnel udp_tunnel ip_tables x_tables autofs4 xfs raid10 raid456 async_raid6_recov async_memcpy async_pq async_xor async_tx xor sd_mod raid6_pq libcrc32c crc32c_generic raid1 raid0 [Mon Apr 5 11:54:13 2021] multipath linear md_mod crc32c_intel ahci libahci aesni_intel nvme ehci_pci aes_x86_64 crypto_simd libata ehci_hcd megaraid_sas cryptd glue_helper nvme_core usbcore tg3 lpc_ich scsi_mod libphy mfd_core usb_common [Mon Apr 5 11:54:13 2021] ---[ end trace bb3c37742d449893 ]--- Description: Fedora release 33 (Thirty Three) Linux lltcgtmfed01.fisdev.local 5.10.23-200.fc33.x86_64 #1 SMP Thu Mar 11 22:18:30 UTC 2021 x86_64 x86_64 x86_64 GNU/Linux [Mon Apr 5 14:29:30 2021] tipc: Gacks on 1866da4db38c:eno1-1866da4d75f5:eno1: 64, ql: 339! [Mon Apr 5 14:29:32 2021] tipc: Gacks on 1866da4db38c:eno1-1866da4d75f5:eno1: 64, ql: 339! I suspect I lost at least one datagram due to one of these, as I know that it was sent, and the recv()ing process (now stuck) was only expecting this one datagram reply, making buffer issues unlikely. Thanks. Gary Duzan FIS - GT.M Core Team The information contained in this message is proprietary and/or confidential. If you are not the intended recipient, please: (i) delete the message and all copies; (ii) do not disclose, distribute or use the message in any manner; and (iii) notify the sender immediately. In addition, please be aware that any message addressed to our domain is subject to archiving and review by persons other than the intended recipient. Thank you. _______________________________________________ tipc-discussion mailing list tipc-discussion@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/tipc-discussion