** Information type changed from Public to Public Security
--
You received this bug notification because you are a member of Kernel
Packages, which is subscribed to linux in Ubuntu.
https://bugs.launchpad.net/bugs/2098056
Title:
RAID getting corrupted while running ZFS IOs .
Status in linux package in Ubuntu:
Confirmed
Bug description:
Steps to reproduce :
1. Power on the NVMeoF enclosure.
2. Discover and connect the drives.
3. Create 2 zpools with even and odd drives.
4. Start the IO on both pools created.
Observation :
1. Observed call trace while running ZFS IO Able to see "failed to send
request-5" and drive went continuously reconnected state.
2. The issue is seen with Ubuntu 24.04.1 with kernel 6.8.0-49.generic kernel.
From Kernel ring buffer logs (dmesg) :
[Tue Feb 11 05:25:55 2025] ------------[ cut here ]------------
[Tue Feb 11 05:25:55 2025] WARNING: CPU: 10 PID: 114873 at
net/core/skbuff.c:7006 skb_splice_from_iter+0x139/0x370
[Tue Feb 11 05:25:55 2025] Modules linked in: nvme_tcp nvme_keyring nvme
xt_tcpudp nft_compat nf_tables qrtr cfg80211 binfmt_misc zfs(PO) spl(O)
intel_rapl_msr intel_rapl_common intel_uncore_frequency
intel_uncore_frequency_common sb_edac x86_pkg_temp_thermal intel_powerclamp
coretemp kvm_intel dell_wmi dell_smbios dell_wmi_descriptor kvm video mgag200
ledtrig_audio irqbypass sparse_keymap dcdbas joydev input_leds mei_me
i2c_algo_bit mei acpi_power_meter rapl intel_cstate lpc_ich ipmi_ssif mac_hid
acpi_ipmi ipmi_si ipmi_devintf ipmi_msghandler mxm_wmi sch_fq_codel
dm_multipath nvme_fabrics msr nvme_core nvme_auth efi_pstore nfnetlink
dmi_sysfs ip_tables x_tables autofs4 btrfs blake2b_generic raid10 raid456
async_raid6_recov async_memcpy async_pq async_xor async_tx xor raid6_pq
libcrc32c raid1 raid0 mlx5_ib ib_uverbs macsec ib_core mlx5_core
crct10dif_pclmul crc32_pclmul polyval_clmulni polyval_generic
ghash_clmulni_intel sha256_ssse3 sha1_ssse3 mlxfw psample tls pci_hyperv_intf
tg3
pata_acpi wmi hid_generic usbhid hid aesni_intel
[Tue Feb 11 05:25:55 2025] crypto_simd cryptd
[Tue Feb 11 05:25:55 2025] CPU: 10 PID: 114873 Comm: kworker/10:2H Tainted: P
O 6.8.0-49-generic #49-Ubuntu
[Tue Feb 11 05:25:55 2025] Hardware name: Dell Inc. PowerEdge R730/072T6D,
BIOS 2.7.1 001/22/2018
[Tue Feb 11 05:25:55 2025] Workqueue: nvme_tcp_wq nvme_tcp_io_work [nvme_tcp]
[Tue Feb 11 05:25:55 2025] RIP: 0010:skb_splice_from_iter+0x139/0x370
[Tue Feb 11 05:25:55 2025] Code: 39 e1 48 8b 53 08 49 0f 47 cc 49 89 cd f6 c2
01 0f 85 c0 01 00 00 66 90 48 89 da 48 8b 12 80 e6 08 0f 84 8e 00 00 00 4d 89
fe <0f> 0b 49 c7 c0 fb ff ff ff 48 8b 85 68 ff ff ff 41 01 46 70 41 01
[Tue Feb 11 05:25:55 2025] RSP: 0018:ffffb216769d7a38 EFLAGS: 00010202
[Tue Feb 11 05:25:55 2025] RAX: 0000000000000000 RBX: fffff74820347000 RCX:
0000000000001000
[Tue Feb 11 05:25:55 2025] RDX: 0017ffffc0000840 RSI: 0000000000000000 RDI:
0000000000000000
[Tue Feb 11 05:25:55 2025] RBP: ffffb216769d7ae0 R08: 0000000000000000 R09:
0000000000000000
[Tue Feb 11 05:25:55 2025] R10: 0000000000000000 R11: 0000000000000000 R12:
0000000000001000
[Tue Feb 11 05:25:55 2025] R13: 0000000000001000 R14: ffff9c22fccbfe00 R15:
ffff9c22fccbfe00
[Tue Feb 11 05:25:55 2025] FS: 0000000000000000(0000)
GS:ffff9c347f680000(0000) knlGS:0000000000000000
[Tue Feb 11 05:25:55 2025] CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033
[Tue Feb 11 05:25:55 2025] CR2: 00007d79ae7af000 CR3: 0000002a226e4001 CR4:
00000000003706f0
[Tue Feb 11 05:25:55 2025] DR0: 0000000000000000 DR1: 0000000000000000 DR2:
0000000000000000
[Tue Feb 11 05:25:55 2025] DR3: 0000000000000000 DR6: 00000000fffe0ff0 DR7:
0000000000000400
[Tue Feb 11 05:25:55 2025] Call Trace:
[Tue Feb 11 05:25:55 2025] <TASK>
[Tue Feb 11 05:25:55 2025] ? show_regs+0x6d/0x80
[Tue Feb 11 05:25:55 2025] ? __warn+0x89/0x160
[Tue Feb 11 05:25:55 2025] ? skb_splice_from_iter+0x139/0x370
[Tue Feb 11 05:25:55 2025] ? report_bug+0x17e/0x1b0
[Tue Feb 11 05:25:55 2025] ? handle_bug+0x51/0xa0
[Tue Feb 11 05:25:55 2025] ? exc_invalid_op+0x18/0x80
[Tue Feb 11 05:25:55 2025] ? asm_exc_invalid_op+0x1b/0x20
[Tue Feb 11 05:25:55 2025] ? skb_splice_from_iter+0x139/0x370
[Tue Feb 11 05:25:55 2025] ? skb_splice_from_iter+0xd5/0x370
[Tue Feb 11 05:25:55 2025] tcp_sendmsg_locked+0x352/0xd70
[Tue Feb 11 05:25:55 2025] ? tcp_push+0x159/0x190
[Tue Feb 11 05:25:55 2025] ? tcp_sendmsg_locked+0x9c4/0xd70
[Tue Feb 11 05:25:55 2025] tcp_sendmsg+0x2c/0x50
[Tue Feb 11 05:25:55 2025] inet_sendmsg+0x42/0x80
[Tue Feb 11 05:25:55 2025] sock_sendmsg+0x118/0x150
[Tue Feb 11 05:25:55 2025] nvme_tcp_try_send_data+0x16e/0x4d0 [nvme_tcp]
[Tue Feb 11 05:25:55 2025] nvme_tcp_try_send+0x23c/0x300 [nvme_tcp]
[Tue Feb 11 05:25:55 2025] nvme_tcp_io_work+0x40/0xe0 [nvme_tcp]
[Tue Feb 11 05:25:55 2025] process_one_work+0x178/0x350
[Tue Feb 11 05:25:55 2025] worker_thread+0x306/0x440
[Tue Feb 11 05:25:55 2025] ? __pfx_worker_thread+0x10/0x10
[Tue Feb 11 05:25:55 2025] kthread+0xf2/0x120
[Tue Feb 11 05:25:55 2025] ? __pfx_kthread+0x10/0x10
[Tue Feb 11 05:25:55 2025] ret_from_fork+0x47/0x70
[Tue Feb 11 05:25:55 2025] ? __pfx_kthread+0x10/0x10
[Tue Feb 11 05:25:55 2025] ret_from_fork_asm+0x1b/0x30
[Tue Feb 11 05:25:55 2025] </TASK>
[Tue Feb 11 05:25:55 2025] ---[ end trace 0000000000000000 ]---
[Tue Feb 11 05:25:55 2025] nvme nvme8: failed to send request -5
[Tue Feb 11 05:26:25 2025] nvme nvme8: I/O tag 5 (9005) type 4 opcode 0x2
(I/O Cmd) QID 11 timeout
[Tue Feb 11 05:26:25 2025] nvme nvme8: starting error recovery
[Tue Feb 11 05:26:25 2025] nvme nvme8: I/O tag 6 (c006) type 4 opcode 0x1
(I/O Cmd) QID 11 timeout
[Tue Feb 11 05:26:25 2025] nvme nvme8: I/O tag 7 (d007) type 4 opcode 0x2
(I/O Cmd) QID 11 timeout
[Tue Feb 11 05:26:25 2025] nvme nvme8: I/O tag 11 (700b) type 4 opcode 0x1
(I/O Cmd) QID 11 timeout
[Tue Feb 11 05:26:25 2025] nvme nvme8: I/O tag 12 (300c) type 4 opcode 0x1
(I/O Cmd) QID 11 timeout
[Tue Feb 11 05:26:25 2025] nvme nvme32: failed to send request -5
[Tue Feb 11 05:26:25 2025] nvme nvme8: Reconnecting in 10 seconds...
[Tue Feb 11 05:26:25 2025] nvme nvme32: starting error recovery
[Tue Feb 11 05:26:25 2025] block nvme8n1: no usable path - requeuing I/O
[Tue Feb 11 05:26:25 2025] block nvme8n1: no usable path - requeuing I/O
[Tue Feb 11 05:26:25 2025] block nvme8n1: no usable path - requeuing I/O
[Tue Feb 11 05:26:25 2025] block nvme8n1: no usable path - requeuing I/O
[Tue Feb 11 05:26:25 2025] block nvme8n1: no usable path - requeuing I/O
[Tue Feb 11 05:26:25 2025] block nvme8n1: no usable path - requeuing I/O
[Tue Feb 11 05:26:25 2025] block nvme8n1: no usable path - requeuing I/O
[Tue Feb 11 05:26:25 2025] block nvme8n1: no usable path - requeuing I/O
[Tue Feb 11 05:26:25 2025] block nvme8n1: no usable path - requeuing I/O
[Tue Feb 11 05:26:25 2025] block nvme8n1: no usable path - requeuing I/O
[Tue Feb 11 05:26:25 2025] nvme nvme32: Reconnecting in 10 seconds...
[Tue Feb 11 05:26:36 2025] nvme nvme8: queue_size 128 > ctrl sqsize 16,
clamping down
[Tue Feb 11 05:26:36 2025] nvme nvme8: creating 16 I/O queues.
[Tue Feb 11 05:26:36 2025] nvme nvme32: queue_size 128 > ctrl sqsize 16,
clamping down
[Tue Feb 11 05:26:36 2025] nvme nvme32: creating 16 I/O queues.
[Tue Feb 11 05:26:36 2025] nvme nvme8: mapped 16/0/0 default/read/poll queues.
[Tue Feb 11 05:26:36 2025] nvme nvme8: Successfully reconnected (1 attempt)
[Tue Feb 11 05:26:36 2025] nvme nvme8: failed to send request -5
[Tue Feb 11 05:26:36 2025] nvme nvme32: mapped 16/0/0 default/read/poll
queues.
[Tue Feb 11 05:26:36 2025] nvme nvme8: starting error recovery
[Tue Feb 11 05:26:36 2025] nvme_ns_head_submit_bio: 55 callbacks suppressed
[Tue Feb 11 05:26:36 2025] block nvme8n1: no usable path - requeuing I/O
[Tue Feb 11 05:26:36 2025] block nvme8n1: no usable path - requeuing I/O
[Tue Feb 11 05:26:36 2025] block nvme8n1: no usable path - requeuing I/O
[Tue Feb 11 05:26:36 2025] block nvme8n1: no usable path - requeuing I/O
[Tue Feb 11 05:26:36 2025] block nvme8n1: no usable path - requeuing I/O
[Tue Feb 11 05:26:36 2025] block nvme8n1: no usable path - requeuing I/O
[Tue Feb 11 05:26:36 2025] block nvme8n1: no usable path - requeuing I/O
[Tue Feb 11 05:26:36 2025] block nvme8n1: no usable path - requeuing I/O
[Tue Feb 11 05:26:36 2025] block nvme8n1: no usable path - requeuing I/O
[Tue Feb 11 05:26:36 2025] block nvme8n1: no usable path - requeuing I/O
[Tue Feb 11 05:26:36 2025] nvme nvme32: Successfully reconnected (1 attempt)
[Tue Feb 11 05:26:36 2025] nvme nvme8: reading non-mdts-limits failed: -4
---
ProblemType: Bug
AlsaDevices:
total 0
crw-rw---- 1 root audio 116, 1 Feb 12 07:46 seq
crw-rw---- 1 root audio 116, 33 Feb 12 07:46 timer
AplayDevices: Error: [Errno 2] No such file or directory: 'aplay'
ApportVersion: 2.28.1-0ubuntu2
Architecture: amd64
ArecordDevices: Error: [Errno 2] No such file or directory: 'arecord'
AudioDevicesInUse: Error: command ['fuser', '-v', '/dev/snd/timer',
'/dev/snd/seq'] failed with exit code 1:
CRDA: N/A
CasperMD5CheckResult: unknown
DistroRelease: Ubuntu 24.04
InstallationDate: Installed on 2024-09-20 (146 days ago)
InstallationMedia: Ubuntu-Server 24.04 LTS "Noble Numbat" - Release amd64
(20240423)
IwConfig: Error: [Errno 2] No such file or directory: 'iwconfig'
MachineType: Dell Inc. PowerEdge R740
NonfreeKernelModules: zfs
Package: linux (not installed)
PciMultimedia:
ProcEnviron:
LANG=en_US.UTF-8
PATH=(custom, no user)
SHELL=/bin/bash
TERM=xterm-256color
XDG_RUNTIME_DIR=<set>
ProcFB: 0 mgag200drmfb
ProcKernelCmdLine: BOOT_IMAGE=/vmlinuz-6.8.0-52-generic
root=/dev/mapper/ubuntu--vg-ubuntu--lv ro intel_iommu=off
ProcVersionSignature: Ubuntu 6.8.0-52.53-generic 6.8.12
RelatedPackageVersions:
linux-restricted-modules-6.8.0-52-generic N/A
linux-backports-modules-6.8.0-52-generic N/A
linux-firmware 20240318.git3b128b60-0ubuntu2.3
RfKill: Error: [Errno 2] No such file or directory: 'rfkill'
Tags: noble
Uname: Linux 6.8.0-52-generic x86_64
UpgradeStatus: No upgrade log present (probably fresh install)
UserGroups: N/A
_MarkForUpload: True
dmi.bios.date: 06/04/2023
dmi.bios.release: 2.19
dmi.bios.vendor: Dell Inc.
dmi.bios.version: 2.19.1
dmi.board.name: 08D89F
dmi.board.vendor: Dell Inc.
dmi.board.version: A03
dmi.chassis.type: 23
dmi.chassis.vendor: Dell Inc.
dmi.modalias:
dmi:bvnDellInc.:bvr2.19.1:bd06/04/2023:br2.19:svnDellInc.:pnPowerEdgeR740:pvr:rvnDellInc.:rn08D89F:rvrA03:cvnDellInc.:ct23:cvr:skuSKU=0715;ModelName=PowerEdgeR740:
dmi.product.family: PowerEdge
dmi.product.name: PowerEdge R740
dmi.product.sku: SKU=0715;ModelName=PowerEdge R740
dmi.sys.vendor: Dell Inc.
To manage notifications about this bug go to:
https://bugs.launchpad.net/ubuntu/+source/linux/+bug/2098056/+subscriptions
--
Mailing list: https://launchpad.net/~kernel-packages
Post to : [email protected]
Unsubscribe : https://launchpad.net/~kernel-packages
More help : https://help.launchpad.net/ListHelp