Sorry for the duplicate. A check was triggered on all three of my RAID arrays, a disk started failing during that check, and a good while later this bug happened. That's basically it.

[154043.105837] md: check of RAID array md125
[154049.432225] md: check of RAID array md126
[154055.718196] md: check of RAID array md127
[163101.001572] sd 1:0:0:0: [sda] tag#8069 FAILED Result: hostbyte=DID_SOFT_ERROR driverbyte=DRIVER_OK cmd_age=0s
[163101.001655] sd 1:0:0:0: [sda] tag#8069 CDB: Read(10) 28 00 1b 28 d1 80 00 02 80 00
[163101.001691] I/O error, dev sda, sector 455659904 op 0x0:(READ) flags 0x80700 phys_seg 5 prio class 2
[163101.412714] sd 1:0:0:0: Power-on or device reset occurred
[163101.698759] sd 1:0:0:0: [sda] tag#7728 FAILED Result: hostbyte=DID_SOFT_ERROR driverbyte=DRIVER_OK cmd_age=0s
[163101.698813] sd 1:0:0:0: [sda] tag#7728 CDB: Read(10) 28 00 74 70 6d 00 00 00 80 00
[163101.698843] I/O error, dev sda, sector 1953524992 op 0x0:(READ) flags 0x80700 phys_seg 1 prio class 2
[163102.162693] sd 1:0:0:0: Power-on or device reset occurred
[163102.447648] sd 1:0:0:0: [sda] Unaligned partial completion (resid=866300, sector_sz=512)
[163102.447723] sd 1:0:0:0: [sda] tag#8049 CDB: Read(10) 28 00 5e 21 d4 00 00 08 00 00
[163102.447751] sd 1:0:0:0: [sda] tag#8049 FAILED Result: hostbyte=DID_SOFT_ERROR driverbyte=DRIVER_OK cmd_age=0s
[163102.447789] sd 1:0:0:0: [sda] tag#8049 CDB: Read(10) 28 00 5e 21 d4 00 00 08 00 00
[163102.447855] I/O error, dev sda, sector 1579275264 op 0x0:(READ) flags 0x80700 phys_seg 8 prio class 2
[163102.912783] sd 1:0:0:0: Power-on or device reset occurred
[163103.662867] sd 1:0:0:0: Power-on or device reset occurred
[163104.413036] sd 1:0:0:0: Power-on or device reset occurred
[163105.163044] sd 1:0:0:0: Power-on or device reset occurred
[163105.913609] sd 1:0:0:0: Power-on or device reset occurred
[163106.663213] sd 1:0:0:0: Power-on or device reset occurred
[163107.289773] sd 1:0:0:0: Power-on or device reset occurred
[163107.932812] sd 1:0:0:0: Power-on or device reset occurred
[163108.913957] sd 1:0:0:0: Power-on or device reset occurred
[163109.664106] sd 1:0:0:0: Power-on or device reset occurred
[163110.414281] sd 1:0:0:0: Power-on or device reset occurred
[163111.164312] sd 1:0:0:0: Power-on or device reset occurred
[163111.913814] sd 1:0:0:0: Power-on or device reset occurred
[163112.663904] sd 1:0:0:0: Power-on or device reset occurred
[163113.414627] sd 1:0:0:0: Power-on or device reset occurred
[163113.699639] sd 1:0:0:0: [sda] Unaligned partial completion (resid=205820, sector_sz=512)
[163113.699771] sd 1:0:0:0: [sda] tag#7615 CDB: Read(10) 28 00 01 4d 84 00 00 04 00 00
[163113.699976] sd 1:0:0:0: [sda] tag#7615 FAILED Result: hostbyte=DID_SOFT_ERROR driverbyte=DRIVER_OK cmd_age=0s
[163113.700198] sd 1:0:0:0: [sda] tag#7615 CDB: Read(10) 28 00 01 4d 84 00 00 04 00 00
[163113.700329] I/O error, dev sda, sector 21857280 op 0x0:(READ) flags 0x80700 phys_seg 4 prio class 2
[163114.164085] sd 1:0:0:0: Power-on or device reset occurred
[163114.914167] sd 1:0:0:0: Power-on or device reset occurred
[163115.664261] sd 1:0:0:0: Power-on or device reset occurred
[163115.959965] sd 1:0:0:0: [sda] tag#8044 FAILED Result: hostbyte=DID_SOFT_ERROR driverbyte=DRIVER_OK cmd_age=0s
[163115.959963] sd 1:0:0:0: [sda] Unaligned partial completion (resid=308220, sector_sz=512)
[163115.959996] sd 1:0:0:0: [sda] tag#8016 CDB: Read(10) 28 00 00 58 a0 00 00 04 00 00
[163115.960041] sd 1:0:0:0: [sda] tag#8044 CDB: Read(10) 28 00 1b 48 fc 00 00 04 00 00
[163115.960081] sd 1:0:0:0: [sda] tag#8016 FAILED Result: hostbyte=DID_SOFT_ERROR driverbyte=DRIVER_OK cmd_age=0s
[163115.960109] I/O error, dev sda, sector 457767936 op 0x0:(READ) flags 0x80700 phys_seg 8 prio class 2
[163115.960155] sd 1:0:0:0: [sda] tag#8016 CDB: Read(10) 28 00 00 58 a0 00 00 04 00 00
[163115.960402] I/O error, dev sda, sector 5808128 op 0x0:(READ) flags 0x80700 phys_seg 4 prio class 2
[163116.414438] sd 1:0:0:0: Power-on or device reset occurred
[163116.706739] sd 1:0:0:0: [sda] tag#8103 FAILED Result: hostbyte=DID_SOFT_ERROR driverbyte=DRIVER_OK cmd_age=0s
[163116.706804] sd 1:0:0:0: [sda] tag#8103 CDB: Read(10) 28 00 0d 86 3c 00 00 08 00 00
[163116.706853] I/O error, dev sda, sector 226900992 op 0x0:(READ) flags 0x80700 phys_seg 12 prio class 2
[163117.109789] sd 1:0:0:0: Power-on or device reset occurred
[163117.914577] sd 1:0:0:0: Power-on or device reset occurred
[163497.189569] md: md126: check done.
[163829.231426] md: md127: check done.
[185096.100388] md: md125: check done.

Right after that come the kernel logs you saw in my previous mail, with nothing in between.

On Monday, February 9th, 2026 at 12:19 AM, jfiusdq <[email protected]> wrote:

> Hello,
>
>
> Today I was met with the following kernel log on IBM POWER9, a bit worrying
> because of it concerning RAID6:
>
>
> [240888.555387] slab raid6-md125 start c000000d9371bf30 pointer offset 16
> size 2544
> [240888.555464] list_add corruption. prev->next should be next
> (c00000002a3fc3e0), but was c000000d9371bf40. (prev=c000000d9371bf40).
> [240888.555582] ------------[ cut here ]------------
> [240888.555615] kernel BUG at lib/list_debug.c:32!
> [240888.555650] Oops: Exception in kernel mode, sig: 5 [#1]
> [240888.555703] LE PAGE_SIZE=64K MMU=Radix SMP NR_CPUS=2048 NUMA PowerNV
> [240888.555755] Modules linked in: vendor_reset(OE) vfio_pci vfio_pci_core
> vfio_iommu_spapr_tce vfio iommufd vhost_net vhost vhost_iotlb tap tun
> nft_masq nft_reject_ipv4 act_csum cls_u32 sch_htb nf_nat_tftp
> nf_conntrack_tftp bridge stp llc kvm_hv kvm rfkill xt_conntrack nft_compat
> nft_fib_inet nft_fib_ipv4 nft_fib_ipv6 nft_fib nft_reject_inet nf_reject_ipv4
> nf_reject_ipv6 nft_reject nft_ct nft_chain_nat nf_nat nf_conntrack
> nf_defrag_ipv6 nf_defrag_ipv4 nf_tables dm_cache_smq dm_cache raid456
> async_raid6_recov async_memcpy async_pq async_xor async_tx sunrpc raid10
> snd_hda_intel at24 joydev snd_intel_dspcfg snd_hda_codec snd_hda_core
> onboard_usb_dev snd_hwdep snd_seq snd_seq_device snd_pcm ofpart tg3
> powernv_flash snd_timer atlantic vmx_crypto ipmi_powernv snd ipmi_devintf mtd
> ipmi_msghandler macsec rtc_opal opal_prd soundcore i2c_opal fuse dm_multipath
> loop nfnetlink zram lz4hc_compress lz4_compress xfs dm_thin_pool
> dm_persistent_data dm_bio_prison dm_crypt raid1 nvme mpt3sas nvme_core uas
> usb_storage ast nvme_keyring
> [240888.555962] nvme_auth hkdf raid_class i2c_algo_bit scsi_transport_sas
> scsi_dh_rdac scsi_dh_emc scsi_dh_alua i2c_dev aes_gcm_p10_crypto crypto_simd
> cryptd
> [240888.556531] CPU: 33 UID: 0 PID: 2851 Comm: dmcrypt_write/2 Tainted: G
> OE 6.18.8-200.fc43.ppc64le #1 PREEMPT(voluntary)
> [240888.556608] Tainted: [O]=OOT_MODULE, [E]=UNSIGNED_MODULE
> [240888.556650] Hardware name: T2P9D01 REV 1.00 POWER9 (raw) 0x4e1202
> opal:skiboot-ecb1dc7 PowerNV
> [240888.556721] NIP: c000000000e098fc LR: c000000000e098f8 CTR:
> 0000000000000000
> [240888.556766] REGS: c0000000295e77a0 TRAP: 0700 Tainted: G OE
> (6.18.8-200.fc43.ppc64le)
> [240888.556822] MSR: 9000000000029033 <SF,HV,EE,ME,IR,DR,RI,LE> CR:
> 24004280 XER: 00000000
> [240888.556904] CFAR: c00000000034542c IRQMASK: 0
> GPR00: c000000000e098f8 c0000000295e7a40 c0000000026ba900
> 0000000000000075
> GPR04: 00000000ffffbfff 0000000000000001 0000001ffc2d0000
> 0000000000000001
> GPR08: 0000000000000027 0000000000000000 0000000000000000
> c0000000295e7890
> GPR12: c000201fff18ffa8 c000001ffffde600 c000000000299c28
> c0002000113fdb40
> GPR16: 0000000000000000 0000000000000000 0000000000000000
> 0000000000000000
> GPR20: 0000000000000000 0000000000000000 0000000000000000
> 0000000000000001
> GPR24: 0000000000000000 c0000000295e7c70 c00000002bbce800
> c000001126d57830
> GPR28: 0000000000000000 c000201d2995b588 c00020005f464000
> c000201d2995b540
> [240888.557339] NIP [c000000000e098fc] __list_add_valid_or_report+0xdc/0x140
> [240888.557405] LR [c000000000e098f8] __list_add_valid_or_report+0xd8/0x140
> [240888.557463] Call Trace:
> [240888.557484] [c0000000295e7a40] [c000000000e098f8]
> __list_add_valid_or_report+0xd8/0x140 (unreliable)
> [240888.557541] [c0000000295e7ab0] [c0080000238f0ad4]
> release_stripe_plug+0x9c/0x150 [raid456]
> [240888.557607] [c0000000295e7b00] [c0080000238f59f4]
> make_stripe_request+0x32c/0x560 [raid456]
> [240888.557678] [c0000000295e7bd0] [c0080000238f5df8]
> raid5_make_request+0x1d0/0x610 [raid456]
> [240888.557765] [c0000000295e7d10] [c000000001369a04]
> md_handle_request+0x1c4/0x400
> [240888.557850] [c0000000295e7da0] [c000000000d04010] __submit_bio+0x230/0x3d0
> [240888.557927] [c0000000295e7e40] [c000000000d04244]
> __submit_bio_noacct+0x94/0x250
> [240888.557998] [c0000000295e7eb0] [c00000000138743c]
> dm_submit_bio_remap+0x4c/0x120
> [240888.558070] [c0000000295e7ef0] [c00800001bce26a8]
> dmcrypt_write+0x1a0/0x200 [dm_crypt]
> [240888.558131] [c0000000295e7f90] [c000000000299da8] kthread+0x188/0x1a0
> [240888.558196] [c0000000295e7fe0] [c00000000000ded8]
> start_kernel_thread+0x14/0x18
> [240888.558257] Code: f8c10060 f8010080 4b883b35 60000000 3c62ff82 38639388
> e8c10060 e9210068 e8a60000 7d244b78 4b53baf5 60000000 <0fe00000> 7c0802a6
> 7c852378 7c641b78
> [240888.558363] ---[ end trace 0000000000000000 ]---
> [240889.114586] pstore: backend (nvram) writing error (-1)
>
> [240889.114636] note: dmcrypt_write/2[2851] exited with irqs disabled
> [240889.114756] ------------[ cut here ]------------
> [240889.114785] WARNING: CPU: 33 PID: 2851 at kernel/exit.c:903
> do_exit+0x5c/0x5b0
> [240889.114837] Modules linked in: vendor_reset(OE) vfio_pci vfio_pci_core
> vfio_iommu_spapr_tce vfio iommufd vhost_net vhost vhost_iotlb tap tun
> nft_masq nft_reject_ipv4 act_csum cls_u32 sch_htb nf_nat_tftp
> nf_conntrack_tftp bridge stp llc kvm_hv kvm rfkill xt_conntrack nft_compat
> nft_fib_inet nft_fib_ipv4 nft_fib_ipv6 nft_fib nft_reject_inet nf_reject_ipv4
> nf_reject_ipv6 nft_reject nft_ct nft_chain_nat nf_nat nf_conntrack
> nf_defrag_ipv6 nf_defrag_ipv4 nf_tables dm_cache_smq dm_cache raid456
> async_raid6_recov async_memcpy async_pq async_xor async_tx sunrpc raid10
> snd_hda_intel at24 joydev snd_intel_dspcfg snd_hda_codec snd_hda_core
> onboard_usb_dev snd_hwdep snd_seq snd_seq_device snd_pcm ofpart tg3
> powernv_flash snd_timer atlantic vmx_crypto ipmi_powernv snd ipmi_devintf mtd
> ipmi_msghandler macsec rtc_opal opal_prd soundcore i2c_opal fuse dm_multipath
> loop nfnetlink zram lz4hc_compress lz4_compress xfs dm_thin_pool
> dm_persistent_data dm_bio_prison dm_crypt raid1 nvme mpt3sas nvme_core uas
> usb_storage ast nvme_keyring
> [240889.114993] nvme_auth hkdf raid_class i2c_algo_bit scsi_transport_sas
> scsi_dh_rdac scsi_dh_emc scsi_dh_alua i2c_dev aes_gcm_p10_crypto crypto_simd
> cryptd
> [240889.115528] CPU: 33 UID: 0 PID: 2851 Comm: dmcrypt_write/2 Tainted: G
> D OE 6.18.8-200.fc43.ppc64le #1 PREEMPT(voluntary)
> [240889.115641] Tainted: [D]=DIE, [O]=OOT_MODULE, [E]=UNSIGNED_MODULE
> [240889.115678] Hardware name: T2P9D01 REV 1.00 POWER9 (raw) 0x4e1202
> opal:skiboot-ecb1dc7 PowerNV
> [240889.115771] NIP: c00000000026035c LR: c000000000260950 CTR:
> 0000000000000000
> [240889.115828] REGS: c0000000295e7330 TRAP: 0700 Tainted: G D OE
> (6.18.8-200.fc43.ppc64le)
> [240889.115875] MSR: 9000000002029033 <SF,HV,VEC,EE,ME,IR,DR,RI,LE> CR:
> 24004280 XER: 20040000
> [240889.115936] CFAR: c00000000026094c IRQMASK: 0
> GPR00: c000000000260950 c0000000295e75d0 c0000000026ba900
> 0000000000000005
> GPR04: 0000000000002710 0000000000000001 0000001ffc2d0000
> 0000000000000001
> GPR08: 0000000000000005 0000000000000001 c0000000295e7f18
> 0000000000004000
> GPR12: c000201fff18ffa8 c000001ffffde600 0000000000000000
> 0000000000000000
> GPR16: 0000000000000000 0000000000000000 0000000000000000
> 0000000000000000
> GPR20: 0000000000000000 0000000000000000 0000000000000000
> 0000000000000000
> GPR24: 0000000000000000 0000000000000000 0000000000000000
> 0000000000000000
> GPR28: 0000000000000005 0000000000000003 c000000003e3a900
> c00000004555e800
> [240889.116379] NIP [c00000000026035c] do_exit+0x5c/0x5b0
> [240889.116407] LR [c000000000260950] make_task_dead+0xa0/0x1d0
> [240889.116427] Call Trace:
> [240889.116473] [c0000000295e75d0] [c0000000295e7600] 0xc0000000295e7600
> (unreliable)
> [240889.116549] [c0000000295e7670] [c000000000260950]
> make_task_dead+0xa0/0x1d0
> [240889.116621] [c0000000295e76f0] [c00000000002a314] oops_end+0x164/0x1a0
> [240889.116689] [c0000000295e7770] [c000000000009b2c]
> program_check_common_virt+0x3bc/0x3c0
> [240889.116749] ---- interrupt: 700 at __list_add_valid_or_report+0xdc/0x140
> [240889.116799] NIP: c000000000e098fc LR: c000000000e098f8 CTR:
> 0000000000000000
> [240889.116838] REGS: c0000000295e77a0 TRAP: 0700 Tainted: G D OE
> (6.18.8-200.fc43.ppc64le)
> [240889.116905] MSR: 9000000000029033 <SF,HV,EE,ME,IR,DR,RI,LE> CR:
> 24004280 XER: 00000000
> [240889.116967] CFAR: c00000000034542c IRQMASK: 0
> GPR00: c000000000e098f8 c0000000295e7a40 c0000000026ba900
> 0000000000000075
> GPR04: 00000000ffffbfff 0000000000000001 0000001ffc2d0000
> 0000000000000001
> GPR08: 0000000000000027 0000000000000000 0000000000000000
> c0000000295e7890
> GPR12: c000201fff18ffa8 c000001ffffde600 c000000000299c28
> c0002000113fdb40
> GPR16: 0000000000000000 0000000000000000 0000000000000000
> 0000000000000000
> GPR20: 0000000000000000 0000000000000000 0000000000000000
> 0000000000000001
> GPR24: 0000000000000000 c0000000295e7c70 c00000002bbce800
> c000001126d57830
> GPR28: 0000000000000000 c000201d2995b588 c00020005f464000
> c000201d2995b540
> [240889.117343] NIP [c000000000e098fc] __list_add_valid_or_report+0xdc/0x140
> [240889.117400] LR [c000000000e098f8] __list_add_valid_or_report+0xd8/0x140
> [240889.117454] ---- interrupt: 700
> [240889.117487] [c0000000295e7ab0] [c0080000238f0ad4]
> release_stripe_plug+0x9c/0x150 [raid456]
> [240889.117544] [c0000000295e7b00] [c0080000238f59f4]
> make_stripe_request+0x32c/0x560 [raid456]
> [240889.117605] [c0000000295e7bd0] [c0080000238f5df8]
> raid5_make_request+0x1d0/0x610 [raid456]
> [240889.117678] [c0000000295e7d10] [c000000001369a04]
> md_handle_request+0x1c4/0x400
> [240889.117738] [c0000000295e7da0] [c000000000d04010] __submit_bio+0x230/0x3d0
> [240889.117792] [c0000000295e7e40] [c000000000d04244]
> __submit_bio_noacct+0x94/0x250
> [240889.117859] [c0000000295e7eb0] [c00000000138743c]
> dm_submit_bio_remap+0x4c/0x120
> [240889.117956] [c0000000295e7ef0] [c00800001bce26a8]
> dmcrypt_write+0x1a0/0x200 [dm_crypt]
> [240889.118017] [c0000000295e7f90] [c000000000299da8] kthread+0x188/0x1a0
> [240889.118084] [c0000000295e7fe0] [c00000000000ded8]
> start_kernel_thread+0x14/0x18
> [240889.118131] Code: 91610008 f8010010 f821ff61 e92d0c78 f9210078 39200000
> 892d0932 552907fe 0b090000 e95f0c68 312affff 7d295110 <0b090000> ebbf0b90
> ebdf0b88 7fa3eb78
> [240889.118320] ---[ end trace 0000000000000000 ]---
>
>
> If that's any useful to reveal a bug, about the tainting module, it's a port
> of https://github.com/gnif/vendor-reset to ppc to reset some older graphics
> cards, nothing that should affect what the log is talking about, furthermore
> it's been here for quite a long time without any errors.
>
>
> Thanks
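
In case it helps anyone cross-checking the sda errors in the check log: the sector in each "I/O error" line can be recomputed from the Read(10) CDB logged just before it. The snippet below is only a rough sketch for that arithmetic. The helper name decode_read10 is made up for illustration, and it assumes the standard READ(10) layout (opcode 0x28, big-endian LBA in bytes 2-5, transfer length in bytes 7-8) and a 512-byte logical block as reported by sector_sz=512, so that the LBA equals the kernel sector number.

```python
def decode_read10(cdb_hex: str) -> tuple[int, int]:
    """Decode a SCSI READ(10) CDB given as space-separated hex bytes.

    Hypothetical helper for illustration; returns (lba, blocks).
    """
    cdb = bytes.fromhex(cdb_hex)  # whitespace between bytes is accepted
    if len(cdb) != 10 or cdb[0] != 0x28:
        raise ValueError("not a 10-byte READ(10) CDB")
    lba = int.from_bytes(cdb[2:6], "big")     # bytes 2-5: logical block address
    blocks = int.from_bytes(cdb[7:9], "big")  # bytes 7-8: transfer length in blocks
    return lba, blocks


if __name__ == "__main__":
    # CDB of tag#8069 from the log; the kernel reported sector 455659904.
    print(decode_read10("28 00 1b 28 d1 80 00 02 80 00"))  # (455659904, 640)
```

The decoded LBA for tag#8069 matches the sector in the following "I/O error" line, and the same holds for tag#7728 (sector 1953524992), so the failed reads are spread across the disk rather than clustered in one region.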
