Sorry for duplicate but it happened after a disk started failing during a check 
that was triggered on all my 3 RAID arrays and then after a good while this bug 
happened. That's basically it.

[154043.105837] md: check of RAID array md125[154049.432225] md: check of RAID 
array md126
[154055.718196] md: check of RAID array md127
[163101.001572] sd 1:0:0:0: [sda] tag#8069 FAILED Result: 
hostbyte=DID_SOFT_ERROR driverbyte=DRIVER_OK cmd_age=0s
[163101.001655] sd 1:0:0:0: [sda] tag#8069 CDB: Read(10) 28 00 1b 28 d1 80 00 
02 80 00
[163101.001691] I/O error, dev sda, sector 455659904 op 0x0:(READ) flags 
0x80700 phys_seg 5 prio class 2
[163101.412714] sd 1:0:0:0: Power-on or device reset occurred
[163101.698759] sd 1:0:0:0: [sda] tag#7728 FAILED Result: 
hostbyte=DID_SOFT_ERROR driverbyte=DRIVER_OK cmd_age=0s
[163101.698813] sd 1:0:0:0: [sda] tag#7728 CDB: Read(10) 28 00 74 70 6d 00 00 
00 80 00
[163101.698843] I/O error, dev sda, sector 1953524992 op 0x0:(READ) flags 
0x80700 phys_seg 1 prio class 2
[163102.162693] sd 1:0:0:0: Power-on or device reset occurred
[163102.447648] sd 1:0:0:0: [sda] Unaligned partial completion (resid=866300, 
sector_sz=512)
[163102.447723] sd 1:0:0:0: [sda] tag#8049 CDB: Read(10) 28 00 5e 21 d4 00 00 
08 00 00
[163102.447751] sd 1:0:0:0: [sda] tag#8049 FAILED Result: 
hostbyte=DID_SOFT_ERROR driverbyte=DRIVER_OK cmd_age=0s
[163102.447789] sd 1:0:0:0: [sda] tag#8049 CDB: Read(10) 28 00 5e 21 d4 00 00 
08 00 00
[163102.447855] I/O error, dev sda, sector 1579275264 op 0x0:(READ) flags 
0x80700 phys_seg 8 prio class 2
[163102.912783] sd 1:0:0:0: Power-on or device reset occurred
[163103.662867] sd 1:0:0:0: Power-on or device reset occurred
[163104.413036] sd 1:0:0:0: Power-on or device reset occurred
[163105.163044] sd 1:0:0:0: Power-on or device reset occurred
[163105.913609] sd 1:0:0:0: Power-on or device reset occurred
[163106.663213] sd 1:0:0:0: Power-on or device reset occurred
[163107.289773] sd 1:0:0:0: Power-on or device reset occurred
[163107.932812] sd 1:0:0:0: Power-on or device reset occurred
[163108.913957] sd 1:0:0:0: Power-on or device reset occurred
[163109.664106] sd 1:0:0:0: Power-on or device reset occurred
[163110.414281] sd 1:0:0:0: Power-on or device reset occurred
[163111.164312] sd 1:0:0:0: Power-on or device reset occurred
[163111.913814] sd 1:0:0:0: Power-on or device reset occurred
[163112.663904] sd 1:0:0:0: Power-on or device reset occurred
[163113.414627] sd 1:0:0:0: Power-on or device reset occurred
[163113.699639] sd 1:0:0:0: [sda] Unaligned partial completion (resid=205820, 
sector_sz=512)
[163113.699771] sd 1:0:0:0: [sda] tag#7615 CDB: Read(10) 28 00 01 4d 84 00 00 
04 00 00
[163113.699976] sd 1:0:0:0: [sda] tag#7615 FAILED Result: 
hostbyte=DID_SOFT_ERROR driverbyte=DRIVER_OK cmd_age=0s
[163113.700198] sd 1:0:0:0: [sda] tag#7615 CDB: Read(10) 28 00 01 4d 84 00 00 
04 00 00
[163113.700329] I/O error, dev sda, sector 21857280 op 0x0:(READ) flags 0x80700 
phys_seg 4 prio class 2
[163114.164085] sd 1:0:0:0: Power-on or device reset occurred
[163114.914167] sd 1:0:0:0: Power-on or device reset occurred
[163115.664261] sd 1:0:0:0: Power-on or device reset occurred
[163115.959965] sd 1:0:0:0: [sda] tag#8044 FAILED Result: 
hostbyte=DID_SOFT_ERROR driverbyte=DRIVER_OK cmd_age=0s
[163115.959963] sd 1:0:0:0: [sda] Unaligned partial completion (resid=308220, 
sector_sz=512)
[163115.959996] sd 1:0:0:0: [sda] tag#8016 CDB: Read(10) 28 00 00 58 a0 00 00 
04 00 00
[163115.960041] sd 1:0:0:0: [sda] tag#8044 CDB: Read(10) 28 00 1b 48 fc 00 00 
04 00 00
[163115.960081] sd 1:0:0:0: [sda] tag#8016 FAILED Result: 
hostbyte=DID_SOFT_ERROR driverbyte=DRIVER_OK cmd_age=0s
[163115.960109] I/O error, dev sda, sector 457767936 op 0x0:(READ) flags 
0x80700 phys_seg 8 prio class 2
[163115.960155] sd 1:0:0:0: [sda] tag#8016 CDB: Read(10) 28 00 00 58 a0 00 00 
04 00 00
[163115.960402] I/O error, dev sda, sector 5808128 op 0x0:(READ) flags 0x80700 
phys_seg 4 prio class 2
[163116.414438] sd 1:0:0:0: Power-on or device reset occurred
[163116.706739] sd 1:0:0:0: [sda] tag#8103 FAILED Result: 
hostbyte=DID_SOFT_ERROR driverbyte=DRIVER_OK cmd_age=0s
[163116.706804] sd 1:0:0:0: [sda] tag#8103 CDB: Read(10) 28 00 0d 86 3c 00 00 
08 00 00
[163116.706853] I/O error, dev sda, sector 226900992 op 0x0:(READ) flags 
0x80700 phys_seg 12 prio class 2
[163117.109789] sd 1:0:0:0: Power-on or device reset occurred
[163117.914577] sd 1:0:0:0: Power-on or device reset occurred
[163497.189569] md: md126: check done.
[163829.231426] md: md127: check done.
[185096.100388] md: md125: check done.


and just after that, just the kernel logs you saw in my previous mail, nothing 
in between.
On Monday, February 9th, 2026 at 12:19 AM, jfiusdq <[email protected]> wrote:

> Hello,
> 

> 

> Today I was met with the following kernel log on IBM POWER9, abit worrying 
> because of it concerning RAID6:
> 

> 

> [240888.555387]  slab raid6-md125 start c000000d9371bf30 pointer offset 16 
> size 2544
> [240888.555464] list_add corruption. prev->next should be next 
> (c00000002a3fc3e0), but was c000000d9371bf40. (prev=c000000d9371bf40).
> [240888.555582] ------------[ cut here ]------------
> [240888.555615] kernel BUG at lib/list_debug.c:32!
> [240888.555650] Oops: Exception in kernel mode, sig: 5 [#1]
> [240888.555703] LE PAGE_SIZE=64K MMU=Radix  SMP NR_CPUS=2048 NUMA PowerNV
> [240888.555755] Modules linked in: vendor_reset(OE) vfio_pci vfio_pci_core 
> vfio_iommu_spapr_tce vfio iommufd vhost_net vhost vhost_iotlb tap tun 
> nft_masq nft_reject_ipv4 act_csum cls_u32 sch_htb nf_nat_tftp 
> nf_conntrack_tftp bridge stp llc kvm_hv kvm rfkill xt_conntrack nft_compat 
> nft_fib_inet nft_fib_ipv4 nft_fib_ipv6 nft_fib nft_reject_inet nf_reject_ipv4 
> nf_reject_ipv6 nft_reject nft_ct nft_chain_nat nf_nat nf_conntrack 
> nf_defrag_ipv6 nf_defrag_ipv4 nf_tables dm_cache_smq dm_cache raid456 
> async_raid6_recov async_memcpy async_pq async_xor async_tx sunrpc raid10 
> snd_hda_intel at24 joydev snd_intel_dspcfg snd_hda_codec snd_hda_core 
> onboard_usb_dev snd_hwdep snd_seq snd_seq_device snd_pcm ofpart tg3 
> powernv_flash snd_timer atlantic vmx_crypto ipmi_powernv snd ipmi_devintf mtd 
> ipmi_msghandler macsec rtc_opal opal_prd soundcore i2c_opal fuse dm_multipath 
> loop nfnetlink zram lz4hc_compress lz4_compress xfs dm_thin_pool 
> dm_persistent_data dm_bio_prison dm_crypt raid1 nvme mpt3sas nvme_core uas 
> usb_storage ast nvme_keyring
> [240888.555962]  nvme_auth hkdf raid_class i2c_algo_bit scsi_transport_sas 
> scsi_dh_rdac scsi_dh_emc scsi_dh_alua i2c_dev aes_gcm_p10_crypto crypto_simd 
> cryptd
> [240888.556531] CPU: 33 UID: 0 PID: 2851 Comm: dmcrypt_write/2 Tainted: G     
>       OE       6.18.8-200.fc43.ppc64le #1 PREEMPT(voluntary)
> [240888.556608] Tainted: [O]=OOT_MODULE, [E]=UNSIGNED_MODULE
> [240888.556650] Hardware name: T2P9D01 REV 1.00 POWER9 (raw) 0x4e1202 
> opal:skiboot-ecb1dc7 PowerNV
> [240888.556721] NIP:  c000000000e098fc LR: c000000000e098f8 CTR: 
> 0000000000000000
> [240888.556766] REGS: c0000000295e77a0 TRAP: 0700   Tainted: G           OE   
>      (6.18.8-200.fc43.ppc64le)
> [240888.556822] MSR:  9000000000029033 <SF,HV,EE,ME,IR,DR,RI,LE>  CR: 
> 24004280  XER: 00000000
> [240888.556904] CFAR: c00000000034542c IRQMASK: 0
>                 GPR00: c000000000e098f8 c0000000295e7a40 c0000000026ba900 
> 0000000000000075
>                 GPR04: 00000000ffffbfff 0000000000000001 0000001ffc2d0000 
> 0000000000000001
>                 GPR08: 0000000000000027 0000000000000000 0000000000000000 
> c0000000295e7890
>                 GPR12: c000201fff18ffa8 c000001ffffde600 c000000000299c28 
> c0002000113fdb40
>                 GPR16: 0000000000000000 0000000000000000 0000000000000000 
> 0000000000000000
>                 GPR20: 0000000000000000 0000000000000000 0000000000000000 
> 0000000000000001
>                 GPR24: 0000000000000000 c0000000295e7c70 c00000002bbce800 
> c000001126d57830
>                 GPR28: 0000000000000000 c000201d2995b588 c00020005f464000 
> c000201d2995b540
> [240888.557339] NIP [c000000000e098fc] __list_add_valid_or_report+0xdc/0x140
> [240888.557405] LR [c000000000e098f8] __list_add_valid_or_report+0xd8/0x140
> [240888.557463] Call Trace:
> [240888.557484] [c0000000295e7a40] [c000000000e098f8] 
> __list_add_valid_or_report+0xd8/0x140 (unreliable)
> [240888.557541] [c0000000295e7ab0] [c0080000238f0ad4] 
> release_stripe_plug+0x9c/0x150 [raid456]
> [240888.557607] [c0000000295e7b00] [c0080000238f59f4] 
> make_stripe_request+0x32c/0x560 [raid456]
> [240888.557678] [c0000000295e7bd0] [c0080000238f5df8] 
> raid5_make_request+0x1d0/0x610 [raid456]
> [240888.557765] [c0000000295e7d10] [c000000001369a04] 
> md_handle_request+0x1c4/0x400
> [240888.557850] [c0000000295e7da0] [c000000000d04010] __submit_bio+0x230/0x3d0
> [240888.557927] [c0000000295e7e40] [c000000000d04244] 
> __submit_bio_noacct+0x94/0x250
> [240888.557998] [c0000000295e7eb0] [c00000000138743c] 
> dm_submit_bio_remap+0x4c/0x120
> [240888.558070] [c0000000295e7ef0] [c00800001bce26a8] 
> dmcrypt_write+0x1a0/0x200 [dm_crypt]
> [240888.558131] [c0000000295e7f90] [c000000000299da8] kthread+0x188/0x1a0
> [240888.558196] [c0000000295e7fe0] [c00000000000ded8] 
> start_kernel_thread+0x14/0x18
> [240888.558257] Code: f8c10060 f8010080 4b883b35 60000000 3c62ff82 38639388 
> e8c10060 e9210068 e8a60000 7d244b78 4b53baf5 60000000 <0fe00000> 7c0802a6 
> 7c852378 7c641b78
> [240888.558363] ---[ end trace 0000000000000000 ]---
> [240889.114586] pstore: backend (nvram) writing error (-1)
> 

> [240889.114636] note: dmcrypt_write/2[2851] exited with irqs disabled
> [240889.114756] ------------[ cut here ]------------
> [240889.114785] WARNING: CPU: 33 PID: 2851 at kernel/exit.c:903 
> do_exit+0x5c/0x5b0
> [240889.114837] Modules linked in: vendor_reset(OE) vfio_pci vfio_pci_core 
> vfio_iommu_spapr_tce vfio iommufd vhost_net vhost vhost_iotlb tap tun 
> nft_masq nft_reject_ipv4 act_csum cls_u32 sch_htb nf_nat_tftp 
> nf_conntrack_tftp bridge stp llc kvm_hv kvm rfkill xt_conntrack nft_compat 
> nft_fib_inet nft_fib_ipv4 nft_fib_ipv6 nft_fib nft_reject_inet nf_reject_ipv4 
> nf_reject_ipv6 nft_reject nft_ct nft_chain_nat nf_nat nf_conntrack 
> nf_defrag_ipv6 nf_defrag_ipv4 nf_tables dm_cache_smq dm_cache raid456 
> async_raid6_recov async_memcpy async_pq async_xor async_tx sunrpc raid10 
> snd_hda_intel at24 joydev snd_intel_dspcfg snd_hda_codec snd_hda_core 
> onboard_usb_dev snd_hwdep snd_seq snd_seq_device snd_pcm ofpart tg3 
> powernv_flash snd_timer atlantic vmx_crypto ipmi_powernv snd ipmi_devintf mtd 
> ipmi_msghandler macsec rtc_opal opal_prd soundcore i2c_opal fuse dm_multipath 
> loop nfnetlink zram lz4hc_compress lz4_compress xfs dm_thin_pool 
> dm_persistent_data dm_bio_prison dm_crypt raid1 nvme mpt3sas nvme_core uas 
> usb_storage ast nvme_keyring
> [240889.114993]  nvme_auth hkdf raid_class i2c_algo_bit scsi_transport_sas 
> scsi_dh_rdac scsi_dh_emc scsi_dh_alua i2c_dev aes_gcm_p10_crypto crypto_simd 
> cryptd
> [240889.115528] CPU: 33 UID: 0 PID: 2851 Comm: dmcrypt_write/2 Tainted: G     
>  D    OE       6.18.8-200.fc43.ppc64le #1 PREEMPT(voluntary)
> [240889.115641] Tainted: [D]=DIE, [O]=OOT_MODULE, [E]=UNSIGNED_MODULE
> [240889.115678] Hardware name: T2P9D01 REV 1.00 POWER9 (raw) 0x4e1202 
> opal:skiboot-ecb1dc7 PowerNV
> [240889.115771] NIP:  c00000000026035c LR: c000000000260950 CTR: 
> 0000000000000000
> [240889.115828] REGS: c0000000295e7330 TRAP: 0700   Tainted: G      D    OE   
>      (6.18.8-200.fc43.ppc64le)
> [240889.115875] MSR:  9000000002029033 <SF,HV,VEC,EE,ME,IR,DR,RI,LE>  CR: 
> 24004280  XER: 20040000
> [240889.115936] CFAR: c00000000026094c IRQMASK: 0
>                 GPR00: c000000000260950 c0000000295e75d0 c0000000026ba900 
> 0000000000000005
>                 GPR04: 0000000000002710 0000000000000001 0000001ffc2d0000 
> 0000000000000001
>                 GPR08: 0000000000000005 0000000000000001 c0000000295e7f18 
> 0000000000004000
>                 GPR12: c000201fff18ffa8 c000001ffffde600 0000000000000000 
> 0000000000000000
>                 GPR16: 0000000000000000 0000000000000000 0000000000000000 
> 0000000000000000
>                 GPR20: 0000000000000000 0000000000000000 0000000000000000 
> 0000000000000000
>                 GPR24: 0000000000000000 0000000000000000 0000000000000000 
> 0000000000000000
>                 GPR28: 0000000000000005 0000000000000003 c000000003e3a900 
> c00000004555e800
> [240889.116379] NIP [c00000000026035c] do_exit+0x5c/0x5b0
> [240889.116407] LR [c000000000260950] make_task_dead+0xa0/0x1d0
> [240889.116427] Call Trace:
> [240889.116473] [c0000000295e75d0] [c0000000295e7600] 0xc0000000295e7600 
> (unreliable)
> [240889.116549] [c0000000295e7670] [c000000000260950] 
> make_task_dead+0xa0/0x1d0
> [240889.116621] [c0000000295e76f0] [c00000000002a314] oops_end+0x164/0x1a0
> [240889.116689] [c0000000295e7770] [c000000000009b2c] 
> program_check_common_virt+0x3bc/0x3c0
> [240889.116749] ---- interrupt: 700 at __list_add_valid_or_report+0xdc/0x140
> [240889.116799] NIP:  c000000000e098fc LR: c000000000e098f8 CTR: 
> 0000000000000000
> [240889.116838] REGS: c0000000295e77a0 TRAP: 0700   Tainted: G      D    OE   
>      (6.18.8-200.fc43.ppc64le)
> [240889.116905] MSR:  9000000000029033 <SF,HV,EE,ME,IR,DR,RI,LE>  CR: 
> 24004280  XER: 00000000
> [240889.116967] CFAR: c00000000034542c IRQMASK: 0
>                 GPR00: c000000000e098f8 c0000000295e7a40 c0000000026ba900 
> 0000000000000075
>                 GPR04: 00000000ffffbfff 0000000000000001 0000001ffc2d0000 
> 0000000000000001
>                 GPR08: 0000000000000027 0000000000000000 0000000000000000 
> c0000000295e7890
>                 GPR12: c000201fff18ffa8 c000001ffffde600 c000000000299c28 
> c0002000113fdb40
>                 GPR16: 0000000000000000 0000000000000000 0000000000000000 
> 0000000000000000
>                 GPR20: 0000000000000000 0000000000000000 0000000000000000 
> 0000000000000001
>                 GPR24: 0000000000000000 c0000000295e7c70 c00000002bbce800 
> c000001126d57830
>                 GPR28: 0000000000000000 c000201d2995b588 c00020005f464000 
> c000201d2995b540
> [240889.117343] NIP [c000000000e098fc] __list_add_valid_or_report+0xdc/0x140
> [240889.117400] LR [c000000000e098f8] __list_add_valid_or_report+0xd8/0x140
> [240889.117454] ---- interrupt: 700
> [240889.117487] [c0000000295e7ab0] [c0080000238f0ad4] 
> release_stripe_plug+0x9c/0x150 [raid456]
> [240889.117544] [c0000000295e7b00] [c0080000238f59f4] 
> make_stripe_request+0x32c/0x560 [raid456]
> [240889.117605] [c0000000295e7bd0] [c0080000238f5df8] 
> raid5_make_request+0x1d0/0x610 [raid456]
> [240889.117678] [c0000000295e7d10] [c000000001369a04] 
> md_handle_request+0x1c4/0x400
> [240889.117738] [c0000000295e7da0] [c000000000d04010] __submit_bio+0x230/0x3d0
> [240889.117792] [c0000000295e7e40] [c000000000d04244] 
> __submit_bio_noacct+0x94/0x250
> [240889.117859] [c0000000295e7eb0] [c00000000138743c] 
> dm_submit_bio_remap+0x4c/0x120
> [240889.117956] [c0000000295e7ef0] [c00800001bce26a8] 
> dmcrypt_write+0x1a0/0x200 [dm_crypt]
> [240889.118017] [c0000000295e7f90] [c000000000299da8] kthread+0x188/0x1a0
> [240889.118084] [c0000000295e7fe0] [c00000000000ded8] 
> start_kernel_thread+0x14/0x18
> [240889.118131] Code: 91610008 f8010010 f821ff61 e92d0c78 f9210078 39200000 
> 892d0932 552907fe 0b090000 e95f0c68 312affff 7d295110 <0b090000> ebbf0b90 
> ebdf0b88 7fa3eb78
> [240889.118320] ---[ end trace 0000000000000000 ]---
> 

> 

> If that's any useful to reveal a bug, about the tainting module, it's a port 
> of https://github.com/gnif/vendor-reset to ppc to reset some older graphics 
> cards, nothing that should affect what the log is talking about, furthermore 
> its been here for quite a long time without any errors.
> 

> 

> Thanks

Attachment: publickey - [email protected] - 0x344F580A.asc
Description: application/pgp-keys

Attachment: signature.asc
Description: OpenPGP digital signature

Reply via email to