Hello,
Today I was met with the following kernel log on IBM POWER9, abit worrying because of it concerning RAID6: [240888.555387] slab raid6-md125 start c000000d9371bf30 pointer offset 16 size 2544 [240888.555464] list_add corruption. prev->next should be next (c00000002a3fc3e0), but was c000000d9371bf40. (prev=c000000d9371bf40). [240888.555582] ------------[ cut here ]------------ [240888.555615] kernel BUG at lib/list_debug.c:32! [240888.555650] Oops: Exception in kernel mode, sig: 5 [#1] [240888.555703] LE PAGE_SIZE=64K MMU=Radix SMP NR_CPUS=2048 NUMA PowerNV [240888.555755] Modules linked in: vendor_reset(OE) vfio_pci vfio_pci_core vfio_iommu_spapr_tce vfio iommufd vhost_net vhost vhost_iotlb tap tun nft_masq nft_reject_ipv4 act_csum cls_u32 sch_htb nf_nat_tftp nf_conntrack_tftp bridge stp llc kvm_hv kvm rfkill xt_conntrack nft_compat nft_fib_inet nft_fib_ipv4 nft_fib_ipv6 nft_fib nft_reject_inet nf_reject_ipv4 nf_reject_ipv6 nft_reject nft_ct nft_chain_nat nf_nat nf_conntrack nf_defrag_ipv6 nf_defrag_ipv4 nf_tables dm_cache_smq dm_cache raid456 async_raid6_recov async_memcpy async_pq async_xor async_tx sunrpc raid10 snd_hda_intel at24 joydev snd_intel_dspcfg snd_hda_codec snd_hda_core onboard_usb_dev snd_hwdep snd_seq snd_seq_device snd_pcm ofpart tg3 powernv_flash snd_timer atlantic vmx_crypto ipmi_powernv snd ipmi_devintf mtd ipmi_msghandler macsec rtc_opal opal_prd soundcore i2c_opal fuse dm_multipath loop nfnetlink zram lz4hc_compress lz4_compress xfs dm_thin_pool dm_persistent_data dm_bio_prison dm_crypt raid1 nvme mpt3sas nvme_core uas usb_storage ast nvme_keyring [240888.555962] nvme_auth hkdf raid_class i2c_algo_bit scsi_transport_sas scsi_dh_rdac scsi_dh_emc scsi_dh_alua i2c_dev aes_gcm_p10_crypto crypto_simd cryptd [240888.556531] CPU: 33 UID: 0 PID: 2851 Comm: dmcrypt_write/2 Tainted: G OE 6.18.8-200.fc43.ppc64le #1 PREEMPT(voluntary) [240888.556608] Tainted: [O]=OOT_MODULE, [E]=UNSIGNED_MODULE [240888.556650] Hardware name: T2P9D01 REV 1.00 POWER9 (raw) 0x4e1202 opal:skiboot-ecb1dc7 PowerNV [240888.556721] NIP: c000000000e098fc LR: c000000000e098f8 CTR: 0000000000000000 [240888.556766] REGS: c0000000295e77a0 TRAP: 0700 Tainted: G OE (6.18.8-200.fc43.ppc64le) [240888.556822] MSR: 9000000000029033 <SF,HV,EE,ME,IR,DR,RI,LE> CR: 24004280 XER: 00000000 [240888.556904] CFAR: c00000000034542c IRQMASK: 0 GPR00: c000000000e098f8 c0000000295e7a40 c0000000026ba900 0000000000000075 GPR04: 00000000ffffbfff 0000000000000001 0000001ffc2d0000 0000000000000001 GPR08: 0000000000000027 0000000000000000 0000000000000000 c0000000295e7890 GPR12: c000201fff18ffa8 c000001ffffde600 c000000000299c28 c0002000113fdb40 GPR16: 0000000000000000 0000000000000000 0000000000000000 0000000000000000 GPR20: 0000000000000000 0000000000000000 0000000000000000 0000000000000001 GPR24: 0000000000000000 c0000000295e7c70 c00000002bbce800 c000001126d57830 GPR28: 0000000000000000 c000201d2995b588 c00020005f464000 c000201d2995b540 [240888.557339] NIP [c000000000e098fc] __list_add_valid_or_report+0xdc/0x140 [240888.557405] LR [c000000000e098f8] __list_add_valid_or_report+0xd8/0x140 [240888.557463] Call Trace: [240888.557484] [c0000000295e7a40] [c000000000e098f8] __list_add_valid_or_report+0xd8/0x140 (unreliable) [240888.557541] [c0000000295e7ab0] [c0080000238f0ad4] release_stripe_plug+0x9c/0x150 [raid456] [240888.557607] [c0000000295e7b00] [c0080000238f59f4] make_stripe_request+0x32c/0x560 [raid456] [240888.557678] [c0000000295e7bd0] [c0080000238f5df8] raid5_make_request+0x1d0/0x610 [raid456] [240888.557765] [c0000000295e7d10] [c000000001369a04] md_handle_request+0x1c4/0x400 [240888.557850] [c0000000295e7da0] [c000000000d04010] __submit_bio+0x230/0x3d0 [240888.557927] [c0000000295e7e40] [c000000000d04244] __submit_bio_noacct+0x94/0x250 [240888.557998] [c0000000295e7eb0] [c00000000138743c] dm_submit_bio_remap+0x4c/0x120 [240888.558070] [c0000000295e7ef0] [c00800001bce26a8] dmcrypt_write+0x1a0/0x200 [dm_crypt] [240888.558131] [c0000000295e7f90] [c000000000299da8] kthread+0x188/0x1a0 [240888.558196] [c0000000295e7fe0] [c00000000000ded8] start_kernel_thread+0x14/0x18 [240888.558257] Code: f8c10060 f8010080 4b883b35 60000000 3c62ff82 38639388 e8c10060 e9210068 e8a60000 7d244b78 4b53baf5 60000000 <0fe00000> 7c0802a6 7c852378 7c641b78 [240888.558363] ---[ end trace 0000000000000000 ]--- [240889.114586] pstore: backend (nvram) writing error (-1) [240889.114636] note: dmcrypt_write/2[2851] exited with irqs disabled [240889.114756] ------------[ cut here ]------------ [240889.114785] WARNING: CPU: 33 PID: 2851 at kernel/exit.c:903 do_exit+0x5c/0x5b0 [240889.114837] Modules linked in: vendor_reset(OE) vfio_pci vfio_pci_core vfio_iommu_spapr_tce vfio iommufd vhost_net vhost vhost_iotlb tap tun nft_masq nft_reject_ipv4 act_csum cls_u32 sch_htb nf_nat_tftp nf_conntrack_tftp bridge stp llc kvm_hv kvm rfkill xt_conntrack nft_compat nft_fib_inet nft_fib_ipv4 nft_fib_ipv6 nft_fib nft_reject_inet nf_reject_ipv4 nf_reject_ipv6 nft_reject nft_ct nft_chain_nat nf_nat nf_conntrack nf_defrag_ipv6 nf_defrag_ipv4 nf_tables dm_cache_smq dm_cache raid456 async_raid6_recov async_memcpy async_pq async_xor async_tx sunrpc raid10 snd_hda_intel at24 joydev snd_intel_dspcfg snd_hda_codec snd_hda_core onboard_usb_dev snd_hwdep snd_seq snd_seq_device snd_pcm ofpart tg3 powernv_flash snd_timer atlantic vmx_crypto ipmi_powernv snd ipmi_devintf mtd ipmi_msghandler macsec rtc_opal opal_prd soundcore i2c_opal fuse dm_multipath loop nfnetlink zram lz4hc_compress lz4_compress xfs dm_thin_pool dm_persistent_data dm_bio_prison dm_crypt raid1 nvme mpt3sas nvme_core uas usb_storage ast nvme_keyring [240889.114993] nvme_auth hkdf raid_class i2c_algo_bit scsi_transport_sas scsi_dh_rdac scsi_dh_emc scsi_dh_alua i2c_dev aes_gcm_p10_crypto crypto_simd cryptd [240889.115528] CPU: 33 UID: 0 PID: 2851 Comm: dmcrypt_write/2 Tainted: G D OE 6.18.8-200.fc43.ppc64le #1 PREEMPT(voluntary) [240889.115641] Tainted: [D]=DIE, [O]=OOT_MODULE, [E]=UNSIGNED_MODULE [240889.115678] Hardware name: T2P9D01 REV 1.00 POWER9 (raw) 0x4e1202 opal:skiboot-ecb1dc7 PowerNV [240889.115771] NIP: c00000000026035c LR: c000000000260950 CTR: 0000000000000000 [240889.115828] REGS: c0000000295e7330 TRAP: 0700 Tainted: G D OE (6.18.8-200.fc43.ppc64le) [240889.115875] MSR: 9000000002029033 <SF,HV,VEC,EE,ME,IR,DR,RI,LE> CR: 24004280 XER: 20040000 [240889.115936] CFAR: c00000000026094c IRQMASK: 0 GPR00: c000000000260950 c0000000295e75d0 c0000000026ba900 0000000000000005 GPR04: 0000000000002710 0000000000000001 0000001ffc2d0000 0000000000000001 GPR08: 0000000000000005 0000000000000001 c0000000295e7f18 0000000000004000 GPR12: c000201fff18ffa8 c000001ffffde600 0000000000000000 0000000000000000 GPR16: 0000000000000000 0000000000000000 0000000000000000 0000000000000000 GPR20: 0000000000000000 0000000000000000 0000000000000000 0000000000000000 GPR24: 0000000000000000 0000000000000000 0000000000000000 0000000000000000 GPR28: 0000000000000005 0000000000000003 c000000003e3a900 c00000004555e800 [240889.116379] NIP [c00000000026035c] do_exit+0x5c/0x5b0 [240889.116407] LR [c000000000260950] make_task_dead+0xa0/0x1d0 [240889.116427] Call Trace: [240889.116473] [c0000000295e75d0] [c0000000295e7600] 0xc0000000295e7600 (unreliable) [240889.116549] [c0000000295e7670] [c000000000260950] make_task_dead+0xa0/0x1d0 [240889.116621] [c0000000295e76f0] [c00000000002a314] oops_end+0x164/0x1a0 [240889.116689] [c0000000295e7770] [c000000000009b2c] program_check_common_virt+0x3bc/0x3c0 [240889.116749] ---- interrupt: 700 at __list_add_valid_or_report+0xdc/0x140 [240889.116799] NIP: c000000000e098fc LR: c000000000e098f8 CTR: 0000000000000000 [240889.116838] REGS: c0000000295e77a0 TRAP: 0700 Tainted: G D OE (6.18.8-200.fc43.ppc64le) [240889.116905] MSR: 9000000000029033 <SF,HV,EE,ME,IR,DR,RI,LE> CR: 24004280 XER: 00000000 [240889.116967] CFAR: c00000000034542c IRQMASK: 0 GPR00: c000000000e098f8 c0000000295e7a40 c0000000026ba900 0000000000000075 GPR04: 00000000ffffbfff 0000000000000001 0000001ffc2d0000 0000000000000001 GPR08: 0000000000000027 0000000000000000 0000000000000000 c0000000295e7890 GPR12: c000201fff18ffa8 c000001ffffde600 c000000000299c28 c0002000113fdb40 GPR16: 0000000000000000 0000000000000000 0000000000000000 0000000000000000 GPR20: 0000000000000000 0000000000000000 0000000000000000 0000000000000001 GPR24: 0000000000000000 c0000000295e7c70 c00000002bbce800 c000001126d57830 GPR28: 0000000000000000 c000201d2995b588 c00020005f464000 c000201d2995b540 [240889.117343] NIP [c000000000e098fc] __list_add_valid_or_report+0xdc/0x140 [240889.117400] LR [c000000000e098f8] __list_add_valid_or_report+0xd8/0x140 [240889.117454] ---- interrupt: 700 [240889.117487] [c0000000295e7ab0] [c0080000238f0ad4] release_stripe_plug+0x9c/0x150 [raid456] [240889.117544] [c0000000295e7b00] [c0080000238f59f4] make_stripe_request+0x32c/0x560 [raid456] [240889.117605] [c0000000295e7bd0] [c0080000238f5df8] raid5_make_request+0x1d0/0x610 [raid456] [240889.117678] [c0000000295e7d10] [c000000001369a04] md_handle_request+0x1c4/0x400 [240889.117738] [c0000000295e7da0] [c000000000d04010] __submit_bio+0x230/0x3d0 [240889.117792] [c0000000295e7e40] [c000000000d04244] __submit_bio_noacct+0x94/0x250 [240889.117859] [c0000000295e7eb0] [c00000000138743c] dm_submit_bio_remap+0x4c/0x120 [240889.117956] [c0000000295e7ef0] [c00800001bce26a8] dmcrypt_write+0x1a0/0x200 [dm_crypt] [240889.118017] [c0000000295e7f90] [c000000000299da8] kthread+0x188/0x1a0 [240889.118084] [c0000000295e7fe0] [c00000000000ded8] start_kernel_thread+0x14/0x18 [240889.118131] Code: 91610008 f8010010 f821ff61 e92d0c78 f9210078 39200000 892d0932 552907fe 0b090000 e95f0c68 312affff 7d295110 <0b090000> ebbf0b90 ebdf0b88 7fa3eb78 [240889.118320] ---[ end trace 0000000000000000 ]--- If that's any useful to reveal a bug, about the tainting module, it's a port of https://github.com/gnif/vendor-reset to ppc to reset some older graphics cards, nothing that should affect what the log is talking about, furthermore its been here for quite a long time without any errors. Thanks
publickey - [email protected] - 0x344F580A.asc
Description: application/pgp-keys
signature.asc
Description: OpenPGP digital signature
