Hi all,
after installation of 5.4.0-47 I also got the impression that the bug was gone
and was happy.
Until now ... I'm getting this type of data corruption with the recent main
focal kernel:
Linux 5.4.0-47-generic #51-Ubuntu SMP Fri Sep 4 19:50:52 UTC 2020
x86_64 x86_64 x86_64 GNU/Linux
(relatively fresh Ubuntu 20.04 installation on ZFS after this bug
hopelessly corrupted the old ext4 installation)
Setup:
* remote NFS4 + krb5 ( over Wifi)
* local ZFS
Trigger:
* rsync'ing a large amount of data from ZFS (local) to NFS4 (remote)
Workqueue: rpciod rpc_async_schedule [sunrpc]
RIP:
#1: 0010:kmem_cache_free+0x237/0x2b0
#2: 0010:kmem_cache_alloc+0x7e/0x230
Any idea?
BR, Martin
[198007.326710] ------------[ cut here ]------------
[198007.326711] virt_to_cache: Object is not a Slab page!
[198007.326721] WARNING: CPU: 2 PID: 1317011 at mm/slab.h:473
kmem_cache_free+0x237/0x2b0
[198007.326722] Modules linked in: cx23885 altera_ci tda18271 altera_stapl
m88ds3103 tveeprom cx2341x videobuf2_dvb dvb_core rc_core videobuf2_dma_sg
videobuf2_memops videobuf2_v4l2 videobuf2_common btrfs xor zstd_compress
raid6_pq ufs qnx4 hfsplus hfs minix ntfs msdos jfs xfs libcrc32c
rpcsec_gss_krb5 auth_rpcgss nfsv4 nfs lockd grace fscache cmac algif_hash
algif_skcipher af_alg bnep nls_iso8859_1 si2157 si2168 cx25840 i2c_mux
snd_hda_codec_realtek snd_hda_codec_generic ledtrig_audio snd_hda_codec_hdmi
videodev snd_hda_intel snd_intel_dspcfg mc snd_hda_codec snd_hda_core snd_hwdep
mei_hdcp intel_rapl_msr snd_pcm snd_seq_midi snd_seq_midi_event
intel_rapl_common x86_pkg_temp_thermal intel_powerclamp snd_rawmidi kvm_intel
btusb btrtl snd_seq kvm btbcm btintel crct10dif_pclmul ghash_clmulni_intel
aesni_intel crypto_simd eeepc_wmi cryptd glue_helper snd_seq_device snd_timer
rapl intel_cstate bluetooth snd asus_wmi sparse_keymap ecdh_generic ecc
wmi_bmof cdc_acm mei_me soundcore mei mac_hid
[198007.326749] acpi_pad sch_fq_codel nct6775 hwmon_vid coretemp parport_pc
ppdev lp parport sunrpc ip_tables x_tables autofs4 zfs(POE) zunicode(POE)
zavl(POE) icp(POE) zcommon(POE) znvpair(POE) spl(OE) zlua(POE) hid_generic
usbhid hid i915 i2c_algo_bit mxm_wmi crc32_pclmul drm_kms_helper ahci libahci
syscopyarea r8169 lpc_ich i2c_i801 sysfillrect realtek sysimgblt fb_sys_fops
drm wmi video [last unloaded: dvb_core]
[198007.326765] CPU: 2 PID: 1317011 Comm: kworker/u8:3 Tainted: P OE
5.4.0-47-generic #51-Ubuntu
[198007.326766] Hardware name: ASUS All Series/H97M-E, BIOS 2702 03/28/2016
[198007.326804] Workqueue: rpciod rpc_async_schedule [sunrpc]
[198007.326809] RIP: 0010:kmem_cache_free+0x237/0x2b0
[198007.326810] Code: ff ff ff 80 3d a6 45 56 01 00 0f 85 39 ff ff ff 48 c7 c6
60 44 87 a5 48 c7 c7 00 2e b8 a5 c6 05 8b 45 56 01 01 e8 14 7f df ff <0f> 0b e9
18 ff ff ff 48 8b 57 58 49 8b 4f 58 48 c7 c6 70 44 87 a5
[198007.326811] RSP: 0018:ffffae38c34e3d20 EFLAGS: 00010282
[198007.326812] RAX: 0000000000000000 RBX: ffff927771c5355f RCX:
0000000000000006
[198007.326812] RDX: 0000000000000007 RSI: 0000000000000092 RDI:
ffff9277d79178c0
[198007.326813] RBP: ffffae38c34e3d48 R08: 0000000000000b72 R09:
0000000000000004
[198007.326813] R10: 0000000000000000 R11: 0000000000000001 R12:
ffff9277f1c5355f
[198007.326814] R13: 0000000000000000 R14: ffff92774426d080 R15:
ffff92779ea6acb0
[198007.326815] FS: 0000000000000000(0000) GS:ffff9277d7900000(0000)
knlGS:0000000000000000
[198007.326815] CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033
[198007.326816] CR2: 00007f19cc03b000 CR3: 00000000af60a005 CR4:
00000000001606e0
[198007.326816] Call Trace:
[198007.326823] mempool_free_slab+0x17/0x20
[198007.326825] mempool_free+0x2f/0x80
[198007.326846] rpc_free+0x47/0x60 [sunrpc]
[198007.326856] xprt_release+0x91/0x1a0 [sunrpc]
[198007.326863] rpc_release_resources_task+0x13/0x50 [sunrpc]
[198007.326869] __rpc_execute+0x182/0x3a0 [sunrpc]
[198007.326875] rpc_async_schedule+0x30/0x50 [sunrpc]
[198007.326877] process_one_work+0x1eb/0x3b0
[198007.326878] worker_thread+0x4d/0x400
[198007.326880] kthread+0x104/0x140
[198007.326881] ? process_one_work+0x3b0/0x3b0
[198007.326882] ? kthread_park+0x90/0x90
[198007.326885] ret_from_fork+0x35/0x40
[198007.326886] ---[ end trace c87e78ba40592766 ]---
[198010.422632] general protection fault: 0000 [#1] SMP PTI
[198010.422637] CPU: 1 PID: 1321230 Comm: kworker/u8:5 Tainted: P W OE
5.4.0-47-generic #51-Ubuntu
[198010.422638] Hardware name: ASUS All Series/H97M-E, BIOS 2702 03/28/2016
[198010.422661] Workqueue: rpciod rpc_async_schedule [sunrpc]
[198010.422666] RIP: 0010:kmem_cache_alloc+0x7e/0x230
[198010.422668] Code: 99 01 00 00 4d 8b 07 65 49 8b 50 08 65 4c 03 05 a0 91 56
5b 4d 8b 20 4d 85 e4 0f 84 85 01 00 00 41 8b 47 20 49 8b 3f 4c 01 e0 <48> 8b 18
48 89 c1 49 33 9f 70 01 00 00 4c 89 e0 48 0f c9 48 31 cb
[198010.422669] RSP: 0018:ffffae38e0a83cc8 EFLAGS: 00010206
[198010.422671] RAX: 7113d0192329439a RBX: 0000000000000000 RCX:
0000000000000002
[198010.422672] RDX: 000000000000004a RSI: 0000000000092800 RDI:
0000000000031ca0
[198010.422673] RBP: ffffae38e0a83cf8 R08: ffff9277d78b1ca0 R09:
0000000000000000
[198010.422674] R10: ffff92776f6aba2c R11: 0000000000000018 R12:
7113d0192329439a
[198010.422675] R13: 0000000000092800 R14: ffff9277d61cefc0 R15:
ffff9277d61cefc0
[198010.422677] FS: 0000000000000000(0000) GS:ffff9277d7880000(0000)
knlGS:0000000000000000
[198010.422678] CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033
[198010.422680] CR2: 00007f19cc00e000 CR3: 00000000af60a005 CR4:
00000000001606e0
[198010.422681] Call Trace:
[198010.422685] ? mempool_alloc_slab+0x17/0x20
[198010.422688] mempool_alloc_slab+0x17/0x20
[198010.422691] mempool_alloc+0x64/0x180
[198010.422703] rpc_malloc+0xa1/0xb0 [sunrpc]
[198010.422713] call_allocate+0xd1/0x1b0 [sunrpc]
[198010.422722] ? call_refreshresult+0x100/0x100 [sunrpc]
[198010.422731] __rpc_execute+0x8c/0x3a0 [sunrpc]
[198010.422741] rpc_async_schedule+0x30/0x50 [sunrpc]
[198010.422744] process_one_work+0x1eb/0x3b0
[198010.422746] worker_thread+0x4d/0x400
[198010.422749] kthread+0x104/0x140
[198010.422751] ? process_one_work+0x3b0/0x3b0
[198010.422753] ? kthread_park+0x90/0x90
[198010.422757] ret_from_fork+0x35/0x40
[198010.422759] Modules linked in: cx23885 altera_ci tda18271 altera_stapl
m88ds3103 tveeprom cx2341x videobuf2_dvb dvb_core rc_core videobuf2_dma_sg
videobuf2_memops videobuf2_v4l2 videobuf2_common btrfs xor zstd_compress
raid6_pq ufs qnx4 hfsplus hfs minix ntfs msdos jfs xfs libcrc32c
rpcsec_gss_krb5 auth_rpcgss nfsv4 nfs lockd grace fscache cmac algif_hash
algif_skcipher af_alg bnep nls_iso8859_1 si2157 si2168 cx25840 i2c_mux
snd_hda_codec_realtek snd_hda_codec_generic ledtrig_audio snd_hda_codec_hdmi
videodev snd_hda_intel snd_intel_dspcfg mc snd_hda_codec snd_hda_core snd_hwdep
mei_hdcp intel_rapl_msr snd_pcm snd_seq_midi snd_seq_midi_event
intel_rapl_common x86_pkg_temp_thermal intel_powerclamp snd_rawmidi kvm_intel
btusb btrtl snd_seq kvm btbcm btintel crct10dif_pclmul ghash_clmulni_intel
aesni_intel crypto_simd eeepc_wmi cryptd glue_helper snd_seq_device snd_timer
rapl intel_cstate bluetooth snd asus_wmi sparse_keymap ecdh_generic ecc
wmi_bmof cdc_acm mei_me soundcore mei mac_hid
[198010.422790] acpi_pad sch_fq_codel nct6775 hwmon_vid coretemp parport_pc
ppdev lp parport sunrpc ip_tables x_tables autofs4 zfs(POE) zunicode(POE)
zavl(POE) icp(POE) zcommon(POE) znvpair(POE) spl(OE) zlua(POE) hid_generic
usbhid hid i915 i2c_algo_bit mxm_wmi crc32_pclmul drm_kms_helper ahci libahci
syscopyarea r8169 lpc_ich i2c_i801 sysfillrect realtek sysimgblt fb_sys_fops
drm wmi video [last unloaded: dvb_core]
[198010.422841] ---[ end trace c87e78ba40592767 ]---
--
You received this bug notification because you are a member of Kernel
Packages, which is subscribed to linux in Ubuntu.
https://bugs.launchpad.net/bugs/1886277
Title:
Regression on NFS: unable to handle page fault in mempool_alloc_slab
Status in linux package in Ubuntu:
Fix Released
Status in linux source package in Focal:
Fix Committed
Bug description:
On kernel 5.4.0-40-generic in focal I'm getting errors like this on
several machines with different hardware in the first hour after boot:
Jul 04 16:58:32 hostname kernel: BUG: unable to handle page fault for
address: ffff9083e222e632
Jul 04 16:58:32 hostname kernel: #PF: supervisor read access in kernel mode
Jul 04 16:58:32 hostname kernel: #PF: error_code(0x0000) - not-present page
Jul 04 16:58:32 hostname kernel: PGD 3ac205067 P4D 3ac205067 PUD 0
Jul 04 16:58:32 hostname kernel: Oops: 0000 [#1] SMP NOPTI
Jul 04 16:58:32 hostname kernel: CPU: 4 PID: 289 Comm: kworker/u16:4 Tainted:
G OE 5.4.0-40-generic #44-Ubuntu
Jul 04 16:58:32 hostname kernel: Hardware name: LENOVO 20N2CTO1WW/20N2CTO1WW,
BIOS N2IET88W (1.66 ) 04/22/2020
Jul 04 16:58:32 hostname kernel: Workqueue: rpciod rpc_async_schedule [sunrpc]
Jul 04 16:58:32 hostname kernel: RIP: 0010:kmem_cache_alloc+0x7e/0x230
Jul 04 16:58:32 hostname kernel: Code: 99 01 00 00 4d 8b 07 65 49 8b 50 08 65
4c 03 05 40 9d 56 44 4d 8b 20 4d 85 e4 0f 84 85 01 00 00 41 8b 47 20 49 8b 3f
4c 01 e0 <48> 8b 18 48 89 c1 49 33 9f 70 01 00 00 4c 89 e0 48 0f c9 48 31 cb
Jul 04 16:58:32 hostname kernel: RSP: 0018:ffffbc38c046fcc8 EFLAGS: 00010282
Jul 04 16:58:32 hostname kernel: RAX: ffff9083e222e632 RBX: 0000000000000000
RCX: 0000000000000002
Jul 04 16:58:32 hostname kernel: RDX: 0000000000000009 RSI: 0000000000092800
RDI: 0000000000031fb0
Jul 04 16:58:32 hostname kernel: RBP: ffffbc38c046fcf8 R08: ffff90836c331fb0
R09: ffffffffc1436a94
Jul 04 16:58:32 hostname kernel: R10: ffff908368178d2c R11: 0000000000000018
R12: ffff9083e222e632
Jul 04 16:58:32 hostname kernel: R13: 0000000000092800 R14: ffff908367ca6140
R15: ffff908367ca6140
Jul 04 16:58:32 hostname kernel: FS: 0000000000000000(0000)
GS:ffff90836c300000(0000) knlGS:0000000000000000
Jul 04 16:58:32 hostname kernel: CS: 0010 DS: 0000 ES: 0000 CR0:
0000000080050033
Jul 04 16:58:32 hostname kernel: CR2: ffff9083e222e632 CR3: 00000003ab80a003
CR4: 00000000003606e0
Jul 04 16:58:32 hostname kernel: Call Trace:
Jul 04 16:58:32 hostname kernel: ? mempool_alloc_slab+0x17/0x20
Jul 04 16:58:32 hostname kernel: mempool_alloc_slab+0x17/0x20
Jul 04 16:58:32 hostname kernel: mempool_alloc+0x64/0x180
Jul 04 16:58:32 hostname kernel: rpc_malloc+0xa1/0xb0 [sunrpc]
Jul 04 16:58:32 hostname kernel: call_allocate+0xd1/0x1b0 [sunrpc]
Jul 04 16:58:32 hostname kernel: ? call_refreshresult+0x100/0x100 [sunrpc]
Jul 04 16:58:32 hostname kernel: __rpc_execute+0x8c/0x3a0 [sunrpc]
Jul 04 16:58:32 hostname kernel: rpc_async_schedule+0x30/0x50 [sunrpc]
Jul 04 16:58:32 hostname kernel: process_one_work+0x1eb/0x3b0
Jul 04 16:58:32 hostname kernel: worker_thread+0x4d/0x400
Jul 04 16:58:32 hostname kernel: kthread+0x104/0x140
Jul 04 16:58:32 hostname kernel: ? process_one_work+0x3b0/0x3b0
Jul 04 16:58:32 hostname kernel: ? kthread_park+0x90/0x90
Jul 04 16:58:32 hostname kernel: ret_from_fork+0x35/0x40
Jul 04 16:58:32 hostname kernel: Modules linked in: rfcomm rpcsec_gss_krb5
auth_rpcgss nfsv4 nfs lockd grace fscache vboxnetadp(OE) vboxnetflt(OE)
vboxdrv(OE) msr ccm cmac algif_hash algif_skcipher af_alg aufs bnep overlay
nls_iso8859_1 mei_hdcp intel_rapl_msr snd_s>
Jul 04 16:58:32 hostname kernel: nvram ledtrig_audio mei_me cfg80211 mei
processor_thermal_device snd_seq ucsi_acpi typec_ucsi intel_rapl_common
intel_soc_dts_iosf snd_seq_device typec intel_pch_thermal snd_timer snd
int3403_thermal soundcore int340x_thermal_zone i>
Jul 04 16:58:32 hostname kernel: pinctrl_cannonlake video pinctrl_intel
Jul 04 16:58:32 hostname kernel: CR2: ffff9083e222e632
Jul 04 16:58:32 hostname kernel: ---[ end trace cbbaed921eb439ce ]---
Jul 04 16:58:32 hostname kernel: RIP: 0010:kmem_cache_alloc+0x7e/0x230
Jul 04 16:58:32 hostname kernel: Code: 99 01 00 00 4d 8b 07 65 49 8b 50 08 65
4c 03 05 40 9d 56 44 4d 8b 20 4d 85 e4 0f 84 85 01 00 00 41 8b 47 20 49 8b 3f
4c 01 e0 <48> 8b 18 48 89 c1 49 33 9f 70 01 00 00 4c 89 e0 48 0f c9 48 31 cb
Jul 04 16:58:32 hostname kernel: RSP: 0018:ffffbc38c046fcc8 EFLAGS: 00010282
Jul 04 16:58:32 hostname kernel: RAX: ffff9083e222e632 RBX: 0000000000000000
RCX: 0000000000000002
Jul 04 16:58:32 hostname kernel: RDX: 0000000000000009 RSI: 0000000000092800
RDI: 0000000000031fb0
Jul 04 16:58:32 hostname kernel: RBP: ffffbc38c046fcf8 R08: ffff90836c331fb0
R09: ffffffffc1436a94
Jul 04 16:58:32 hostname kernel: R10: ffff908368178d2c R11: 0000000000000018
R12: ffff9083e222e632
Jul 04 16:58:32 hostname kernel: R13: 0000000000092800 R14: ffff908367ca6140
R15: ffff908367ca6140
Jul 04 16:58:32 hostname kernel: FS: 0000000000000000(0000)
GS:ffff90836c300000(0000) knlGS:0000000000000000
Jul 04 16:58:32 hostname kernel: CS: 0010 DS: 0000 ES: 0000 CR0:
0000000080050033
Jul 04 16:58:32 hostname kernel: CR2: ffff9083e222e632 CR3: 00000003ab80a003
CR4: 00000000003606e0
When booting 5.4.0-39-generic the problem does not occur.
---
ProblemType: Bug
ApportVersion: 2.20.11-0ubuntu27.3
Architecture: amd64
AudioDevicesInUse:
USER PID ACCESS COMMAND
/dev/snd/controlC0: lsysadmin 2042 F.... pulseaudio
CasperMD5CheckResult: skip
DistroRelease: Ubuntu 20.04
HibernationDevice: RESUME=UUID=9d3714bb-8799-42f9-a51d-790f87b0a7fc
MachineType: LENOVO 20N2CTO1WW
Package: linux (not installed)
ProcFB: 0 i915drmfb
ProcKernelCmdLine: BOOT_IMAGE=/vmlinuz-5.4.0-40-generic
root=/dev/mapper/vgmagiko-root ro quiet splash vt.handoff=7
ProcVersionSignature: Ubuntu 5.4.0-40.44-generic 5.4.44
PulseList: Error: command ['pacmd', 'list'] failed with exit code 1: No
PulseAudio daemon running, or not running as session daemon.
RelatedPackageVersions:
linux-restricted-modules-5.4.0-40-generic N/A
linux-backports-modules-5.4.0-40-generic N/A
linux-firmware 1.187.1
Tags: focal
Uname: Linux 5.4.0-40-generic x86_64
UpgradeStatus: No upgrade log present (probably fresh install)
UserGroups: N/A
_MarkForUpload: True
dmi.bios.date: 04/22/2020
dmi.bios.vendor: LENOVO
dmi.bios.version: N2IET88W (1.66 )
dmi.board.asset.tag: Not Available
dmi.board.name: 20N2CTO1WW
dmi.board.vendor: LENOVO
dmi.board.version: SDK0J40709 WIN
dmi.chassis.asset.tag: No Asset Information
dmi.chassis.type: 10
dmi.chassis.vendor: LENOVO
dmi.chassis.version: None
dmi.modalias:
dmi:bvnLENOVO:bvrN2IET88W(1.66):bd04/22/2020:svnLENOVO:pn20N2CTO1WW:pvrThinkPadT490:rvnLENOVO:rn20N2CTO1WW:rvrSDK0J40709WIN:cvnLENOVO:ct10:cvrNone:
dmi.product.family: ThinkPad T490
dmi.product.name: 20N2CTO1WW
dmi.product.sku: LENOVO_MT_20N2_BU_Think_FM_ThinkPad T490
dmi.product.version: ThinkPad T490
dmi.sys.vendor: LENOVO
To manage notifications about this bug go to:
https://bugs.launchpad.net/ubuntu/+source/linux/+bug/1886277/+subscriptions
--
Mailing list: https://launchpad.net/~kernel-packages
Post to : [email protected]
Unsubscribe : https://launchpad.net/~kernel-packages
More help : https://help.launchpad.net/ListHelp