Also reproducible w/ the 5.0.0-37.40 kernel. I'll try a mainline 5.5-rc6 build next.
[ 602.796765] Internal error: synchronous parity or ECC error: 96000018 [#1] SMP [ 602.803994] Modules linked in: nls_iso8859_1 cavium_rng_vf ipmi_ssif ipmi_devintf input_leds joydev ipmi_msghandler thunderx_edac cavium_rng sch_fq_codel ib_iser rdma_cm iw_cm ib_cm ib_core iscsi_tcp libiscsi_tcp libiscsi scsi_transport_iscsi ip_tables x_tables autofs4 btrfs zstd_compress raid10 raid456 async_raid6_recov async_memcpy async_pq async_xor async_tx xor xor_neon raid6_pq libcrc32c raid1 raid0 multipath linear aes_ce_blk aes_ce_cipher nicvf cavium_ptp ast i2c_algo_bit ttm drm_kms_helper crct10dif_ce ghash_ce syscopyarea sysfillrect sha2_ce sysimgblt uas hid_generic nicpf fb_sys_fops sha256_arm64 drm sha1_ce usbhid usb_storage hid thunder_bgx ahci thunder_xcv i2c_thunderx mdio_thunder thunderx_mmc mdio_cavium aes_neon_bs aes_neon_blk crypto_simd cryptd aes_arm64 [ 602.872414] Process cc1 (pid: 40126, stack limit = 0x0000000090887c2f) [ 602.878949] CPU: 10 PID: 40126 Comm: cc1 Not tainted 5.0.0-37-generic #40~18.04.1-Ubuntu [ 602.887040] Hardware name: GIGABYTE R120-T33/MT30-GS1, BIOS T49 02/02/2018 [ 602.893921] pstate: 80000005 (Nzcv daif -PAN -UAO) [ 602.898724] pc : __arch_copy_to_user+0x13c/0x248 [ 602.903353] lr : cp_new_stat+0x140/0x178 [ 602.907277] sp : ffff00002599bcc0 [ 602.910594] x29: ffff00002599bcc0 x28: ffff800ed0538ec0 [ 602.915912] x27: 0000000000000000 x26: 0000000000000000 [ 602.921229] x25: 0000000056000000 x24: 0000000000000015 [ 602.926547] x23: ffff000010c716d8 x22: 000000002599bd08 [ 602.931865] x21: ffff800ed0538ec0 x20: ffff00001170c000 [ 602.937181] x19: ffff00002599bdb0 x18: 0000000000000000 [ 602.942498] x17: 0000000000000000 x16: 0000000000000000 [ 602.947818] x15: 0000000000000000 x14: 0000000000000000 [ 602.953134] x13: 0000000000000000 x12: 0000000000000000 [ 602.958452] x11: 0000000000000000 x10: 000000000000152f [ 602.963769] x9 : 0000000000001000 x8 : 00000001000081a4 [ 602.969087] x7 : 0000000000a60da3 x6 : 000000002599bd20 [ 602.974405] x5 : 000000002599bd88 x4 : 0000000000000008 [ 602.979721] x3 : 0000000000000802 x2 : fffffffffffffff8 [ 602.985038] x1 : ffff00002599bd10 x0 : 000000002599bd08 [ 602.990356] Call trace: [ 602.992821] __arch_copy_to_user+0x13c/0x248 [ 602.997107] __se_sys_newfstat+0x58/0x88 [ 603.001045] __arm64_sys_newfstat+0x20/0x30 [ 603.005243] el0_svc_common+0x88/0x180 [ 603.009005] el0_svc_handler+0x38/0x78 [ 603.012770] el0_svc+0x8/0xc [ 603.015664] Code: a8c12027 a88120c7 d503201f d503201f (a8c12829) [ 603.021765] ---[ end trace 08068f2978fb8211 ]--- -- You received this bug notification because you are a member of Kernel Packages, which is subscribed to linux in Ubuntu. https://bugs.launchpad.net/bugs/1860013 Title: [thunderx] Synchronous External Abort: synchronous parity or ECC error Status in linux package in Ubuntu: Triaged Status in linux source package in Bionic: Confirmed Status in linux source package in Disco: Triaged Status in linux source package in Eoan: Triaged Status in linux source package in Focal: Triaged Bug description: [Impact] Under load, ThunderX systems eventually fail with: [ 282.360376] Synchronous External Abort: synchronous parity or ECC error (0x96000018) at 0x0000ffffa6eb7000 [ 282.372351] Internal error: : 96000018 [#1] SMP [ 282.379152] Modules linked in: nls_iso8859_1 thunderx_edac thunderx_zip shpchp cavium_rng_vf cavium_rng gpio_keys uio_pdrv_genirq uio ipmi_ssif ipmi_devintf ipmi_msghandler sch_fq_codel ib_iser rdma_cm iw_cm ib_cm ib_core iscsi_tcp libiscsi_tcp libiscsi scsi_transport_iscsi ip_tables x_tables autofs4 btrfs zstd_compress raid10 raid456 async_raid6_recov async_memcpy async_pq async_xor async_tx xor raid6_pq libcrc32c raid1 raid0 multipath linear nicvf nicpf uas usb_storage ast i2c_algo_bit ttm drm_kms_helper syscopyarea sysfillrect sysimgblt aes_ce_blk fb_sys_fops aes_ce_cipher drm crc32_ce crct10dif_ce ghash_ce sha2_ce sha256_arm64 sha1_ce ahci libahci thunder_bgx thunder_xcv i2c_thunderx mdio_thunder thunderx_mmc mdio_cavium aes_neon_bs aes_neon_blk crypto_simd cryptd aes_arm64 [ 282.467284] Process cc1 (pid: 39700, stack limit = 0x00000000e0c44146) [ 282.477172] CPU: 25 PID: 39700 Comm: cc1 Not tainted 4.15.0-75-generic #85+lp1857074.1 [ 282.488379] Hardware name: Cavium ThunderX CRB/To be filled by O.E.M., BIOS 5.11 12/12/2012 [ 282.500121] pstate: 80000005 (Nzcv daif -PAN -UAO) [ 282.508297] pc : __arch_copy_to_user+0x13c/0x248 [ 282.516430] lr : cp_new_stat+0x140/0x178 [ 282.523768] sp : ffff00002e4d3d40 [ 282.530369] x29: ffff00002e4d3d40 x28: ffff801f51fa2d00 [ 282.538988] x27: ffff000008b52000 x26: 0000000000000050 [ 282.548031] x25: 0000000000000124 x24: 0000000000000015 [ 282.556872] x23: 0000000000000000 x22: 000000002e4d3d88 [ 282.565449] x21: ffff801f51fa2d00 x20: ffff000009588000 [ 282.574109] x19: ffff00002e4d3e30 x18: 0000ffffa87e7a70 [ 282.582790] x17: 0000ffffa8756110 x16: ffff0000082f4448 [ 282.591433] x15: 0000000000000000 x14: 0000000000000012 [ 282.599986] x13: 00682e6c746e6366 x12: 2f78756e696c2f69 [ 282.608730] x11: 0000000000000000 x10: 0000000000000cf0 [ 282.617283] x9 : 0000000000001000 x8 : 00000001000081a4 [ 282.625839] x7 : 0000000001001a2b x6 : 000000002e4d3da0 [ 282.634238] x5 : 000000002e4d3e08 x4 : 0000000000000008 [ 282.642754] x3 : 0000000000000802 x2 : fffffffffffffff8 [ 282.651250] x1 : ffff00002e4d3d90 x0 : 000000002e4d3d88 [ 282.660013] Call trace: [ 282.665421] __arch_copy_to_user+0x13c/0x248 [ 282.672979] SyS_newfstat+0x58/0x88 [ 282.679272] el0_svc_naked+0x30/0x34 [ 282.685605] Code: a8c12027 a88120c7 d503201f d503201f (a8c12829) [ 282.694411] ---[ end trace 863693cf0c3fd297 ]--- [Test Case] We found this by doing a reboot/kernel build loop. (The reboot maybe unnecessary). Code to automate this setup is at: https://code.launchpad.net/~dannf/+git/kernel-build-reboot-loop [Fix] [Regression Risk] To manage notifications about this bug go to: https://bugs.launchpad.net/ubuntu/+source/linux/+bug/1860013/+subscriptions -- Mailing list: https://launchpad.net/~kernel-packages Post to : kernel-packages@lists.launchpad.net Unsubscribe : https://launchpad.net/~kernel-packages More help : https://help.launchpad.net/ListHelp