------- Comment From [email protected] 2017-12-20 08:19 EDT-------
Hi,

I did some tests on a LPAR machine. I got the following:

[root@tuletapio1-lp1 memory-hotplug]# sh ./mem-on-off-test.sh -r 80
[...]
[  204.004656] Offlined Pages 4096
[  204.008378] Offlined Pages 4096
[  204.012030] Offlined Pages 4096
[  204.012201] Built 2 zonelists in Node order, mobility grouping on.  Total 
pages: 92347
[  204.012210] Policy zone: DMA
[  204.012283] ------------[ cut here ]------------
[  204.012286] kernel BUG at 
/home/jsalisbury/bugs/lp1706247/zesty/ubuntu-zesty/mm/slub.c:3993!
[  204.012291] Oops: Exception in kernel mode, sig: 5 [#1]
[  204.012294] SMP NR_CPUS=2048
[  204.012295] NUMA
[  204.012297] pSeries
[  204.012300] Modules linked in: xt_tcpudp ip6t_rpfilter ipt_REJECT 
nf_reject_ipv4 ip6t_REJECT nf_reject_ipv6 xt_conntrack ip_set nfnetlink 
ebtable_nat ebtable_broute bridge stp llc ip6table_nat nf_conntrack_ipv6 
nf_defrag_ipv6 nf_nat_ipv6 ip6table_mangle ip6table_security ip6table_raw 
iptable_nat nf_conntrack_ipv4 nf_defrag_ipv4 nf_nat_ipv4 nf_nat nf_conntrack 
iptable_mangle iptable_security iptable_raw ebtable_filter ebtables 
ip6table_filter ip6_tables iptable_filter btrfs xor raid6_pq vmx_crypto 
ip_tables x_tables xfs libcrc32c ibmveth ibmvscsi crc32c_vpmsum autofs4 [last 
unloaded: notifier_error_inject]
[  204.012343] CPU: 18 PID: 6298 Comm: sh Not tainted 4.10.0-42-generic 
#46~lp1706247
[  204.012347] task: c0000001f7dd8c00 task.stack: c0000001eef3c000
[  204.012350] NIP: c000000000311134 LR: c000000000311148 CTR: c000000000310f40
[  204.012353] REGS: c0000001eef3f6c0 TRAP: 0700   Not tainted  
(4.10.0-42-generic)
[  204.012356] MSR: 800000000282b033 <SF,VEC,VSX,EE,FP,ME,IR,DR,RI,LE>
[  204.012361]   CR: 24222828  XER: 00000001
[  204.012365] CFAR: c000000000311154 SOFTE: 1
GPR00: c000000000311148 c0000001eef3f940 c00000000145c900 0000000000000001
GPR04: c0000001fe010000 c0000001fe030000 c0000001fe030000 0000000000000000
GPR08: 0000000000000000 c0000001fe02fd88 0000000000000001 0000000000000000
GPR12: 0000000024222822 c00000000fb8a200 0000000000000bff f000000002ffffc0
GPR16: 0000000000000001 0000000000000000 c000000001483b00 0000000100001686
GPR20: 0000000000001000 0000000000000001 c0000001eef3fae0 c0000001ffec0600
GPR24: 0000000002fc0000 c0000000015ce368 f000000000000000 0000000000000000
GPR28: c000000001360738 0000000000000008 c000000001360758 c0000001fe02fd80
[  204.012404] NIP [c000000000311134] slab_memory_callback+0x1f4/0x2b0
[  204.012408] LR [c000000000311148] slab_memory_callback+0x208/0x2b0
[  204.012410] Call Trace:
[  204.012413] [c0000001eef3f940] [c000000000311148] 
slab_memory_callback+0x208/0x2b0 (unreliable)
[  204.012419] [c0000001eef3f9a0] [c000000000112834] 
notifier_call_chain+0xa4/0x110
[  204.012424] [c0000001eef3f9f0] [c000000000112d34] 
__blocking_notifier_call_chain+0x74/0xb0
[  204.012429] [c0000001eef3fa40] [c000000000779a00] memory_notify+0x30/0x50
[  204.012433] [c0000001eef3fa60] [c0000000003424cc] 
__offline_pages.constprop.8+0xa4c/0xa60
[  204.012437] [c0000001eef3fbb0] [c000000000778afc] 
memory_block_action+0x9c/0x240
[  204.012441] [c0000001eef3fc30] [c000000000779878] 
memory_subsys_offline+0x68/0xf0
[  204.012446] [c0000001eef3fc60] [c00000000074fb84] device_offline+0xf4/0x130
[  204.012449] [c0000001eef3fca0] [c0000000007797f8] store_mem_state+0x178/0x190
[  204.012453] [c0000001eef3fce0] [c00000000074ac7c] dev_attr_store+0x3c/0x60
[  204.012458] [c0000001eef3fd00] [c000000000405538] sysfs_kf_write+0x68/0xa0
[  204.012461] [c0000001eef3fd20] [c0000000004043cc] 
kernfs_fop_write+0x17c/0x250
[  204.012466] [c0000001eef3fd70] [c00000000034714c] __vfs_write+0x3c/0x70
[  204.012470] [c0000001eef3fd90] [c000000000348bd4] vfs_write+0xd4/0x240
[  204.012474] [c0000001eef3fde0] [c00000000034a788] SyS_write+0x68/0x110
[  204.012478] [c0000001eef3fe30] [c00000000000b184] system_call+0x38/0xe0
[  204.012481] Instruction dump:
[  204.012484] 3bffff98 419e0050 7bbd1f24 3b600000 60000000 60000000 60420000 
7d3fea14
[  204.012490] e8890108 2fa40000 419e001c e9440020 <0b0a0000> fb690108 3c62002d 
e863f9e0
[  204.012497] ---[ end trace 3cd9645673a966d8 ]---

I did run twice to get this error with 80% of ratio (using the -r 80) to
increase the chance to take some HugePage.

This problem is not occuring on a PowerNV and it is different from the
previous problem.

-- 
You received this bug notification because you are a member of Ubuntu
Bugs, which is subscribed to Ubuntu.
https://bugs.launchpad.net/bugs/1724120

Title:
  Ubuntu 16.04.3 - call traces occurs when memory-hotplug test is run
  with 16Gb hugepages configured

To manage notifications about this bug go to:
https://bugs.launchpad.net/ubuntu-power-systems/+bug/1724120/+subscriptions

-- 
ubuntu-bugs mailing list
[email protected]
https://lists.ubuntu.com/mailman/listinfo/ubuntu-bugs

Reply via email to