[Kernel-packages] [Bug 1537666] Re: ISST-LTE: Ubuntu 14.04.4 LPAR interrupts at check_and_cede_processor

2017-02-06 Thread Andrew Cloke
As per comment #4, marking this Fix Released.

** Changed in: linux (Ubuntu)
   Status: Triaged => Fix Released

** Changed in: linux (Ubuntu)
 Assignee: Taco Screen team (taco-screen-team) => (unassigned)

-- 
You received this bug notification because you are a member of Kernel
Packages, which is subscribed to linux in Ubuntu.
https://bugs.launchpad.net/bugs/1537666

Title:
  ISST-LTE: Ubuntu 14.04.4 LPAR interrupts at check_and_cede_processor

Status in linux package in Ubuntu:
  Fix Released

Bug description:
  == Comment: #0 - YUECHANG E. MEI  - 2015-12-11 17:19:07 ==
  ---Problem Description---
  We have an Ubuntu 14.04.4 LPAR, conelp2. It is running the base, io, and tcp stress tests. When checking "dmesg", we see this interruption:

  [Fri Dec 11 13:58:50 2015] --- interrupt: 501 at plpar_hcall_norets+0x1c/0x28
  [Fri Dec 11 13:58:50 2015] LR = check_and_cede_processor+0x34/0x50

  In the previous test, conelp2 stopped all the stress tests by itself
  because it ran out of memory. Is the out-of-memory issue related to
  the interruption?


   
  Contact Information = Yuechang (Erin) Mei /ye...@us.ibm.com, Raja Sunkari /rajas...@in.ibm.com
   
  ---uname output---
  Linux conelp2 4.2.0-21-generic #25~14.04.1-Ubuntu SMP Thu Dec 3 13:55:42 UTC 2015 ppc64le ppc64le ppc64le GNU/Linux
   
  Machine Type = EUH Alpine 8408-E8E 
   
  ---Debugger---
  A debugger is not configured
   
  ---Steps to Reproduce---
   1. Install Ubuntu 14.04.4 in an LPAR, then update to the latest 14.04.4 kernel by using this workaround:
  echo "deb http://software.linux.ibm.com/pub/ubuntu-ppc64el-repository/ trusty-proposed main restricted universe multiverse" >> /etc/apt/sources.list

  apt-get update

  apt-get install linux-image-generic-lts-wily

  2. Set up the stress test, and start base, io, and tcp.
  3. After an hour, check dmesg; you will see the message about the interruption.
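
  The dmesg check in step 3 can be partly automated. The following is a minimal Python sketch, assuming the log was saved with human-readable timestamps as in this report. For each "--- interrupt:" marker it reports whether a page allocation failure was logged with the same timestamp in the supplied text. The function name classify_interrupt_markers and the sample text are illustrative only and do not come from any tool mentioned in this bug.

```python
import re

# One dmesg line with a human-readable timestamp: "[<timestamp>] <message>"
LINE = re.compile(r"^\s*\[(?P<ts>[^\]]+)\]\s+(?P<msg>.*)$")


def classify_interrupt_markers(log_text):
    """Return (timestamp, has_alloc_failure) for each '--- interrupt:' line.

    has_alloc_failure is True when a 'page allocation failure' line with the
    same timestamp appeared earlier in the supplied text.
    """
    failure_stamps = set()
    markers = []
    for raw in log_text.splitlines():
        match = LINE.match(raw)
        if not match:
            continue
        ts, msg = match.group("ts"), match.group("msg")
        if "page allocation failure" in msg:
            failure_stamps.add(ts)
        elif msg.startswith("--- interrupt:"):
            markers.append((ts, ts in failure_stamps))
    return markers


# Illustrative sample assembled from lines quoted in this report.
sample = """\
[Fri Dec 11 13:45:38 2015] swapper/127: page allocation failure: order:0, mode:0x120
[Fri Dec 11 13:45:38 2015] --- interrupt: 501 at plpar_hcall_norets+0x1c/0x28
[Fri Dec 11 13:58:50 2015] --- interrupt: 501 at plpar_hcall_norets+0x1c/0x28
"""

for stamp, has_failure in classify_interrupt_markers(sample):
    note = "alloc failure logged" if has_failure else "no alloc failure in this text"
    print(stamp, "->", note)
```

  A marker reported without a matching failure only means that the failure header is absent from the text that was scanned; the full dmesg output may still contain it.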
   
  Stack trace output:
   no
   
  Oops output:
   no
   
  System Dump Info:
   The system is not configured to capture a system dump.
   
  *Additional Instructions for Yuechang (Erin) Mei /ye...@us.ibm.com, Raja Sunkari /rajas...@in.ibm.com:
  -Post a private note with access information to the machine that the bug is occurring on.
  -Attach sysctl -a output to the bug.

  == Comment: #1 - YUECHANG E. MEI  - 2015-12-11 17:23:00 ==

  
  == Comment: #3 - YUECHANG E. MEI  - 2015-12-14 15:23:33 ==

  
  == Comment: #4 - MAMATHA INAMDAR  - 2015-12-15 03:56:14 ==
  dmesg shows a page allocation failure:

  [Fri Dec 11 13:45:38 2015] swapper/127: page allocation failure: order:0, mode:0x120
  [Fri Dec 11 13:45:38 2015] CPU: 127 PID: 0 Comm: swapper/127 Not tainted 4.2.0-21-generic #25~14.04.1-Ubuntu
  [Fri Dec 11 13:45:38 2015] Call Trace:
  [Fri Dec 11 13:45:38 2015] [c0027fbc3890] [c0a805ec] dump_stack+0x90/0xbc (unreliable)
  [Fri Dec 11 13:45:38 2015] [c0027fbc38c0] [c021c118] warn_alloc_failed+0x118/0x160
  [Fri Dec 11 13:45:38 2015] [c0027fbc3960] [c0221114] __alloc_pages_nodemask+0x834/0xa60
  [Fri Dec 11 13:45:38 2015] [c0027fbc3b10] [c0221404] __alloc_page_frag+0xc4/0x190
  [Fri Dec 11 13:45:38 2015] [c0027fbc3b50] [c08f6d20] netdev_alloc_frag+0x50/0x80
  [Fri Dec 11 13:45:38 2015] [c0027fbc3b80] [c0764e80] tg3_alloc_rx_data+0xa0/0x2c0
  [Fri Dec 11 13:45:38 2015] [c0027fbc3be0] [c0767344] tg3_poll_work+0x484/0x1070
  [Fri Dec 11 13:45:38 2015] [c0027fbc3ce0] [c0767f8c] tg3_poll_msix+0x5c/0x210
  [Fri Dec 11 13:45:38 2015] [c0027fbc3d30] [c090ebb8] net_rx_action+0x2d8/0x430
  [Fri Dec 11 13:45:38 2015] [c0027fbc3e40] [c00ba124] __do_softirq+0x174/0x390
  [Fri Dec 11 13:45:38 2015] [c0027fbc3f40] [c00ba6c8] irq_exit+0xc8/0x100
  [Fri Dec 11 13:45:38 2015] [c0027fbc3f60] [c00111ec] __do_irq+0x8c/0x190
  [Fri Dec 11 13:45:38 2015] [c0027fbc3f90] [c0024278] call_do_irq+0x14/0x24
  [Fri Dec 11 13:45:38 2015] [c002763a39b0] [c0011390] do_IRQ+0xa0/0x120
  [Fri Dec 11 13:45:38 2015] [c002763a3a10] [c00099b0] restore_check_irq_replay+0x2c/0x70
  [Fri Dec 11 13:45:38 2015] --- interrupt: 501 at plpar_hcall_norets+0x1c/0x28
  [Fri Dec 11 13:45:38 2015] LR = check_and_cede_processor+0x34/0x50
  [Fri Dec 11 13:45:38 2015] [c002763a3d00] [c08a8d90] check_and_cede_processor+0x20/0x50 (unreliable)
  [Fri Dec 11 13:45:38 2015] [c002763a3d60] [c08a8fb8] shared_cede_loop+0x68/0x170
  [Fri Dec 11 13:45:38 2015] [c002763a3da0] [c08a615c] cpuidle_enter_state+0xbc/0x350
  [Fri Dec 11 13:45:38 2015] [c002763a3e00] [c0110f3c] call_cpuidle+0x7c/0xd0
  [Fri Dec 11 13:45:38 2015] [c002763a3e40] [c01112d0] cpu_startup_entry+0x340/0x450
  [Fri Dec 11 13:45:38 2015] [c002763a3f10] [c0044ab4] start_secondary+0x364/0x3a0
  [Fri Dec 11 13:45:38 2015] [c002763a3f90] [c0008b6c] start_secondary_prolog+0x10/0x14
  [Fri Dec 11 13:45:38 2015] Mem-Info:
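
  To read a trace like the one in comment #4, it can help to decode the allocation mode and to split the frames at the "--- interrupt:" marker. The following is a minimal Python sketch under stated assumptions: the GFP bit values in GFP_BITS are recalled from include/linux/gfp.h in 4.2-era kernels and should be verified against the actual tree (later kernels renumbered them), and the names decode_mode and summarize are hypothetical helpers, not part of any kernel tooling.

```python
import re

# ASSUMPTION: bit values as recalled from include/linux/gfp.h in 4.2-era
# kernels. Later kernels changed these, so verify against the running tree.
GFP_BITS = {
    0x10: "__GFP_WAIT",
    0x20: "__GFP_HIGH",
    0x40: "__GFP_IO",
    0x80: "__GFP_FS",
    0x100: "__GFP_COLD",
    0x200: "__GFP_NOWARN",
}

# "\s*" after the comma tolerates the line wrapping seen in mail archives.
HEADER = re.compile(r"page allocation failure: order:(\d+),\s*mode:(0x[0-9a-fA-F]+)")
# A stack frame is printed as symbol+0xoffset/0xsize.
FRAME = re.compile(r"([A-Za-z_][\w.]*)\+0x[0-9a-f]+/0x[0-9a-f]+")


def decode_mode(mode):
    """Name the bits set in `mode`; bits outside the table are shown as hex."""
    names = [name for bit, name in sorted(GFP_BITS.items()) if mode & bit]
    unknown = mode & ~sum(GFP_BITS)
    if unknown:
        names.append(hex(unknown))
    return names


def summarize(trace_text):
    """Split a failure trace into the handler chain and the interrupted context."""
    header = HEADER.search(trace_text)
    if header is None:
        raise ValueError("no page allocation failure header found")
    before, _, after = trace_text.partition("--- interrupt:")
    return {
        "order": int(header.group(1)),
        "flags": decode_mode(int(header.group(2), 16)),
        "handler_chain": FRAME.findall(before),
        "interrupted": FRAME.findall(after),
    }


# Illustrative excerpt of the trace quoted in comment #4.
sample = """\
[Fri Dec 11 13:45:38 2015] swapper/127: page allocation failure: order:0, mode:0x120
[Fri Dec 11 13:45:38 2015] [c0027fbc3b50] [c08f6d20] netdev_alloc_frag+0x50/0x80
[Fri Dec 11 13:45:38 2015] [c0027fbc3b80] [c0764e80] tg3_alloc_rx_data+0xa0/0x2c0
[Fri Dec 11 13:45:38 2015] --- interrupt: 501 at plpar_hcall_norets+0x1c/0x28
[Fri Dec 11 13:45:38 2015] LR = check_and_cede_processor+0x34/0x50
[Fri Dec 11 13:45:38 2015] [c002763a3d60] [c08a8fb8] shared_cede_loop+0x68/0x170
"""

print(summarize(sample))
```

  Under the assumed table, mode:0x120 decodes to __GFP_HIGH and __GFP_COLD, which would be consistent with a non-sleeping allocation made from the tg3 receive path. The frames after the marker belong to the idle loop that was running when the interrupt arrived, which suggests the "interrupt: 501" line is part of the allocation failure report rather than a separate fault.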

[Kernel-packages] [Bug 1537666] Re: ISST-LTE: Ubuntu 14.04.4 LPAR interrupts at check_and_cede_processor

2016-02-01 Thread bugproxy
** Tags removed: targetmilestone-inin---
** Tags added: targetmilestone-inin14044

Status in linux package in Ubuntu:
  Triaged


[Kernel-packages] [Bug 1537666] Re: ISST-LTE: Ubuntu 14.04.4 LPAR interrupts at check_and_cede_processor

2016-01-29 Thread Christopher M. Penalver
** Changed in: linux (Ubuntu)
   Status: New => Triaged

Status in linux package in Ubuntu:
  Triaged


[Kernel-packages] [Bug 1537666] Re: ISST-LTE: Ubuntu 14.04.4 LPAR interrupts at check_and_cede_processor

2016-01-25 Thread Christopher M. Penalver
** Changed in: linux (Ubuntu)
   Importance: Undecided => Medium

Status in linux package in Ubuntu:
  New


[Kernel-packages] [Bug 1537666] Re: ISST-LTE: Ubuntu 14.04.4 LPAR interrupts at check_and_cede_processor

2016-01-25 Thread Steve Langasek
** Package changed: ubuntu => linux (Ubuntu)

Status in linux package in Ubuntu:
  New
