[Kernel-packages] [Bug 1925124] Re: panic in ipt_do_table+0x621/0x640

2021-04-20 Thread chudihuang
** Description changed:

  1. This bug has occurred on the following kernel versions:
  
  4.4.0-104-generic
- 4.15.0-88-generic
  4.15.0-13-generic
  5.4.0-42-generic
  5.4.0-47-generic
  
  2. We cannot reliably reproduce this issue.
  
  3. This patch seems to fix the issue:
  
https://git.kernel.org/pub/scm/linux/kernel/git/torvalds/linux.git/commit/net/netfilter/x_tables.c?id=175e476b8cdf2a4de7432583b49c871345e4f8a1
  
  4. The following crash dmesg is from 5.4.0-47-generic:
  [2021-03-19 12:17:53][3581678.514072] audit: audit_lost=1 audit_rate_limit=1 
audit_backlog_limit=1024
  [2021-03-23 21:57:06][3962019.737898] general protection fault:  [#1] SMP 
PTI
  [2021-03-23 21:57:06][3962019.739019] CPU: 4 PID: 0 Comm: swapper/4 Kdump: 
loaded Not tainted 5.4.0-47-generic #51~18.04.1-Ubuntu
  [2021-03-23 21:57:06][3962019.740433] Hardware name: Tencent Cloud CVM, BIOS 
seabios-1.9.1-qemu-project.org 04/01/2014
  [2021-03-23 21:57:06][3962019.741829] RIP: 0010:ipt_do_table+0x621/0x640 
[ip_tables]
  [2021-03-23 21:57:06][3962019.742587] Code: ff ff ff 45 31 d2 48 89 85 50 ff 
ff ff e9 7e fe ff ff 83 ad 70 ff ff ff 01 48 8b bd 78 ff ff ff 8b 85 70 ff ff 
ff 4c 8b 34 c7 <41> 0f b7 46 5a 49 01 c6 e9 [2021-03-23 21:57:06]18 fb ff ff c7 
45 a4 00 00 00 00 e9 e9
  [2021-03-23 21:57:06][3962019.745295] RSP: 0018:bcd080184c18 EFLAGS: 
00010246
  [2021-03-23 21:57:06][3962019.745988] RAX:  RBX: 
96b7e9b717b0 RCX: dcd07f70a150
  [2021-03-23 21:57:06][3962019.747025] RDX: 0001ab6a RSI: 
96b7e1d98d80 RDI: 96b7e1d98d80
  [2021-03-23 21:57:06][3962019.748015] RBP: bcd080184d08 R08: 
0008 R09: 96b82aca2980
  [2021-03-23 21:57:06][3962019.748988] R10: 0d38 R11: 
96b7cacaa380 R12: 96b833fea000
  [2021-03-23 21:57:06][3962019.749927] R13: 96b7ebb4a04e R14: 
3f13d7139e6d8ccb R15: 96b7d2c28f00
  [2021-03-23 21:57:06][3962019.750873] FS:  () 
GS:96b83fb0() knlGS:
  [2021-03-23 21:57:06][3962019.751919] CS:  0010 DS:  ES:  CR0: 
80050033
  [2021-03-23 21:57:06][3962019.752720] CR2: 7fee206258f0 CR3: 
000f9c428001 CR4: 003606e0
  [2021-03-23 21:57:06][3962019.753603] DR0:  DR1: 
 DR2: 
  [2021-03-23 21:57:06][3962019.754619] DR3:  DR6: 
fffe0ff0 DR7: 0400
  [2021-03-23 21:57:06][3962019.755606] Call Trace:
  [2021-03-23 21:57:06][3962019.755984]  
  [2021-03-23 21:57:06][3962019.756282]  iptable_filter_hook+0x1f/0x30 
[iptable_filter]
  [2021-03-23 21:57:06][3962019.757199]  nf_hook_slow+0x48/0xc0
  [2021-03-23 21:57:06][3962019.757753]  ip_forward+0x410/0x480
  [2021-03-23 21:57:06][3962019.758350]  ? ip4_key_hashfn+0xc0/0xc0
  [2021-03-23 21:57:06][3962019.758907]  ip_rcv_finish+0x84/0xa0
  [2021-03-23 21:57:06][3962019.759469]  ip_rcv+0xbc/0xd0
  [2021-03-23 21:57:06][3962019.760543]  ? 
ip_rcv_finish_core.isra.18+0x3b0/0x3b0
  [2021-03-23 21:57:06][3962019.761916]  __netif_receive_skb_one_core+0x86/0xa0
  [2021-03-23 21:57:06][3962019.763210]  __netif_receive_skb+0x18/0x60
  [2021-03-23 21:57:06][3962019.764341]  process_backlog+0xa0/0x170
  [2021-03-23 21:57:06][3962019.765600]  net_rx_action+0x140/0x3c0
  [2021-03-23 21:57:06][3962019.766550]  __do_softirq+0xe4/0x2da
  [2021-03-23 21:57:06][3962019.767792]  irq_exit+0xae/0xb0
  [2021-03-23 21:57:06][3962019.768857]  
smp_call_function_single_interrupt+0x48/0xd0
  [2021-03-23 21:57:06][3962019.770140]  call_function_single_interrupt+0xf/0x20
  [2021-03-23 21:57:06][3962019.771315]  
  [2021-03-23 21:57:06][3962019.772038] RIP: 0010:native_safe_halt+0x12/0x20

-- 
You received this bug notification because you are a member of Kernel
Packages, which is subscribed to linux in Ubuntu.
https://bugs.launchpad.net/bugs/1925124

Title:
  panic in ipt_do_table+0x621/0x640

Status in linux package in Ubuntu:
  Incomplete

Bug description:
  1. This bug has occurred on the following kernel versions:

  4.4.0-104-generic
  4.15.0-13-generic
  5.4.0-42-generic
  5.4.0-47-generic

  2. We cannot reliably reproduce this issue.

  3. This patch seems to fix the issue:
  
https://git.kernel.org/pub/scm/linux/kernel/git/torvalds/linux.git/commit/net/netfilter/x_tables.c?id=175e476b8cdf2a4de7432583b49c871345e4f8a1

  4. The following crash dmesg is from 5.4.0-47-generic:
  [2021-03-19 12:17:53][3581678.514072] audit: audit_lost=1 audit_rate_limit=1 
audit_backlog_limit=1024
  [2021-03-23 21:57:06][3962019.737898] general protection fault:  [#1] SMP 
PTI
  [2021-03-23 21:57:06][3962019.739019] CPU: 4 PID: 0 Comm: swapper/4 Kdump: 
loaded Not tainted 5.4.0-47-generic #51~18.04.1-Ubuntu
  [2021-03-23 21:57:06][3962019.740433] Hardware name: Tencent Cloud CVM, BIOS 
seabios-1.9.1-qemu-project.org 04/01/2014
  [2021-03-23 21:57:06][3962019.741829] RIP: 0010:ipt_do_table+0x621/0x640 
[ip_tables]
  [2021-03-23 21:57:06][3962019.742587] Code: ff ff ff 

[Kernel-packages] [Bug 1925124] Re: panic in ipt_do_table+0x621/0x640

2021-04-19 Thread chudihuang
** Description changed:

  1. This bug has occurred on the following kernel versions:
  
- 4.4.0-104-generic  
- 4.15.0-88-generic 
- 4.15.0-13-generic  
- 5.4.0-42-generic 
- 5.4.0-47-generic  
+ 4.4.0-104-generic
+ 4.15.0-88-generic
+ 4.15.0-13-generic
+ 5.4.0-42-generic
+ 5.4.0-47-generic
  
  2. We cannot reliably reproduce this issue.
  
  3. This patch seems to fix the issue:
  
https://git.kernel.org/pub/scm/linux/kernel/git/torvalds/linux.git/commit/net/netfilter/x_tables.c?id=175e476b8cdf2a4de7432583b49c871345e4f8a1
  
  4. The following crash dmesg is from 5.4.0-47-generic:
  [2021-03-19 12:17:53][3581678.514072] audit: audit_lost=1 audit_rate_limit=1 
audit_backlog_limit=1024
  [2021-03-23 21:57:06][3962019.737898] general protection fault:  [#1] SMP 
PTI
  [2021-03-23 21:57:06][3962019.739019] CPU: 4 PID: 0 Comm: swapper/4 Kdump: 
loaded Not tainted 5.4.0-47-generic #51~18.04.1-Ubuntu
  [2021-03-23 21:57:06][3962019.740433] Hardware name: Tencent Cloud CVM, BIOS 
seabios-1.9.1-qemu-project.org 04/01/2014
  [2021-03-23 21:57:06][3962019.741829] RIP: 0010:ipt_do_table+0x621/0x640 
[ip_tables]
  [2021-03-23 21:57:06][3962019.742587] Code: ff ff ff 45 31 d2 48 89 85 50 ff 
ff ff e9 7e fe ff ff 83 ad 70 ff ff ff 01 48 8b bd 78 ff ff ff 8b 85 70 ff ff 
ff 4c 8b 34 c7 <41> 0f b7 46 5a 49 01 c6 e9 [2021-03-23 21:57:06]18 fb ff ff c7 
45 a4 00 00 00 00 e9 e9
  [2021-03-23 21:57:06][3962019.745295] RSP: 0018:bcd080184c18 EFLAGS: 
00010246
  [2021-03-23 21:57:06][3962019.745988] RAX:  RBX: 
96b7e9b717b0 RCX: dcd07f70a150
  [2021-03-23 21:57:06][3962019.747025] RDX: 0001ab6a RSI: 
96b7e1d98d80 RDI: 96b7e1d98d80
  [2021-03-23 21:57:06][3962019.748015] RBP: bcd080184d08 R08: 
0008 R09: 96b82aca2980
  [2021-03-23 21:57:06][3962019.748988] R10: 0d38 R11: 
96b7cacaa380 R12: 96b833fea000
  [2021-03-23 21:57:06][3962019.749927] R13: 96b7ebb4a04e R14: 
3f13d7139e6d8ccb R15: 96b7d2c28f00
  [2021-03-23 21:57:06][3962019.750873] FS:  () 
GS:96b83fb0() knlGS:
  [2021-03-23 21:57:06][3962019.751919] CS:  0010 DS:  ES:  CR0: 
80050033
  [2021-03-23 21:57:06][3962019.752720] CR2: 7fee206258f0 CR3: 
000f9c428001 CR4: 003606e0
  [2021-03-23 21:57:06][3962019.753603] DR0:  DR1: 
 DR2: 
  [2021-03-23 21:57:06][3962019.754619] DR3:  DR6: 
fffe0ff0 DR7: 0400
  [2021-03-23 21:57:06][3962019.755606] Call Trace:
  [2021-03-23 21:57:06][3962019.755984]  
  [2021-03-23 21:57:06][3962019.756282]  iptable_filter_hook+0x1f/0x30 
[iptable_filter]
  [2021-03-23 21:57:06][3962019.757199]  nf_hook_slow+0x48/0xc0
  [2021-03-23 21:57:06][3962019.757753]  ip_forward+0x410/0x480
  [2021-03-23 21:57:06][3962019.758350]  ? ip4_key_hashfn+0xc0/0xc0
  [2021-03-23 21:57:06][3962019.758907]  ip_rcv_finish+0x84/0xa0
  [2021-03-23 21:57:06][3962019.759469]  ip_rcv+0xbc/0xd0
  [2021-03-23 21:57:06][3962019.760543]  ? 
ip_rcv_finish_core.isra.18+0x3b0/0x3b0
  [2021-03-23 21:57:06][3962019.761916]  __netif_receive_skb_one_core+0x86/0xa0
  [2021-03-23 21:57:06][3962019.763210]  __netif_receive_skb+0x18/0x60
  [2021-03-23 21:57:06][3962019.764341]  process_backlog+0xa0/0x170
  [2021-03-23 21:57:06][3962019.765600]  net_rx_action+0x140/0x3c0
  [2021-03-23 21:57:06][3962019.766550]  __do_softirq+0xe4/0x2da
  [2021-03-23 21:57:06][3962019.767792]  irq_exit+0xae/0xb0
  [2021-03-23 21:57:06][3962019.768857]  
smp_call_function_single_interrupt+0x48/0xd0
  [2021-03-23 21:57:06][3962019.770140]  call_function_single_interrupt+0xf/0x20
  [2021-03-23 21:57:06][3962019.771315]  
  [2021-03-23 21:57:06][3962019.772038] RIP: 0010:native_safe_halt+0x12/0x20
- [2021-03-23 21:57:06][3962019.773239] Code: 00 0f 00 2d 32 98 53 00 f4 5d c3 
0f 1f 00 66 2e 0f 1f 84 00 00 00 00 00 55 48 89 e5 e9 07 00 00 00 0f 00 2d 12 
98 53 00 fb f4 <5d> c3 90 90 90 90 90 90 90 [2021-03-23 21:57:06]90 90 90 90 90 
0f 1f 44 00 00 55 48 89
- [2021-03-23 21:57:06][3962019.776646] RSP: 0018:bcd080093e70 EFLAGS: 
0246 ORIG_RAX: ff04
- [2021-03-23 21:57:06][3962019.778015] RAX: 93ad22f0 RBX: 
0004 RCX: 0001
- [2021-03-23 21:57:06][3962019.779356] RDX: 96b83fb2ba80 RSI: 
bcd080093e40 RDI: 
- [2021-03-23 21:57:06][3962019.780633] RBP: bcd080093e70 R08: 
0010e421e97a71a1 R09: 0001
- [2021-03-23 21:57:06][3962019.782033] R10: 007e8c00 R11: 
0fca3b36 R12: 0004
- [2021-03-23 21:57:06][3962019.783373] R13:  R14: 
 R15: 
- [2021-03-23 21:57:06][3962019.784725]  ? ldsem_down_write+0x230/0x230
- [2021-03-23 21:57:06][3962019.785741]  default_idle+0x22/0x150
- [2021-03-23 21:57:06][3962019.786673]  arch_cpu_idle+0x15/0x20
- [2021-03-23 21:57:06][3962019.787667]  

[Kernel-packages] [Bug 1925124] [NEW] panic in ipt_do_table+0x621/0x640

2021-04-19 Thread chudihuang
Public bug reported:

1. This bug has occurred on the following kernel versions:

4.4.0-104-generic  
4.15.0-88-generic 
4.15.0-13-generic  
5.4.0-42-generic 
5.4.0-47-generic  

2. We cannot reliably reproduce this issue.

3. This patch seems to fix the issue:
https://git.kernel.org/pub/scm/linux/kernel/git/torvalds/linux.git/commit/net/netfilter/x_tables.c?id=175e476b8cdf2a4de7432583b49c871345e4f8a1

4. The following crash dmesg is from 5.4.0-47-generic:
[2021-03-19 12:17:53][3581678.514072] audit: audit_lost=1 audit_rate_limit=1 audit_backlog_limit=1024
[2021-03-23 21:57:06][3962019.737898] general protection fault:  [#1] SMP PTI
[2021-03-23 21:57:06][3962019.739019] CPU: 4 PID: 0 Comm: swapper/4 Kdump: loaded Not tainted 5.4.0-47-generic #51~18.04.1-Ubuntu
[2021-03-23 21:57:06][3962019.740433] Hardware name: Tencent Cloud CVM, BIOS seabios-1.9.1-qemu-project.org 04/01/2014
[2021-03-23 21:57:06][3962019.741829] RIP: 0010:ipt_do_table+0x621/0x640 [ip_tables]
[2021-03-23 21:57:06][3962019.742587] Code: ff ff ff 45 31 d2 48 89 85 50 ff ff ff e9 7e fe ff ff 83 ad 70 ff ff ff 01 48 8b bd 78 ff ff ff 8b 85 70 ff ff ff 4c 8b 34 c7 <41> 0f b7 46 5a 49 01 c6 e9 18 fb ff ff c7 45 a4 00 00 00 00 e9 e9
[2021-03-23 21:57:06][3962019.745295] RSP: 0018:bcd080184c18 EFLAGS: 00010246
[2021-03-23 21:57:06][3962019.745988] RAX:  RBX: 96b7e9b717b0 RCX: dcd07f70a150
[2021-03-23 21:57:06][3962019.747025] RDX: 0001ab6a RSI: 96b7e1d98d80 RDI: 96b7e1d98d80
[2021-03-23 21:57:06][3962019.748015] RBP: bcd080184d08 R08: 0008 R09: 96b82aca2980
[2021-03-23 21:57:06][3962019.748988] R10: 0d38 R11: 96b7cacaa380 R12: 96b833fea000
[2021-03-23 21:57:06][3962019.749927] R13: 96b7ebb4a04e R14: 3f13d7139e6d8ccb R15: 96b7d2c28f00
[2021-03-23 21:57:06][3962019.750873] FS:  () GS:96b83fb0() knlGS:
[2021-03-23 21:57:06][3962019.751919] CS:  0010 DS:  ES:  CR0: 80050033
[2021-03-23 21:57:06][3962019.752720] CR2: 7fee206258f0 CR3: 000f9c428001 CR4: 003606e0
[2021-03-23 21:57:06][3962019.753603] DR0:  DR1:  DR2:
[2021-03-23 21:57:06][3962019.754619] DR3:  DR6: fffe0ff0 DR7: 0400
[2021-03-23 21:57:06][3962019.755606] Call Trace:
[2021-03-23 21:57:06][3962019.755984]  
[2021-03-23 21:57:06][3962019.756282]  iptable_filter_hook+0x1f/0x30 [iptable_filter]
[2021-03-23 21:57:06][3962019.757199]  nf_hook_slow+0x48/0xc0
[2021-03-23 21:57:06][3962019.757753]  ip_forward+0x410/0x480
[2021-03-23 21:57:06][3962019.758350]  ? ip4_key_hashfn+0xc0/0xc0
[2021-03-23 21:57:06][3962019.758907]  ip_rcv_finish+0x84/0xa0
[2021-03-23 21:57:06][3962019.759469]  ip_rcv+0xbc/0xd0
[2021-03-23 21:57:06][3962019.760543]  ? ip_rcv_finish_core.isra.18+0x3b0/0x3b0
[2021-03-23 21:57:06][3962019.761916]  __netif_receive_skb_one_core+0x86/0xa0
[2021-03-23 21:57:06][3962019.763210]  __netif_receive_skb+0x18/0x60
[2021-03-23 21:57:06][3962019.764341]  process_backlog+0xa0/0x170
[2021-03-23 21:57:06][3962019.765600]  net_rx_action+0x140/0x3c0
[2021-03-23 21:57:06][3962019.766550]  __do_softirq+0xe4/0x2da
[2021-03-23 21:57:06][3962019.767792]  irq_exit+0xae/0xb0
[2021-03-23 21:57:06][3962019.768857]  smp_call_function_single_interrupt+0x48/0xd0
[2021-03-23 21:57:06][3962019.770140]  call_function_single_interrupt+0xf/0x20
[2021-03-23 21:57:06][3962019.771315]  
[2021-03-23 21:57:06][3962019.772038] RIP: 0010:native_safe_halt+0x12/0x20
[2021-03-23 21:57:06][3962019.773239] Code: 00 0f 00 2d 32 98 53 00 f4 5d c3 0f 1f 00 66 2e 0f 1f 84 00 00 00 00 00 55 48 89 e5 e9 07 00 00 00 0f 00 2d 12 98 53 00 fb f4 <5d> c3 90 90 90 90 90 90 90 90 90 90 90 90 0f 1f 44 00 00 55 48 89
[2021-03-23 21:57:06][3962019.776646] RSP: 0018:bcd080093e70 EFLAGS: 0246 ORIG_RAX: ff04
[2021-03-23 21:57:06][3962019.778015] RAX: 93ad22f0 RBX: 0004 RCX: 0001
[2021-03-23 21:57:06][3962019.779356] RDX: 96b83fb2ba80 RSI: bcd080093e40 RDI:
[2021-03-23 21:57:06][3962019.780633] RBP: bcd080093e70 R08: 0010e421e97a71a1 R09: 0001
[2021-03-23 21:57:06][3962019.782033] R10: 007e8c00 R11: 0fca3b36 R12: 0004
[2021-03-23 21:57:06][3962019.783373] R13:  R14:  R15:
[2021-03-23 21:57:06][3962019.784725]  ? ldsem_down_write+0x230/0x230
[2021-03-23 21:57:06][3962019.785741]  default_idle+0x22/0x150
[2021-03-23 21:57:06][3962019.786673]  arch_cpu_idle+0x15/0x20
[2021-03-23 21:57:06][3962019.787667]  default_idle_call+0x23/0x30
[2021-03-23 21:57:06][3962019.788803]  do_idle+0x1bd/0x270
[2021-03-23 21:57:06][3962019.789875]  cpu_startup_entry+0x1d/0x20
[2021-03-23 21:57:06][3962019.790990]  start_secondary+0x166/0x1c0
[2021-03-23 
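
The RIP and Call Trace entries in the dmesg above use the kernel's `symbol+0xoffset/0xsize` notation: the offset into the function and the function's total size, both in hex. A leading `?` marks a frame the unwinder could not verify. As a minimal sketch for comparing traces across the affected kernels, the Python below pulls these entries out of a pasted log. The names `FRAME_RE` and `parse_frames` are hypothetical, and the sample is a few lines copied from the trace above.

```python
import re

# Matches kernel symbol references such as "ipt_do_table+0x621/0x640".
# An optional leading "? " marks a frame the unwinder could not verify.
FRAME_RE = re.compile(
    r"(?P<uncertain>\? )?"
    r"(?P<symbol>[A-Za-z_][\w.]*)"
    r"\+0x(?P<offset>[0-9a-f]+)/0x(?P<size>[0-9a-f]+)"
)


def parse_frames(log_text):
    """Return (symbol, offset, size, reliable) tuples in order of appearance."""
    frames = []
    for m in FRAME_RE.finditer(log_text):
        frames.append((
            m.group("symbol"),
            int(m.group("offset"), 16),
            int(m.group("size"), 16),
            m.group("uncertain") is None,
        ))
    return frames


sample = """\
RIP: 0010:ipt_do_table+0x621/0x640 [ip_tables]
 iptable_filter_hook+0x1f/0x30 [iptable_filter]
 nf_hook_slow+0x48/0xc0
 ? ip4_key_hashfn+0xc0/0xc0
"""

for symbol, offset, size, reliable in parse_frames(sample):
    marker = "" if reliable else " (unverified)"
    print(f"{symbol}: byte {offset} of {size}{marker}")
```

Comparing only the symbol names, and ignoring the offsets, is one way to check whether crashes on different kernel builds go through the same code path.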

[Kernel-packages] [Bug 1653498] Re: Server reboots every 4.1 weeks

2020-06-30 Thread chudihuang
Hi,

We have the same issue on our k8s cluster.

Description: Ubuntu 16.04.1 LTS x86_64
Release: 16.04.1

Kernel: 4.4.0-104-generic

The dump file can be downloaded with:
wget http://129.226.115.161/dump.202006231820.tar.gz


I did some analysis, but I have not yet found the root cause.

Load the vmcore in crash (see the download link above). crash should
show details similar to the following:
crash> bt
PID: 11388  TASK: 880eb1f79e00  CPU: 29  COMMAND: "heartbeat"
 #0 [8809131a7b08] machine_kexec at 8105c22b
 #1 [8809131a7b68] crash_kexec at 8110e852
 #2 [8809131a7c38] oops_end at 81031c49
 #3 [8809131a7c60] die at 810320fb
 #4 [8809131a7c90] do_trap at 8102f121
 #5 [8809131a7ce0] do_error_trap at 8102f4a9
 #6 [8809131a7da0] do_invalid_op at 8102fa10
 #7 [8809131a7db0] invalid_op at 8184638e
[exception RIP: __fput+541]
RIP: 812126ad  RSP: 8809131a7e68  RFLAGS: 00010246
RAX:   RBX: 880ef6915700  RCX: 000365fb1705
RDX: 0001  RSI: 880fff55a020  RDI: 
RBP: 8809131a7ea0   R8: 0001a020   R9: 811b591d
R10: ea002b69b300  R11: 880ef6915710  R12: 0010
R13: 880ed152aef8  R14: 8800bba18aa0  R15: 880ed1513a40
ORIG_RAX:   CS: 0010  SS: 0018
 #8 [8809131a7e60] __fput at 812125ac
 #9 [8809131a7ea8] fput at 812126ee
#10 [8809131a7eb8] task_work_run at 8109f101
#11 [8809131a7ef8] exit_to_usermode_loop at 81003242
#12 [8809131a7f30] syscall_return_slowpath at 81003c6e
#13 [8809131a7f50] int_ret_from_sys_call at 818449d0
RIP: 0047f704  RSP: 00c423b77c98  RFLAGS: 0246
RAX:   RBX:   RCX: 0047f704
RDX:   RSI:   RDI: 00ca
RBP: 00c423b77ce0   R8:    R9: 
R10:   R11: 0246  R12: 
R13:   R14: 00c423b78ee0  R15: 0008
ORIG_RAX: 0003  CS: 0033  SS: 002b
crash>
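
For comparing this backtrace with other dumps, the frame lines can be reduced to a list of function names. This is only a sketch, assuming every frame has the form `#N [stack] function at address` as in the output above; `BT_RE` and `bt_functions` are hypothetical names. Note that crash prints the exception offset in decimal (`__fput+541`), which is the same location as the `__fput+0x21d` reported in the kernel log.

```python
import re

# One frame per line, e.g. " #8 [8809131a7e60] __fput at 812125ac".
BT_RE = re.compile(
    r"^\s*#(\d+)\s+\[[0-9a-f]+\]\s+(\S+)\s+at\s+[0-9a-f]+",
    re.MULTILINE,
)


def bt_functions(bt_text):
    """Return the function names of a `crash> bt` listing, innermost first."""
    return [m.group(2) for m in BT_RE.finditer(bt_text)]


sample = """\
 #7 [8809131a7db0] invalid_op at 8184638e
 #8 [8809131a7e60] __fput at 812125ac
 #9 [8809131a7ea8] fput at 812126ee
#10 [8809131a7eb8] task_work_run at 8109f101
"""

print(bt_functions(sample))
print(hex(541))  # crash shows "__fput+541" in decimal; this is 0x21d
```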

crash> log
[19156101.592212] ------------[ cut here ]------------
[19156101.593103] kernel BUG at /build/linux-SwhOyu/linux-4.4.0/include/linux/fs.h:2582!
[19156101.594385] invalid opcode:  [#1] SMP
[19156101.595083] Modules linked in: binfmt_misc af_packet_diag netlink_diag 
dccp_diag dccp tcp_diag udp_diag inet_diag unix_diag veth br_netfilter 
ipt_MASQUERADE nf_nat_masquerade_ipv4 xt_set xt_mark ip_set_hash_ipportnet 
ip_set_hash_ipportip ip_set_bitmap_port ip_set_hash_ipport ip_set dummy 
xt_comment xt_addrtype iptable_nat nf_nat_ipv4 nf_conntrack_ipv4 nf_defrag_ipv4 
ip_vs_sh ip_vs_wrr ip_vs_rr ip_vs xt_tcpudp bridge stp llc nf_conntrack_netlink 
nfnetlink xfrm_user xfrm_algo iptable_filter ip_tables xt_conntrack x_tables 
nf_nat nf_conntrack aufs isofs ppdev crct10dif_pclmul parport_pc crc32_pclmul 
input_leds joydev ghash_clmulni_intel parport serio_raw ib_iser rdma_cm iw_cm 
ib_cm ib_sa ib_mad ib_core ib_addr sunrpc iscsi_tcp libiscsi_tcp libiscsi 
scsi_transport_iscsi autofs4 btrfs raid10 raid456 async_raid6_recov
[19156101.606434]  async_memcpy async_pq async_xor async_tx xor raid6_pq 
libcrc32c raid1 raid0 multipath linear aesni_intel aes_x86_64 glue_helper lrw 
gf128mul ablk_helper cryptd psmouse floppy
[19156101.609129] CPU: 29 PID: 11388 Comm: heartbeat Not tainted 
4.4.0-104-generic #127-Ubuntu
[19156101.610384] Hardware name: Smdbmds KVM, BIOS 
seabios-1.9.1-qemu-project.org 04/01/2014
[19156101.611637] task: 880eb1f79e00 ti: 8809131a4000 task.ti: 
8809131a4000
[19156101.612905] RIP: 0010:[]  [] 
__fput+0x21d/0x220
[19156101.614188] RSP: 0018:8809131a7e68  EFLAGS: 00010246
[19156101.614989] RAX:  RBX: 880ef6915700 RCX: 
000365fb1705
[19156101.616143] RDX: 0001 RSI: 880fff55a020 RDI: 

[19156101.617285] RBP: 8809131a7ea0 R08: 0001a020 R09: 
811b591d
[19156101.618422] R10: ea002b69b300 R11: 880ef6915710 R12: 
0010
[19156101.619574] R13: 880ed152aef8 R14: 8800bba18aa0 R15: 
880ed1513a40
[19156101.620785] FS:  00c42085bc90() GS:880fff54() 
knlGS:
[19156101.622074] CS:  0010 DS:  ES:  CR0: 80050033
[19156101.622921] CR2: 7f508b166b04 CR3: 000e0981b000 CR4: 
003406e0
[19156101.624062] DR0:  DR1:  DR2: 

[19156101.625210] DR3:  DR6: fffe0ff0 DR7: 
0400
[19156101.626349] Stack:
[19156101.626765]  880ed152aef8 880ef6915710 880eb1f79e00 
8210ad50
[19156101.628018]  880ef6915700  880eb1f7a4a0 
8809131a7eb0
[19156101.629305]  812126ee 

[Kernel-packages] [Bug 1653498] Re: Server reboots every 4.1 weeks

2020-06-30 Thread chudihuang
Hello,

We have the same issue on our k8s cluster.

Description: Ubuntu 16.04.1 LTS x86_64
Release: 16.04.1


Kernel: 4.4.0-104-generic

[19156101.592212] ------------[ cut here ]------------
[19156101.593103] kernel BUG at /build/linux-SwhOyu/linux-4.4.0/include/linux/fs.h:2582!
[19156101.594385] invalid opcode:  [#1] SMP
[19156101.595083] Modules linked in: binfmt_misc af_packet_diag netlink_diag 
dccp_diag dccp tcp_diag udp_diag inet_diag unix_diag veth br_netfilter 
ipt_MASQUERADE nf_nat_masquerade_ipv4 xt_set xt_mark ip_set_hash_ipportnet 
ip_set_hash_ipportip ip_set_bitmap_port ip_set_hash_ipport ip_set dummy 
xt_comment xt_addrtype iptable_nat nf_nat_ipv4 nf_conntrack_ipv4 nf_defrag_ipv4 
ip_vs_sh ip_vs_wrr ip_vs_rr ip_vs xt_tcpudp bridge stp llc nf_conntrack_netlink 
nfnetlink xfrm_user xfrm_algo iptable_filter ip_tables xt_conntrack x_tables 
nf_nat nf_conntrack aufs isofs ppdev crct10dif_pclmul parport_pc crc32_pclmul 
input_leds joydev ghash_clmulni_intel parport serio_raw ib_iser rdma_cm iw_cm 
ib_cm ib_sa ib_mad ib_core ib_addr sunrpc iscsi_tcp libiscsi_tcp libiscsi 
scsi_transport_iscsi autofs4 btrfs raid10 raid456 async_raid6_recov
[19156101.606434]  async_memcpy async_pq async_xor async_tx xor raid6_pq 
libcrc32c raid1 raid0 multipath linear aesni_intel aes_x86_64 glue_helper lrw 
gf128mul ablk_helper cryptd psmouse floppy
[19156101.609129] CPU: 29 PID: 11388 Comm: heartbeat Not tainted 
4.4.0-104-generic #127-Ubuntu
[19156101.610384] Hardware name: Smdbmds KVM, BIOS 
seabios-1.9.1-qemu-project.org 04/01/2014
[19156101.611637] task: 880eb1f79e00 ti: 8809131a4000 task.ti: 
8809131a4000
[19156101.612905] RIP: 0010:[]  [] 
__fput+0x21d/0x220
[19156101.614188] RSP: 0018:8809131a7e68  EFLAGS: 00010246
[19156101.614989] RAX:  RBX: 880ef6915700 RCX: 
000365fb1705
[19156101.616143] RDX: 0001 RSI: 880fff55a020 RDI: 

[19156101.617285] RBP: 8809131a7ea0 R08: 0001a020 R09: 
811b591d
[19156101.618422] R10: ea002b69b300 R11: 880ef6915710 R12: 
0010
[19156101.619574] R13: 880ed152aef8 R14: 8800bba18aa0 R15: 
880ed1513a40
[19156101.620785] FS:  00c42085bc90() GS:880fff54() 
knlGS:
[19156101.622074] CS:  0010 DS:  ES:  CR0: 80050033
[19156101.622921] CR2: 7f508b166b04 CR3: 000e0981b000 CR4: 
003406e0
[19156101.624062] DR0:  DR1:  DR2: 

[19156101.625210] DR3:  DR6: fffe0ff0 DR7: 
0400
[19156101.626349] Stack:
[19156101.626765]  880ed152aef8 880ef6915710 880eb1f79e00 
8210ad50
[19156101.628018]  880ef6915700  880eb1f7a4a0 
8809131a7eb0
[19156101.629305]  812126ee 8809131a7ef0 8109f101 
880eb1f7a4d4
[19156101.630568] Call Trace:
[19156101.631036]  [] fput+0xe/0x10
[19156101.631804]  [] task_work_run+0x81/0xa0
[19156101.632612]  [] exit_to_usermode_loop+0xc2/0xd0
[19156101.633497]  [] syscall_return_slowpath+0x4e/0x60
[19156101.634402]  [] int_ret_from_sys_call+0x25/0x8f
[19156101.635285] Code: 0f 84 cf fe ff ff 48 8b 43 28 48 8b 80 80 00 00 00 48 
85 c0 0f 84 bb fe ff ff 31 d2 48 89 de bf ff ff ff ff ff d0 e9 aa fe ff ff <0f> 
0b 90 0f 1f 44 00 00 31 ff 48 87 3d 8a 6e fc 00 48 85 ff 74
[19156101.639244] RIP  [] __fput+0x21d/0x220
[19156101.640049]  RSP 
crash>


Does this issue also exist on Ubuntu 18.04?

-- 
You received this bug notification because you are a member of Kernel
Packages, which is subscribed to linux in Ubuntu.
https://bugs.launchpad.net/bugs/1653498


[Kernel-packages] [Bug 1653498] Re: Server reboots every 4.1 weeks

2020-06-30 Thread chudihuang
Hi, does this issue also exist on Ubuntu 18.04?


Title:
  Server reboots every 4.1 weeks

Status in linux package in Ubuntu:
  Confirmed
Status in linux source package in Xenial:
  Confirmed

Bug description:
  Roughly every 4.1 weeks of uptime, our 34 servers reboot with the
  logs below.

  Description: Ubuntu 16.04.1 LTS
  Release: 16.04

  The servers host the following stack: flanneld (0.5.5), docker (1.11.2,
  build b9f10c9), kubernetes (v1.3.6), and etcd (2.3.7).


  Jan 02 06:40:32 prd-node021 kernel: ------------[ cut here ]------------
  Jan 02 06:40:32 prd-node021 kernel: kernel BUG at /build/linux-xHzv4a/linux-4.4.0/include/linux/fs.h:2569!
  Jan 02 06:40:32 prd-node021 kernel: invalid opcode:  [#1] SMP
  Jan 02 06:40:32 prd-node021 kernel: Modules linked in: nf_conntrack_netlink 
nfnetlink veth xt_statistic xt_nat xt_recent ipt_REJECT nf_reject_ipv4 
xt_tcpudp tcp_diag inet_diag
  Jan 02 06:40:32 prd-node021 kernel:  raid456 async_raid6_recov async_memcpy 
async_pq async_xor async_tx xor raid6_pq libcrc32c raid1 raid0 multipath linear 
crct10dif_pclmul crc
  Jan 02 06:40:32 prd-node021 kernel: CPU: 46 PID: 22749 Comm: iptables-restor 
Not tainted 4.4.0-47-generic #68-Ubuntu
  Jan 02 06:40:32 prd-node021 kernel: Hardware name: HP ProLiant DL360 
Gen9/ProLiant DL360 Gen9, BIOS P89 09/13/2016
  Jan 02 06:40:32 prd-node021 kernel: task: 882f7ccb44c0 ti: 
882fcb81 task.ti: 882fcb81
  Jan 02 06:40:32 prd-node021 kernel: RIP: 0010:[]  
[] __fput+0x21d/0x220
  Jan 02 06:40:32 prd-node021 kernel: RSP: 0018:882fcb813e68  EFLAGS: 
00010246
  Jan 02 06:40:32 prd-node021 kernel: RAX:  RBX: 
8829dbd92700 RCX: 308c6396
  Jan 02 06:40:32 prd-node021 kernel: RDX: 0001 RSI: 
88301f299f60 RDI: 
  Jan 02 06:40:32 prd-node021 kernel: RBP: 882fcb813ea0 R08: 
00019f60 R09: 811b3a1d
  Jan 02 06:40:32 prd-node021 kernel: R10: ea0064756680 R11: 
8829dbd92710 R12: 0010
  Jan 02 06:40:32 prd-node021 kernel: R13: 8817d3520518 R14: 
881021a34da0 R15: 8817d3542540
  Jan 02 06:40:32 prd-node021 kernel: FS:  7fcdaa753700() 
GS:88301f28() knlGS:
  Jan 02 06:40:32 prd-node021 kernel: CS:  0010 DS:  ES:  CR0: 
80050033
  Jan 02 06:40:32 prd-node021 kernel: CR2: 7fcdaa758000 CR3: 
001917993000 CR4: 003406e0
  Jan 02 06:40:32 prd-node021 kernel: DR0:  DR1: 
 DR2: 
  Jan 02 06:40:32 prd-node021 kernel: DR3:  DR6: 
fffe0ff0 DR7: 0400
  Jan 02 06:40:32 prd-node021 kernel: Stack:
  Jan 02 06:40:32 prd-node021 kernel:  8817d3520518 8829dbd92710 
882f7ccb44c0 82103a30
  Jan 02 06:40:32 prd-node021 kernel:  8829dbd92700  
882f7ccb4b38 882fcb813eb0
  Jan 02 06:40:32 prd-node021 kernel:  8120fa2e 882fcb813ef0 
8109ee01 882f7ccb4b6c
  Jan 02 06:40:32 prd-node021 kernel: Call Trace:
  Jan 02 06:40:32 prd-node021 kernel:  [] fput+0xe/0x10
  Jan 02 06:40:32 prd-node021 kernel:  [] 
task_work_run+0x81/0xa0
  Jan 02 06:40:32 prd-node021 kernel:  [] 
exit_to_usermode_loop+0xc2/0xd0
  Jan 02 06:40:32 prd-node021 kernel:  [] 
syscall_return_slowpath+0x4e/0x60
  Jan 02 06:40:32 prd-node021 kernel:  [] 
int_ret_from_sys_call+0x25/0x8f
  Jan 02 06:40:32 prd-node021 kernel: Code: 0f 84 cf fe ff ff 48 8b 43 28 48 8b 
80 80 00 00 00 48 85 c0 0f 84 bb fe ff ff 31 d2 48 89 de bf ff ff ff ff ff d0 
e9 aa fe ff ff <0f>
  Jan 02 06:40:32 prd-node021 kernel: RIP  [] 
__fput+0x21d/0x220
  -- Reboot --
  Jan 02 06:42:56 prd-node021 systemd-journald[819]: Runtime journal 
(/run/log/journal/) is 8.0M, max 1.8G, 1.8G free.
  Jan 02 06:42:56 prd-node021 kernel: Initializing cgroup subsys cpuset

  ProblemType: Bug
  DistroRelease: Ubuntu 16.04
  Package: linux-image-4.4.0-47-generic 4.4.0-47.68
  ProcVersionSignature: Ubuntu 4.4.0-47.68-generic 4.4.24
  Uname: Linux 4.4.0-47-generic x86_64
  AlsaDevices:
   total 0
   crw-rw 1 root audio 116,  1 Jan  2 06:42 seq
   crw-rw 1 root audio 116, 33 Jan  2 06:42 timer
  AplayDevices: Error: [Errno 2] No such file or directory: 'aplay'
  ApportVersion: 2.20.1-0ubuntu2.1
  Architecture: amd64
  ArecordDevices: Error: [Errno 2] No such file or directory: 'arecord'
  AudioDevicesInUse: Error: command ['fuser', '-v', '/dev/snd/seq', 
'/dev/snd/timer'] failed with exit code 1:
  Date: Mon Jan  2 10:49:15 2017
  HibernationDevice: RESUME=/dev/mapper/vg00-swap
  InstallationDate: Installed on 2016-09-13 (110 days ago)
  InstallationMedia: Ubuntu-Server 16.04.1 LTS "Xenial Xerus" - Beta amd64 
(20160803)
  IwConfig: Error: [Errno 2] No such file or directory: 'iwconfig'
  MachineType: HP ProLiant DL360 
