[Kernel-packages] [Bug 1702665] Re: 4.4.0-83-generic + Docker + EC2 frequently crashes at cgroup_rmdir GPF

2017-07-13 Thread sorah
Not seeing crashes at 4.4.76-040476-generic for these few days.

-- 
You received this bug notification because you are a member of Kernel
Packages, which is subscribed to linux in Ubuntu.
https://bugs.launchpad.net/bugs/1702665

Title:
  4.4.0-83-generic + Docker + EC2 frequently crashes at cgroup_rmdir GPF

Status in docker package in Ubuntu:
  Confirmed
Status in linux package in Ubuntu:
  Confirmed

Bug description:
  We run xenial-based Docker container hosts on EC2 with Amazon ECS.
  Recently we refreshed our base image, we started to see frequent
  panic.

  Hosts run Amazon ECS Agent, and the agent automatically creates or
  destroys Docker container based on requests onto ECS cluster.

  I think this crash is caused by Docker-related operations, because
  crashing at cgroups.

  Also, we're running several different cluster with another EC2
  instance types, using same image. This problem is only reproducing at
  t2.small instances. (We also run c4.large and m4.* clusters)

  Our previous image ran 4.4.0-79-generic, and we see no problem with
  79.

  [30558.783899] general protection fault:  [#1] SMP 
  [30558.784056] Modules linked in: veth binfmt_misc xt_nat xt_comment 
xt_tcpudp ipt_MASQUERADE nf_nat_masquerade_ipv4 iptable_nat nf_conntrack_ipv4 
nf_defrag_ipv4 nf_nat_ipv4 xt_addrtype iptable_filter ip_tables xt_conntrack 
x_tables nf_nat nf_conntrack br_netfilter bridge stp llc isofs ppdev input_leds 
serio_raw parport_pc parport autofs4 btrfs xor raid6_pq crct10dif_pclmul 
crc32_pclmul ghash_clmulni_intel aesni_intel aes_x86_64 lrw gf128mul 
glue_helper ablk_helper cryptd psmouse floppy
  [30558.784056] CPU: 0 PID: 1 Comm: systemd Not tainted 4.4.0-83-generic 
#106-Ubuntu
  [30558.784056] Hardware name: Xen HVM domU, BIOS 4.2.amazon 02/16/2017
  [30558.784056] task: 88007c4e8000 ti: 88007c4e4000 task.ti: 
88007c4e4000
  [30558.784056] RIP: 0010:[]  [] 
cgroup_destroy_locked+0x5f/0xf0
  [30558.784056] RSP: 0018:88007c4e7e40  EFLAGS: 00010212
  [30558.784056] RAX: 8800114481bd RBX: 88002827ba50 RCX: 
88007ab8d150
  [30558.784056] RDX: 00111e7e0088 RSI: 88002827ba54 RDI: 
8217745c
  [30558.784056] RBP: 88007c4e7e60 R08: 0020 R09: 
88007c4e7e70
  [30558.784056] R10: 0637760b R11: 880011829a80 R12: 
88007ab8d000
  [30558.784056] R13:  R14: 559b48b2dcc0 R15: 
ff9c
  [30558.784056] FS:  7f29cf0db8c0() GS:88007d20() 
knlGS:
  [30558.784056] CS:  0010 DS:  ES:  CR0: 80050033
  [30558.784056] CR2: 7fa466d58180 CR3: 7c19 CR4: 
001406f0
  [30558.784056] Stack:
  [30558.784056]  88002827ba50 88002827ba50 8800373e70d0 
559b48b2dcc0
  [30558.784056]  88007c4e7e80 811194b3 88002827ba50 

  [30558.784056]  88007c4e7ea0 8128ddcd 880011829a80 

  [30558.784056] Call Trace:
  [30558.784056]  [] cgroup_rmdir+0x23/0x40
  [30558.784056]  [] kernfs_iop_rmdir+0x4d/0x80
  [30558.784056]  [] vfs_rmdir+0xb4/0x130
  [30558.784056]  [] do_rmdir+0x1df/0x200
  [30558.784056]  [] SyS_rmdir+0x16/0x20
  [30558.784056]  [] entry_SYSCALL_64_fastpath+0x16/0x71
  [30558.784056] Code: 74 fd 48 c7 c7 5c 74 17 82 e8 8e 72 72 00 49 8b 94 24 50 
01 00 00 49 8d 8c 24 50 01 00 00 48 39 d1 48 8d 42 f0 74 18 48 8b 50 08  82 
b0 01 00 00 01 48 8b 50 10 48 39 d1 48 8d 42 f0 75 e8 49 
  [30558.784056] RIP  [] cgroup_destroy_locked+0x5f/0xf0
  [30558.784056]  RSP 
  [30558.960828] ---[ end trace 7634e03ff94e8934 ]---
  [30558.964811] Kernel panic - not syncing: Fatal exception in interrupt
  [30558.968805] Kernel Offset: disabled

  ProblemType: Bug
  DistroRelease: Ubuntu 16.04
  Package: linux-image-4.4.0-83-generic 4.4.0-83.106
  ProcVersionSignature: Ubuntu 4.4.0-83.106-generic 4.4.70
  Uname: Linux 4.4.0-83-generic x86_64
  AlsaDevices:
   total 0
   crw-rw 1 root audio 116,  1 Jul  6 10:22 seq
   crw-rw 1 root audio 116, 33 Jul  6 10:22 timer
  AplayDevices: Error: [Errno 2] No such file or directory: 'aplay'
  ApportVersion: 2.20.1-0ubuntu2.6
  Architecture: amd64
  ArecordDevices: Error: [Errno 2] No such file or directory: 'arecord'
  AudioDevicesInUse: Error: command ['fuser', '-v', '/dev/snd/seq', 
'/dev/snd/timer'] failed with exit code 1:
  CRDA: N/A
  Date: Thu Jul  6 10:27:37 2017
  Ec2AMI: ami-34100353
  Ec2AMIManifest: (unknown)
  Ec2AvailabilityZone: ap-northeast-1c
  Ec2InstanceType: t2.small
  Ec2Kernel: unavailable
  Ec2Ramdisk: unavailable
  IwConfig: Error: [Errno 2] No such file or directory: 'iwconfig'
  Lsusb: Error: command ['lsusb'] failed with exit code 1:
  MachineType: Xen HVM domU
  PciMultimedia:
   
  ProcEnviron:
   SHELL=/bin/bash
   TERM=screen-256color
   PATH=(custom, no user)
   LANG=en_US.UTF-8
  ProcFB:
   
  ProcKernelCmdLine: BOOT_IMAGE=/boot/vmlinuz-4.4.0-83-generic 
root=UUID=f76be987-234f-4071-87d4-06318cfc2135 ro 

[Kernel-packages] [Bug 1702665] Re: 4.4.0-83-generic + Docker + EC2 frequently crashes at cgroup_rmdir GPF

2017-07-11 Thread Launchpad Bug Tracker
Status changed to 'Confirmed' because the bug affects multiple users.

** Changed in: docker (Ubuntu)
   Status: New => Confirmed

-- 
You received this bug notification because you are a member of Kernel
Packages, which is subscribed to linux in Ubuntu.
https://bugs.launchpad.net/bugs/1702665

Title:
  4.4.0-83-generic + Docker + EC2 frequently crashes at cgroup_rmdir GPF

Status in docker package in Ubuntu:
  Confirmed
Status in linux package in Ubuntu:
  Confirmed

Bug description:
  We run xenial-based Docker container hosts on EC2 with Amazon ECS.
  Recently we refreshed our base image, we started to see frequent
  panic.

  Hosts run Amazon ECS Agent, and the agent automatically creates or
  destroys Docker container based on requests onto ECS cluster.

  I think this crash is caused by Docker-related operations, because
  crashing at cgroups.

  Also, we're running several different cluster with another EC2
  instance types, using same image. This problem is only reproducing at
  t2.small instances. (We also run c4.large and m4.* clusters)

  Our previous image ran 4.4.0-79-generic, and we see no problem with
  79.

  [30558.783899] general protection fault:  [#1] SMP 
  [30558.784056] Modules linked in: veth binfmt_misc xt_nat xt_comment 
xt_tcpudp ipt_MASQUERADE nf_nat_masquerade_ipv4 iptable_nat nf_conntrack_ipv4 
nf_defrag_ipv4 nf_nat_ipv4 xt_addrtype iptable_filter ip_tables xt_conntrack 
x_tables nf_nat nf_conntrack br_netfilter bridge stp llc isofs ppdev input_leds 
serio_raw parport_pc parport autofs4 btrfs xor raid6_pq crct10dif_pclmul 
crc32_pclmul ghash_clmulni_intel aesni_intel aes_x86_64 lrw gf128mul 
glue_helper ablk_helper cryptd psmouse floppy
  [30558.784056] CPU: 0 PID: 1 Comm: systemd Not tainted 4.4.0-83-generic 
#106-Ubuntu
  [30558.784056] Hardware name: Xen HVM domU, BIOS 4.2.amazon 02/16/2017
  [30558.784056] task: 88007c4e8000 ti: 88007c4e4000 task.ti: 
88007c4e4000
  [30558.784056] RIP: 0010:[]  [] 
cgroup_destroy_locked+0x5f/0xf0
  [30558.784056] RSP: 0018:88007c4e7e40  EFLAGS: 00010212
  [30558.784056] RAX: 8800114481bd RBX: 88002827ba50 RCX: 
88007ab8d150
  [30558.784056] RDX: 00111e7e0088 RSI: 88002827ba54 RDI: 
8217745c
  [30558.784056] RBP: 88007c4e7e60 R08: 0020 R09: 
88007c4e7e70
  [30558.784056] R10: 0637760b R11: 880011829a80 R12: 
88007ab8d000
  [30558.784056] R13:  R14: 559b48b2dcc0 R15: 
ff9c
  [30558.784056] FS:  7f29cf0db8c0() GS:88007d20() 
knlGS:
  [30558.784056] CS:  0010 DS:  ES:  CR0: 80050033
  [30558.784056] CR2: 7fa466d58180 CR3: 7c19 CR4: 
001406f0
  [30558.784056] Stack:
  [30558.784056]  88002827ba50 88002827ba50 8800373e70d0 
559b48b2dcc0
  [30558.784056]  88007c4e7e80 811194b3 88002827ba50 

  [30558.784056]  88007c4e7ea0 8128ddcd 880011829a80 

  [30558.784056] Call Trace:
  [30558.784056]  [] cgroup_rmdir+0x23/0x40
  [30558.784056]  [] kernfs_iop_rmdir+0x4d/0x80
  [30558.784056]  [] vfs_rmdir+0xb4/0x130
  [30558.784056]  [] do_rmdir+0x1df/0x200
  [30558.784056]  [] SyS_rmdir+0x16/0x20
  [30558.784056]  [] entry_SYSCALL_64_fastpath+0x16/0x71
  [30558.784056] Code: 74 fd 48 c7 c7 5c 74 17 82 e8 8e 72 72 00 49 8b 94 24 50 
01 00 00 49 8d 8c 24 50 01 00 00 48 39 d1 48 8d 42 f0 74 18 48 8b 50 08  82 
b0 01 00 00 01 48 8b 50 10 48 39 d1 48 8d 42 f0 75 e8 49 
  [30558.784056] RIP  [] cgroup_destroy_locked+0x5f/0xf0
  [30558.784056]  RSP 
  [30558.960828] ---[ end trace 7634e03ff94e8934 ]---
  [30558.964811] Kernel panic - not syncing: Fatal exception in interrupt
  [30558.968805] Kernel Offset: disabled

  ProblemType: Bug
  DistroRelease: Ubuntu 16.04
  Package: linux-image-4.4.0-83-generic 4.4.0-83.106
  ProcVersionSignature: Ubuntu 4.4.0-83.106-generic 4.4.70
  Uname: Linux 4.4.0-83-generic x86_64
  AlsaDevices:
   total 0
   crw-rw 1 root audio 116,  1 Jul  6 10:22 seq
   crw-rw 1 root audio 116, 33 Jul  6 10:22 timer
  AplayDevices: Error: [Errno 2] No such file or directory: 'aplay'
  ApportVersion: 2.20.1-0ubuntu2.6
  Architecture: amd64
  ArecordDevices: Error: [Errno 2] No such file or directory: 'arecord'
  AudioDevicesInUse: Error: command ['fuser', '-v', '/dev/snd/seq', 
'/dev/snd/timer'] failed with exit code 1:
  CRDA: N/A
  Date: Thu Jul  6 10:27:37 2017
  Ec2AMI: ami-34100353
  Ec2AMIManifest: (unknown)
  Ec2AvailabilityZone: ap-northeast-1c
  Ec2InstanceType: t2.small
  Ec2Kernel: unavailable
  Ec2Ramdisk: unavailable
  IwConfig: Error: [Errno 2] No such file or directory: 'iwconfig'
  Lsusb: Error: command ['lsusb'] failed with exit code 1:
  MachineType: Xen HVM domU
  PciMultimedia:
   
  ProcEnviron:
   SHELL=/bin/bash
   TERM=screen-256color
   PATH=(custom, no user)
   LANG=en_US.UTF-8
  ProcFB:
   
  ProcKernelCmdLine: 

[Kernel-packages] [Bug 1702665] Re: 4.4.0-83-generic + Docker + EC2 frequently crashes at cgroup_rmdir GPF

2017-07-11 Thread sorah
Sure. Upgraded one of an instance running 4.4.76-040476-generic, aside
of existing 4.12 instance.

-- 
You received this bug notification because you are a member of Kernel
Packages, which is subscribed to linux in Ubuntu.
https://bugs.launchpad.net/bugs/1702665

Title:
  4.4.0-83-generic + Docker + EC2 frequently crashes at cgroup_rmdir GPF

Status in docker package in Ubuntu:
  New
Status in linux package in Ubuntu:
  Confirmed

Bug description:
  We run xenial-based Docker container hosts on EC2 with Amazon ECS.
  Recently we refreshed our base image, we started to see frequent
  panic.

  Hosts run Amazon ECS Agent, and the agent automatically creates or
  destroys Docker container based on requests onto ECS cluster.

  I think this crash is caused by Docker-related operations, because
  crashing at cgroups.

  Also, we're running several different cluster with another EC2
  instance types, using same image. This problem is only reproducing at
  t2.small instances. (We also run c4.large and m4.* clusters)

  Our previous image ran 4.4.0-79-generic, and we see no problem with
  79.

  [30558.783899] general protection fault:  [#1] SMP 
  [30558.784056] Modules linked in: veth binfmt_misc xt_nat xt_comment 
xt_tcpudp ipt_MASQUERADE nf_nat_masquerade_ipv4 iptable_nat nf_conntrack_ipv4 
nf_defrag_ipv4 nf_nat_ipv4 xt_addrtype iptable_filter ip_tables xt_conntrack 
x_tables nf_nat nf_conntrack br_netfilter bridge stp llc isofs ppdev input_leds 
serio_raw parport_pc parport autofs4 btrfs xor raid6_pq crct10dif_pclmul 
crc32_pclmul ghash_clmulni_intel aesni_intel aes_x86_64 lrw gf128mul 
glue_helper ablk_helper cryptd psmouse floppy
  [30558.784056] CPU: 0 PID: 1 Comm: systemd Not tainted 4.4.0-83-generic 
#106-Ubuntu
  [30558.784056] Hardware name: Xen HVM domU, BIOS 4.2.amazon 02/16/2017
  [30558.784056] task: 88007c4e8000 ti: 88007c4e4000 task.ti: 
88007c4e4000
  [30558.784056] RIP: 0010:[]  [] 
cgroup_destroy_locked+0x5f/0xf0
  [30558.784056] RSP: 0018:88007c4e7e40  EFLAGS: 00010212
  [30558.784056] RAX: 8800114481bd RBX: 88002827ba50 RCX: 
88007ab8d150
  [30558.784056] RDX: 00111e7e0088 RSI: 88002827ba54 RDI: 
8217745c
  [30558.784056] RBP: 88007c4e7e60 R08: 0020 R09: 
88007c4e7e70
  [30558.784056] R10: 0637760b R11: 880011829a80 R12: 
88007ab8d000
  [30558.784056] R13:  R14: 559b48b2dcc0 R15: 
ff9c
  [30558.784056] FS:  7f29cf0db8c0() GS:88007d20() 
knlGS:
  [30558.784056] CS:  0010 DS:  ES:  CR0: 80050033
  [30558.784056] CR2: 7fa466d58180 CR3: 7c19 CR4: 
001406f0
  [30558.784056] Stack:
  [30558.784056]  88002827ba50 88002827ba50 8800373e70d0 
559b48b2dcc0
  [30558.784056]  88007c4e7e80 811194b3 88002827ba50 

  [30558.784056]  88007c4e7ea0 8128ddcd 880011829a80 

  [30558.784056] Call Trace:
  [30558.784056]  [] cgroup_rmdir+0x23/0x40
  [30558.784056]  [] kernfs_iop_rmdir+0x4d/0x80
  [30558.784056]  [] vfs_rmdir+0xb4/0x130
  [30558.784056]  [] do_rmdir+0x1df/0x200
  [30558.784056]  [] SyS_rmdir+0x16/0x20
  [30558.784056]  [] entry_SYSCALL_64_fastpath+0x16/0x71
  [30558.784056] Code: 74 fd 48 c7 c7 5c 74 17 82 e8 8e 72 72 00 49 8b 94 24 50 
01 00 00 49 8d 8c 24 50 01 00 00 48 39 d1 48 8d 42 f0 74 18 48 8b 50 08  82 
b0 01 00 00 01 48 8b 50 10 48 39 d1 48 8d 42 f0 75 e8 49 
  [30558.784056] RIP  [] cgroup_destroy_locked+0x5f/0xf0
  [30558.784056]  RSP 
  [30558.960828] ---[ end trace 7634e03ff94e8934 ]---
  [30558.964811] Kernel panic - not syncing: Fatal exception in interrupt
  [30558.968805] Kernel Offset: disabled

  ProblemType: Bug
  DistroRelease: Ubuntu 16.04
  Package: linux-image-4.4.0-83-generic 4.4.0-83.106
  ProcVersionSignature: Ubuntu 4.4.0-83.106-generic 4.4.70
  Uname: Linux 4.4.0-83-generic x86_64
  AlsaDevices:
   total 0
   crw-rw 1 root audio 116,  1 Jul  6 10:22 seq
   crw-rw 1 root audio 116, 33 Jul  6 10:22 timer
  AplayDevices: Error: [Errno 2] No such file or directory: 'aplay'
  ApportVersion: 2.20.1-0ubuntu2.6
  Architecture: amd64
  ArecordDevices: Error: [Errno 2] No such file or directory: 'arecord'
  AudioDevicesInUse: Error: command ['fuser', '-v', '/dev/snd/seq', 
'/dev/snd/timer'] failed with exit code 1:
  CRDA: N/A
  Date: Thu Jul  6 10:27:37 2017
  Ec2AMI: ami-34100353
  Ec2AMIManifest: (unknown)
  Ec2AvailabilityZone: ap-northeast-1c
  Ec2InstanceType: t2.small
  Ec2Kernel: unavailable
  Ec2Ramdisk: unavailable
  IwConfig: Error: [Errno 2] No such file or directory: 'iwconfig'
  Lsusb: Error: command ['lsusb'] failed with exit code 1:
  MachineType: Xen HVM domU
  PciMultimedia:
   
  ProcEnviron:
   SHELL=/bin/bash
   TERM=screen-256color
   PATH=(custom, no user)
   LANG=en_US.UTF-8
  ProcFB:
   
  ProcKernelCmdLine: BOOT_IMAGE=/boot/vmlinuz-4.4.0-83-generic 

[Kernel-packages] [Bug 1702665] Re: 4.4.0-83-generic + Docker + EC2 frequently crashes at cgroup_rmdir GPF

2017-07-11 Thread sorah
Clarify: Upgraded one of an instance running 4.4.0-83 to
4.4.76-040476-generic, *

-- 
You received this bug notification because you are a member of Kernel
Packages, which is subscribed to linux in Ubuntu.
https://bugs.launchpad.net/bugs/1702665

Title:
  4.4.0-83-generic + Docker + EC2 frequently crashes at cgroup_rmdir GPF

Status in docker package in Ubuntu:
  New
Status in linux package in Ubuntu:
  Confirmed

Bug description:
  We run xenial-based Docker container hosts on EC2 with Amazon ECS.
  Recently we refreshed our base image, we started to see frequent
  panic.

  Hosts run Amazon ECS Agent, and the agent automatically creates or
  destroys Docker container based on requests onto ECS cluster.

  I think this crash is caused by Docker-related operations, because
  crashing at cgroups.

  Also, we're running several different cluster with another EC2
  instance types, using same image. This problem is only reproducing at
  t2.small instances. (We also run c4.large and m4.* clusters)

  Our previous image ran 4.4.0-79-generic, and we see no problem with
  79.

  [30558.783899] general protection fault:  [#1] SMP 
  [30558.784056] Modules linked in: veth binfmt_misc xt_nat xt_comment 
xt_tcpudp ipt_MASQUERADE nf_nat_masquerade_ipv4 iptable_nat nf_conntrack_ipv4 
nf_defrag_ipv4 nf_nat_ipv4 xt_addrtype iptable_filter ip_tables xt_conntrack 
x_tables nf_nat nf_conntrack br_netfilter bridge stp llc isofs ppdev input_leds 
serio_raw parport_pc parport autofs4 btrfs xor raid6_pq crct10dif_pclmul 
crc32_pclmul ghash_clmulni_intel aesni_intel aes_x86_64 lrw gf128mul 
glue_helper ablk_helper cryptd psmouse floppy
  [30558.784056] CPU: 0 PID: 1 Comm: systemd Not tainted 4.4.0-83-generic 
#106-Ubuntu
  [30558.784056] Hardware name: Xen HVM domU, BIOS 4.2.amazon 02/16/2017
  [30558.784056] task: 88007c4e8000 ti: 88007c4e4000 task.ti: 
88007c4e4000
  [30558.784056] RIP: 0010:[]  [] 
cgroup_destroy_locked+0x5f/0xf0
  [30558.784056] RSP: 0018:88007c4e7e40  EFLAGS: 00010212
  [30558.784056] RAX: 8800114481bd RBX: 88002827ba50 RCX: 
88007ab8d150
  [30558.784056] RDX: 00111e7e0088 RSI: 88002827ba54 RDI: 
8217745c
  [30558.784056] RBP: 88007c4e7e60 R08: 0020 R09: 
88007c4e7e70
  [30558.784056] R10: 0637760b R11: 880011829a80 R12: 
88007ab8d000
  [30558.784056] R13:  R14: 559b48b2dcc0 R15: 
ff9c
  [30558.784056] FS:  7f29cf0db8c0() GS:88007d20() 
knlGS:
  [30558.784056] CS:  0010 DS:  ES:  CR0: 80050033
  [30558.784056] CR2: 7fa466d58180 CR3: 7c19 CR4: 
001406f0
  [30558.784056] Stack:
  [30558.784056]  88002827ba50 88002827ba50 8800373e70d0 
559b48b2dcc0
  [30558.784056]  88007c4e7e80 811194b3 88002827ba50 

  [30558.784056]  88007c4e7ea0 8128ddcd 880011829a80 

  [30558.784056] Call Trace:
  [30558.784056]  [] cgroup_rmdir+0x23/0x40
  [30558.784056]  [] kernfs_iop_rmdir+0x4d/0x80
  [30558.784056]  [] vfs_rmdir+0xb4/0x130
  [30558.784056]  [] do_rmdir+0x1df/0x200
  [30558.784056]  [] SyS_rmdir+0x16/0x20
  [30558.784056]  [] entry_SYSCALL_64_fastpath+0x16/0x71
  [30558.784056] Code: 74 fd 48 c7 c7 5c 74 17 82 e8 8e 72 72 00 49 8b 94 24 50 
01 00 00 49 8d 8c 24 50 01 00 00 48 39 d1 48 8d 42 f0 74 18 48 8b 50 08  82 
b0 01 00 00 01 48 8b 50 10 48 39 d1 48 8d 42 f0 75 e8 49 
  [30558.784056] RIP  [] cgroup_destroy_locked+0x5f/0xf0
  [30558.784056]  RSP 
  [30558.960828] ---[ end trace 7634e03ff94e8934 ]---
  [30558.964811] Kernel panic - not syncing: Fatal exception in interrupt
  [30558.968805] Kernel Offset: disabled

  ProblemType: Bug
  DistroRelease: Ubuntu 16.04
  Package: linux-image-4.4.0-83-generic 4.4.0-83.106
  ProcVersionSignature: Ubuntu 4.4.0-83.106-generic 4.4.70
  Uname: Linux 4.4.0-83-generic x86_64
  AlsaDevices:
   total 0
   crw-rw 1 root audio 116,  1 Jul  6 10:22 seq
   crw-rw 1 root audio 116, 33 Jul  6 10:22 timer
  AplayDevices: Error: [Errno 2] No such file or directory: 'aplay'
  ApportVersion: 2.20.1-0ubuntu2.6
  Architecture: amd64
  ArecordDevices: Error: [Errno 2] No such file or directory: 'arecord'
  AudioDevicesInUse: Error: command ['fuser', '-v', '/dev/snd/seq', 
'/dev/snd/timer'] failed with exit code 1:
  CRDA: N/A
  Date: Thu Jul  6 10:27:37 2017
  Ec2AMI: ami-34100353
  Ec2AMIManifest: (unknown)
  Ec2AvailabilityZone: ap-northeast-1c
  Ec2InstanceType: t2.small
  Ec2Kernel: unavailable
  Ec2Ramdisk: unavailable
  IwConfig: Error: [Errno 2] No such file or directory: 'iwconfig'
  Lsusb: Error: command ['lsusb'] failed with exit code 1:
  MachineType: Xen HVM domU
  PciMultimedia:
   
  ProcEnviron:
   SHELL=/bin/bash
   TERM=screen-256color
   PATH=(custom, no user)
   LANG=en_US.UTF-8
  ProcFB:
   
  ProcKernelCmdLine: BOOT_IMAGE=/boot/vmlinuz-4.4.0-83-generic 

[Kernel-packages] [Bug 1702665] Re: 4.4.0-83-generic + Docker + EC2 frequently crashes at cgroup_rmdir GPF

2017-07-11 Thread Joseph Salisbury
Can you also give the latest upstream 4.4 kernel a test?  It is available from:
http://kernel.ubuntu.com/~kernel-ppa/mainline/v4.4.76/

-- 
You received this bug notification because you are a member of Kernel
Packages, which is subscribed to linux in Ubuntu.
https://bugs.launchpad.net/bugs/1702665

Title:
  4.4.0-83-generic + Docker + EC2 frequently crashes at cgroup_rmdir GPF

Status in docker package in Ubuntu:
  New
Status in linux package in Ubuntu:
  Confirmed

Bug description:
  We run xenial-based Docker container hosts on EC2 with Amazon ECS.
  Recently we refreshed our base image, we started to see frequent
  panic.

  Hosts run Amazon ECS Agent, and the agent automatically creates or
  destroys Docker container based on requests onto ECS cluster.

  I think this crash is caused by Docker-related operations, because
  crashing at cgroups.

  Also, we're running several different cluster with another EC2
  instance types, using same image. This problem is only reproducing at
  t2.small instances. (We also run c4.large and m4.* clusters)

  Our previous image ran 4.4.0-79-generic, and we see no problem with
  79.

  [30558.783899] general protection fault:  [#1] SMP 
  [30558.784056] Modules linked in: veth binfmt_misc xt_nat xt_comment 
xt_tcpudp ipt_MASQUERADE nf_nat_masquerade_ipv4 iptable_nat nf_conntrack_ipv4 
nf_defrag_ipv4 nf_nat_ipv4 xt_addrtype iptable_filter ip_tables xt_conntrack 
x_tables nf_nat nf_conntrack br_netfilter bridge stp llc isofs ppdev input_leds 
serio_raw parport_pc parport autofs4 btrfs xor raid6_pq crct10dif_pclmul 
crc32_pclmul ghash_clmulni_intel aesni_intel aes_x86_64 lrw gf128mul 
glue_helper ablk_helper cryptd psmouse floppy
  [30558.784056] CPU: 0 PID: 1 Comm: systemd Not tainted 4.4.0-83-generic 
#106-Ubuntu
  [30558.784056] Hardware name: Xen HVM domU, BIOS 4.2.amazon 02/16/2017
  [30558.784056] task: 88007c4e8000 ti: 88007c4e4000 task.ti: 
88007c4e4000
  [30558.784056] RIP: 0010:[]  [] 
cgroup_destroy_locked+0x5f/0xf0
  [30558.784056] RSP: 0018:88007c4e7e40  EFLAGS: 00010212
  [30558.784056] RAX: 8800114481bd RBX: 88002827ba50 RCX: 
88007ab8d150
  [30558.784056] RDX: 00111e7e0088 RSI: 88002827ba54 RDI: 
8217745c
  [30558.784056] RBP: 88007c4e7e60 R08: 0020 R09: 
88007c4e7e70
  [30558.784056] R10: 0637760b R11: 880011829a80 R12: 
88007ab8d000
  [30558.784056] R13:  R14: 559b48b2dcc0 R15: 
ff9c
  [30558.784056] FS:  7f29cf0db8c0() GS:88007d20() 
knlGS:
  [30558.784056] CS:  0010 DS:  ES:  CR0: 80050033
  [30558.784056] CR2: 7fa466d58180 CR3: 7c19 CR4: 
001406f0
  [30558.784056] Stack:
  [30558.784056]  88002827ba50 88002827ba50 8800373e70d0 
559b48b2dcc0
  [30558.784056]  88007c4e7e80 811194b3 88002827ba50 

  [30558.784056]  88007c4e7ea0 8128ddcd 880011829a80 

  [30558.784056] Call Trace:
  [30558.784056]  [] cgroup_rmdir+0x23/0x40
  [30558.784056]  [] kernfs_iop_rmdir+0x4d/0x80
  [30558.784056]  [] vfs_rmdir+0xb4/0x130
  [30558.784056]  [] do_rmdir+0x1df/0x200
  [30558.784056]  [] SyS_rmdir+0x16/0x20
  [30558.784056]  [] entry_SYSCALL_64_fastpath+0x16/0x71
  [30558.784056] Code: 74 fd 48 c7 c7 5c 74 17 82 e8 8e 72 72 00 49 8b 94 24 50 
01 00 00 49 8d 8c 24 50 01 00 00 48 39 d1 48 8d 42 f0 74 18 48 8b 50 08  82 
b0 01 00 00 01 48 8b 50 10 48 39 d1 48 8d 42 f0 75 e8 49 
  [30558.784056] RIP  [] cgroup_destroy_locked+0x5f/0xf0
  [30558.784056]  RSP 
  [30558.960828] ---[ end trace 7634e03ff94e8934 ]---
  [30558.964811] Kernel panic - not syncing: Fatal exception in interrupt
  [30558.968805] Kernel Offset: disabled

  ProblemType: Bug
  DistroRelease: Ubuntu 16.04
  Package: linux-image-4.4.0-83-generic 4.4.0-83.106
  ProcVersionSignature: Ubuntu 4.4.0-83.106-generic 4.4.70
  Uname: Linux 4.4.0-83-generic x86_64
  AlsaDevices:
   total 0
   crw-rw 1 root audio 116,  1 Jul  6 10:22 seq
   crw-rw 1 root audio 116, 33 Jul  6 10:22 timer
  AplayDevices: Error: [Errno 2] No such file or directory: 'aplay'
  ApportVersion: 2.20.1-0ubuntu2.6
  Architecture: amd64
  ArecordDevices: Error: [Errno 2] No such file or directory: 'arecord'
  AudioDevicesInUse: Error: command ['fuser', '-v', '/dev/snd/seq', 
'/dev/snd/timer'] failed with exit code 1:
  CRDA: N/A
  Date: Thu Jul  6 10:27:37 2017
  Ec2AMI: ami-34100353
  Ec2AMIManifest: (unknown)
  Ec2AvailabilityZone: ap-northeast-1c
  Ec2InstanceType: t2.small
  Ec2Kernel: unavailable
  Ec2Ramdisk: unavailable
  IwConfig: Error: [Errno 2] No such file or directory: 'iwconfig'
  Lsusb: Error: command ['lsusb'] failed with exit code 1:
  MachineType: Xen HVM domU
  PciMultimedia:
   
  ProcEnviron:
   SHELL=/bin/bash
   TERM=screen-256color
   PATH=(custom, no user)
   LANG=en_US.UTF-8
  ProcFB:
   
  ProcKernelCmdLine: 

[Kernel-packages] [Bug 1702665] Re: 4.4.0-83-generic + Docker + EC2 frequently crashes at cgroup_rmdir GPF

2017-07-09 Thread sorah
4.12 instance is still running without crash, survived this weekend.

Added kernel-fixed-upstream tag.

** Tags added: kernel-fixed-upstream

-- 
You received this bug notification because you are a member of Kernel
Packages, which is subscribed to linux in Ubuntu.
https://bugs.launchpad.net/bugs/1702665

Title:
  4.4.0-83-generic + Docker + EC2 frequently crashes at cgroup_rmdir GPF

Status in docker package in Ubuntu:
  New
Status in linux package in Ubuntu:
  Confirmed

Bug description:
  We run xenial-based Docker container hosts on EC2 with Amazon ECS.
  Recently we refreshed our base image, we started to see frequent
  panic.

  Hosts run Amazon ECS Agent, and the agent automatically creates or
  destroys Docker container based on requests onto ECS cluster.

  I think this crash is caused by Docker-related operations, because
  crashing at cgroups.

  Also, we're running several different cluster with another EC2
  instance types, using same image. This problem is only reproducing at
  t2.small instances. (We also run c4.large and m4.* clusters)

  Our previous image ran 4.4.0-79-generic, and we see no problem with
  79.

  [30558.783899] general protection fault:  [#1] SMP 
  [30558.784056] Modules linked in: veth binfmt_misc xt_nat xt_comment 
xt_tcpudp ipt_MASQUERADE nf_nat_masquerade_ipv4 iptable_nat nf_conntrack_ipv4 
nf_defrag_ipv4 nf_nat_ipv4 xt_addrtype iptable_filter ip_tables xt_conntrack 
x_tables nf_nat nf_conntrack br_netfilter bridge stp llc isofs ppdev input_leds 
serio_raw parport_pc parport autofs4 btrfs xor raid6_pq crct10dif_pclmul 
crc32_pclmul ghash_clmulni_intel aesni_intel aes_x86_64 lrw gf128mul 
glue_helper ablk_helper cryptd psmouse floppy
  [30558.784056] CPU: 0 PID: 1 Comm: systemd Not tainted 4.4.0-83-generic 
#106-Ubuntu
  [30558.784056] Hardware name: Xen HVM domU, BIOS 4.2.amazon 02/16/2017
  [30558.784056] task: 88007c4e8000 ti: 88007c4e4000 task.ti: 
88007c4e4000
  [30558.784056] RIP: 0010:[]  [] 
cgroup_destroy_locked+0x5f/0xf0
  [30558.784056] RSP: 0018:88007c4e7e40  EFLAGS: 00010212
  [30558.784056] RAX: 8800114481bd RBX: 88002827ba50 RCX: 
88007ab8d150
  [30558.784056] RDX: 00111e7e0088 RSI: 88002827ba54 RDI: 
8217745c
  [30558.784056] RBP: 88007c4e7e60 R08: 0020 R09: 
88007c4e7e70
  [30558.784056] R10: 0637760b R11: 880011829a80 R12: 
88007ab8d000
  [30558.784056] R13:  R14: 559b48b2dcc0 R15: 
ff9c
  [30558.784056] FS:  7f29cf0db8c0() GS:88007d20() 
knlGS:
  [30558.784056] CS:  0010 DS:  ES:  CR0: 80050033
  [30558.784056] CR2: 7fa466d58180 CR3: 7c19 CR4: 
001406f0
  [30558.784056] Stack:
  [30558.784056]  88002827ba50 88002827ba50 8800373e70d0 
559b48b2dcc0
  [30558.784056]  88007c4e7e80 811194b3 88002827ba50 

  [30558.784056]  88007c4e7ea0 8128ddcd 880011829a80 

  [30558.784056] Call Trace:
  [30558.784056]  [] cgroup_rmdir+0x23/0x40
  [30558.784056]  [] kernfs_iop_rmdir+0x4d/0x80
  [30558.784056]  [] vfs_rmdir+0xb4/0x130
  [30558.784056]  [] do_rmdir+0x1df/0x200
  [30558.784056]  [] SyS_rmdir+0x16/0x20
  [30558.784056]  [] entry_SYSCALL_64_fastpath+0x16/0x71
  [30558.784056] Code: 74 fd 48 c7 c7 5c 74 17 82 e8 8e 72 72 00 49 8b 94 24 50 
01 00 00 49 8d 8c 24 50 01 00 00 48 39 d1 48 8d 42 f0 74 18 48 8b 50 08  82 
b0 01 00 00 01 48 8b 50 10 48 39 d1 48 8d 42 f0 75 e8 49 
  [30558.784056] RIP  [] cgroup_destroy_locked+0x5f/0xf0
  [30558.784056]  RSP 
  [30558.960828] ---[ end trace 7634e03ff94e8934 ]---
  [30558.964811] Kernel panic - not syncing: Fatal exception in interrupt
  [30558.968805] Kernel Offset: disabled

  ProblemType: Bug
  DistroRelease: Ubuntu 16.04
  Package: linux-image-4.4.0-83-generic 4.4.0-83.106
  ProcVersionSignature: Ubuntu 4.4.0-83.106-generic 4.4.70
  Uname: Linux 4.4.0-83-generic x86_64
  AlsaDevices:
   total 0
   crw-rw 1 root audio 116,  1 Jul  6 10:22 seq
   crw-rw 1 root audio 116, 33 Jul  6 10:22 timer
  AplayDevices: Error: [Errno 2] No such file or directory: 'aplay'
  ApportVersion: 2.20.1-0ubuntu2.6
  Architecture: amd64
  ArecordDevices: Error: [Errno 2] No such file or directory: 'arecord'
  AudioDevicesInUse: Error: command ['fuser', '-v', '/dev/snd/seq', 
'/dev/snd/timer'] failed with exit code 1:
  CRDA: N/A
  Date: Thu Jul  6 10:27:37 2017
  Ec2AMI: ami-34100353
  Ec2AMIManifest: (unknown)
  Ec2AvailabilityZone: ap-northeast-1c
  Ec2InstanceType: t2.small
  Ec2Kernel: unavailable
  Ec2Ramdisk: unavailable
  IwConfig: Error: [Errno 2] No such file or directory: 'iwconfig'
  Lsusb: Error: command ['lsusb'] failed with exit code 1:
  MachineType: Xen HVM domU
  PciMultimedia:
   
  ProcEnviron:
   SHELL=/bin/bash
   TERM=screen-256color
   PATH=(custom, no user)
   LANG=en_US.UTF-8
  ProcFB:
   
  ProcKernelCmdLine: