[Bug 1497428] Re: kernel BUG at /build/buildd/linux-3.13.0/mm/page_alloc.c:968

2019-07-24 Thread Brad Figg
** Tags added: cscc

-- 
You received this bug notification because you are a member of Ubuntu
Bugs, which is subscribed to Ubuntu.
https://bugs.launchpad.net/bugs/1497428

Title:
  kernel BUG at /build/buildd/linux-3.13.0/mm/page_alloc.c:968

To manage notifications about this bug go to:
https://bugs.launchpad.net/ubuntu/+source/linux/+bug/1497428/+subscriptions

-- 
ubuntu-bugs mailing list
ubuntu-bugs@lists.ubuntu.com
https://lists.ubuntu.com/mailman/listinfo/ubuntu-bugs

[Bug 1497428] Re: kernel BUG at /build/buildd/linux-3.13.0/mm/page_alloc.c:968

2019-06-11 Thread Dan Streetman
** Changed in: linux (Ubuntu Trusty)
   Status: Triaged => Won't Fix

** Changed in: linux (Ubuntu)
   Status: Triaged => Fix Released

-- 
You received this bug notification because you are a member of Ubuntu
Bugs, which is subscribed to Ubuntu.
https://bugs.launchpad.net/bugs/1497428

Title:
  kernel BUG at /build/buildd/linux-3.13.0/mm/page_alloc.c:968

To manage notifications about this bug go to:
https://bugs.launchpad.net/ubuntu/+source/linux/+bug/1497428/+subscriptions

-- 
ubuntu-bugs mailing list
ubuntu-bugs@lists.ubuntu.com
https://lists.ubuntu.com/mailman/listinfo/ubuntu-bugs

[Bug 1497428] Re: kernel BUG at /build/buildd/linux-3.13.0/mm/page_alloc.c:968

2017-10-13 Thread Dan Streetman
** Changed in: linux (Ubuntu Trusty)
 Assignee: Dan Streetman (ddstreet) => (unassigned)

** Changed in: linux (Ubuntu)
 Assignee: Dan Streetman (ddstreet) => (unassigned)

** Changed in: linux (Ubuntu Trusty)
   Status: In Progress => Triaged

** Changed in: linux (Ubuntu)
   Status: In Progress => Triaged

-- 
You received this bug notification because you are a member of Ubuntu
Bugs, which is subscribed to Ubuntu.
https://bugs.launchpad.net/bugs/1497428

Title:
  kernel BUG at /build/buildd/linux-3.13.0/mm/page_alloc.c:968

To manage notifications about this bug go to:
https://bugs.launchpad.net/ubuntu/+source/linux/+bug/1497428/+subscriptions

-- 
ubuntu-bugs mailing list
ubuntu-bugs@lists.ubuntu.com
https://lists.ubuntu.com/mailman/listinfo/ubuntu-bugs

[Bug 1497428] Re: kernel BUG at /build/buildd/linux-3.13.0/mm/page_alloc.c:968

2017-03-13 Thread Craig Watcham
m4.10xl also looks good:

[7399124.202570] lp1497428: module verification failed: signature and/or  
required key missing - tainting kernel
[7399124.223762] pageblock_nr_pages 0x200
[7399124.226007] node 0 zone 0 info:
[7399124.227849] node 0 zone 0 provided page pfn 0xfff valid 1 present 1
[7399124.231173] node 0 zone 0 start pfn 0xe00 valid 1 present 1
[7399124.234120] node 0 zone 0 end pfn 0xfff valid 1 present 1
[7399124.237107] node 0 zone 0 page ea03ffc0 start_page 
ea038000 end_page ea03ffc0
[7399124.241777] node 0 zone 0 spans: provided pfn 1 start pfn 1 end pfn 1
[7399124.245160] node 0 zone 0 start pfn 0x1 spanned pages 0xfff end pfn 0x1000
[7399124.248725] node 0 zone 0 present pages 0xf9d managed pages 0xf88
[7399124.251852] node 0 start pfn 0x1 end pfn 0x2818000
[7399124.254459] node 0 normal pageblock multiple
[7399124.256792] node 0 zone 1 info:
[7399124.258611] node 0 zone 1 provided page pfn 0xf valid 0 present 0
[7399124.261926] node 0 zone 1 start pfn 0xffe00 valid 0 present 0
[7399124.264980] node 0 zone 1 end pfn 0xf valid 0 present 0
[7399124.267859] node 0 zone 1 page ea0003c0 start_page 
ea0003ff8000 end_page ea0003c0
[7399124.272375] node 0 zone 1 spans: provided pfn 1 start pfn 1 end pfn 1
[7399124.275627] node 0 zone 1 start pfn 0x1000 spanned pages 0xff000 end pfn 
0x10
[7399124.279624] node 0 zone 1 present pages 0xef000 managed pages 0xea2d3
[7399124.283191] node 0 start pfn 0x1 end pfn 0x2818000
[7399124.286066] node 0 normal pageblock multiple
[7399124.288556] node 0 zone 2 info:
[7399124.290734] node 0 zone 2 provided page pfn 0x2817fff valid 1 present 1
[7399124.294354] node 0 zone 2 start pfn 0x2817e00 valid 1 present 1
[7399124.297599] node 0 zone 2 end pfn 0x2817fff valid 1 present 1
[7399124.300895] node 0 zone 2 page ea00a05fffc0 start_page 
ea00a05f8000 end_page ea00a05fffc0
[7399124.305733] node 0 zone 2 spans: provided pfn 1 start pfn 1 end pfn 1
[7399124.309124] node 0 zone 2 start pfn 0x10 spanned pages 0x2718000 end 
pfn 0x2818000
[7399124.313552] node 0 zone 2 present pages 0x131 managed pages 0x12bf8e5
[7399124.317134] node 0 start pfn 0x1 end pfn 0x2818000
[7399124.319881] node 0 normal pageblock multiple
[7399124.322452] node 1 zone 2 info:
[7399124.324461] node 1 zone 2 provided page pfn 0x280 valid 1 present 1
[7399124.328042] node 1 zone 2 start pfn 0x280fe00 valid 1 present 1
[7399124.331418] node 1 zone 2 end pfn 0x280 valid 1 present 1
[7399124.334599] node 1 zone 2 page ea00a03fffc0 start_page 
ea00a03f8000 end_page ea00a03fffc0
[7399124.339371] node 1 zone 2 spans: provided pfn 1 start pfn 1 end pfn 1
[7399124.342829] node 1 zone 2 start pfn 0x141 spanned pages 0x140 end 
pfn 0x281
[7399124.347266] node 1 zone 2 present pages 0x140 managed pages 0x13af8ab
[7399124.350781] node 1 start pfn 0x141 end pfn 0x281
[7399124.353662] node 1 normal pageblock multiple

-- 
You received this bug notification because you are a member of Ubuntu
Bugs, which is subscribed to Ubuntu.
https://bugs.launchpad.net/bugs/1497428

Title:
  kernel BUG at /build/buildd/linux-3.13.0/mm/page_alloc.c:968

To manage notifications about this bug go to:
https://bugs.launchpad.net/ubuntu/+source/linux/+bug/1497428/+subscriptions

-- 
ubuntu-bugs mailing list
ubuntu-bugs@lists.ubuntu.com
https://lists.ubuntu.com/mailman/listinfo/ubuntu-bugs


[Bug 1497428] Re: kernel BUG at /build/buildd/linux-3.13.0/mm/page_alloc.c:968

2017-03-13 Thread Craig Watcham
Looks like this has been corrected, from a recent c4.8xl launch:
--
[0.00] Linux version 3.13.0-71-generic (buildd@lgw01-09) (gcc version 
4.8.2 (Ubuntu 4.8.2-19ubuntu1) ) #114-Ubuntu SMP Tue Dec 1 02:34:22 UTC 2015 
(Ubuntu 3.13.0-71.114-generic 3.13.11-ckt29)
[0.00] Command line: BOOT_IMAGE=/boot/vmlinuz-3.13.0-71-generic 
root=UUID=0fbd65e4-a082-4f5a-8392-49add7329657 ro console=tty1 console=ttyS0
[0.00] KERNEL supported cpus:
[0.00]   Intel GenuineIntel
[0.00]   AMD AuthenticAMD
[0.00]   Centaur CentaurHauls
[0.00] e820: BIOS-provided physical RAM map:
[0.00] BIOS-e820: [mem 0x-0x0009dfff] usable
[0.00] BIOS-e820: [mem 0x0009e000-0x0009] reserved
[0.00] BIOS-e820: [mem 0x000e-0x000f] reserved
[0.00] BIOS-e820: [mem 0x0010-0xefff] usable
[0.00] BIOS-e820: [mem 0xfc00-0x] reserved
[0.00] BIOS-e820: [mem 0x0001-0x000f0fff] usable
--

Debug module output (for reference);
--
[6409565.542433] lp1497428: module verification failed: signature and/or  
required key missing - tainting kernel
[6409565.565990] pageblock_nr_pages 0x200
[6409565.568026] node 0 zone 0 info:
[6409565.569761] node 0 zone 0 provided page pfn 0xfff valid 1 present 1
[6409565.572417] node 0 zone 0 start pfn 0xe00 valid 1 present 1
[6409565.574936] node 0 zone 0 end pfn 0xfff valid 1 present 1
[6409565.577453] node 0 zone 0 page ea03ffc0 start_page 
ea038000 end_page ea03ffc0
[6409565.581422] node 0 zone 0 spans: provided pfn 1 start pfn 1 end pfn 1
[6409565.584305] node 0 zone 0 start pfn 0x1 spanned pages 0xfff end pfn 0x1000
[6409565.587379] node 0 zone 0 present pages 0xf9d managed pages 0xf88
[6409565.590125] node 0 start pfn 0x1 end pfn 0xf18000
[6409565.592238] node 0 normal pageblock multiple
[6409565.594218] node 0 zone 1 info:
[6409565.595756] node 0 zone 1 provided page pfn 0xf valid 0 present 0
[6409565.598642] node 0 zone 1 start pfn 0xffe00 valid 0 present 0
[6409565.601218] node 0 zone 1 end pfn 0xf valid 0 present 0
[6409565.603587] node 0 zone 1 page ea0003c0 start_page 
ea0003ff8000 end_page ea0003c0
[6409565.607428] node 0 zone 1 spans: provided pfn 1 start pfn 1 end pfn 1
[6409565.610296] node 0 zone 1 start pfn 0x1000 spanned pages 0xff000 end pfn 
0x10
[6409565.613730] node 0 zone 1 present pages 0xef000 managed pages 0xea2d3
[6409565.616697] node 0 start pfn 0x1 end pfn 0xf18000
[6409565.619146] node 0 normal pageblock multiple
[6409565.621321] node 0 zone 2 info:
[6409565.623000] node 0 zone 2 provided page pfn 0xf17fff valid 1 present 1
[6409565.625955] node 0 zone 2 start pfn 0xf17e00 valid 1 present 1
[6409565.628808] node 0 zone 2 end pfn 0xf17fff valid 1 present 1
[6409565.631467] node 0 zone 2 page ea003c5fffc0 start_page 
ea003c5f8000 end_page ea003c5fffc0
[6409565.635675] node 0 zone 2 spans: provided pfn 1 start pfn 1 end pfn 1
[6409565.638791] node 0 zone 2 start pfn 0x10 spanned pages 0xe18000 end 
pfn 0xf18000
[6409565.642571] node 0 zone 2 present pages 0x679100 managed pages 0x65aded
[6409565.645714] node 0 start pfn 0x1 end pfn 0xf18000
[6409565.648024] node 0 normal pageblock multiple
[6409565.650256] node 1 zone 2 info:
[6409565.651940] node 1 zone 2 provided page pfn 0xf0 valid 1 present 1
[6409565.654851] node 1 zone 2 start pfn 0xf0fe00 valid 1 present 1
[6409565.657571] node 1 zone 2 end pfn 0xf0 valid 1 present 1
[6409565.660316] node 1 zone 2 page ea003c3fffc0 start_page 
ea003c3f8000 end_page ea003c3fffc0
[6409565.664331] node 1 zone 2 spans: provided pfn 1 start pfn 1 end pfn 1
[6409565.667235] node 1 zone 2 start pfn 0x779100 spanned pages 0x796f00 end 
pfn 0xf1
[6409565.671038] node 1 zone 2 present pages 0x796f00 managed pages 0x7783b2
[6409565.673982] node 1 start pfn 0x779100 end pfn 0xf1
[6409565.676321] node 1 normal pageblock multiple
--

-- 
You received this bug notification because you are a member of Ubuntu
Bugs, which is subscribed to Ubuntu.
https://bugs.launchpad.net/bugs/1497428

Title:
  kernel BUG at /build/buildd/linux-3.13.0/mm/page_alloc.c:968

To manage notifications about this bug go to:
https://bugs.launchpad.net/ubuntu/+source/linux/+bug/1497428/+subscriptions

-- 
ubuntu-bugs mailing list
ubuntu-bugs@lists.ubuntu.com
https://lists.ubuntu.com/mailman/listinfo/ubuntu-bugs


[Bug 1497428] Re: kernel BUG at /build/buildd/linux-3.13.0/mm/page_alloc.c:968

2016-09-09 Thread Dan Streetman
That is very definitely not the same bug as this bug.

-- 
You received this bug notification because you are a member of Ubuntu
Bugs, which is subscribed to Ubuntu.
https://bugs.launchpad.net/bugs/1497428

Title:
  kernel BUG at /build/buildd/linux-3.13.0/mm/page_alloc.c:968

To manage notifications about this bug go to:
https://bugs.launchpad.net/ubuntu/+source/linux/+bug/1497428/+subscriptions

-- 
ubuntu-bugs mailing list
ubuntu-bugs@lists.ubuntu.com
https://lists.ubuntu.com/mailman/listinfo/ubuntu-bugs


[Bug 1497428] Re: kernel BUG at /build/buildd/linux-3.13.0/mm/page_alloc.c:968

2016-08-23 Thread Matt W
I can't be sure that we ran into the exact same bug, but Amazon seems to
think we may have. I can't find the beginning of the console log, but
here's a mid-point that shows the hang:

Host Type: Amazon EC2 r3.8xlarge
OS: Ubuntu 14.04.5
Kernel: 3.13.0-93-generic
Networking: Intel Enhanced Neworking driver 2.16.4 (ixgbevf)

Workload: Postgres running with most of the systems memory, but Apache
Flume was going a bit haywire at the time taking ~20-30% of the
available CPU (using Oracle Java 7).

[27484.664087] Code: cc cc cc b8 1c 00 00 00 0f 01 c1 c3 cc cc cc cc cc cc cc 
cc cc cc cc cc cc cc cc cc cc cc cc cc cc cc cc b8 1d 00 00 00 0f 01 c1  cc 
cc cc cc cc cc cc cc cc cc cc cc cc cc cc cc cc cc cc cc 
[27504.324077] BUG: soft lockup - CPU#2 stuck for 22s! [java:62266]
[27504.324077] Modules linked in: ipt_REJECT xt_multiport nf_conntrack_ipv4 
nf_defrag_ipv4 xt_comment xt_conntrack nf_conntrack ip6table_filter ip6_tables 
iptable_filter ip_tables x_tables bcache dm_crypt syscopyarea[27504.344088] 
BUG: soft lockup - CPU#3 stuck for 22s! [java:62269]
[27504.344088] Modules linked in: ipt_REJECT xt_multiport nf_conntrack_ipv4 
nf_defrag_ipv4 xt_comment xt_conntrack nf_conntrack ip6table_filter ip6_tables 
iptable_filter ip_tables x_tables bcache dm_crypt syscopyarea sysfillrect 
sysimgblt fb_sys_fops serio_raw isofs raid10 raid456 async_memcpy 
async_raid6_recov async_pq async_xor async_tx xor raid6_pq raid1 raid0 
multipath linear crct10dif_pclmul crc32_pclmul aesni_intel aes_x86_64 lrw 
gf128mul glue_helper ablk_helper cryptd psmouse floppy ixgbevf(OX)
[27504.344088] CPU: 3 PID: 62269 Comm: java Tainted: G  DOX 
3.13.0-93-generic #140-Ubuntu
[27504.344088] Hardware name: Xen HVM domU, BIOS 4.2.amazon 05/12/2016
[27504.344088] task: 883c70a31800 ti: 8837955e task.ti: 
8837955e
[27504.344088] RIP: 0010:[]  [] 
xen_hypercall_sched_op+0x8/0x20
[27504.344088] RSP: :8837955e1c60  EFLAGS: 0202
[27504.344088] RAX:  RBX: 8837955e1c40 RCX: fffa
[27504.344088] RDX:  RSI: 8837955e1c70 RDI: 0003
[27504.344088] RBP: 8837955e1c90 R08: 881e1980f800 R09: 881e19400470
[27504.344088] R10: 0019 R11: 801161833966 R12: 1000
[27504.344088] R13: 883c70778340 R14:  R15: 
[27504.344088] FS:  7f0b9c7d7700() GS:881e19c6() 
knlGS:
[27504.344088] CS:  0010 DS:  ES:  CR0: 80050033
[27504.344088] CR2: 00070809b000 CR3: 0013bdca CR4: 001406e0
[27504.344088] Stack:
[27504.344088]  81438b2e 003b810c1ec7 8837955e1c6c 
0001
[27504.344088]   881e19c6afe0 8837955e1ca0 
8143aab0
[27504.344088]  8837955e1ce8 81011fa3 0213 
3eba955e1d40
[27504.344088] Call Trace:
[27504.344088]  [] ? xen_poll_irq_timeout+0x3e/0x50
[27504.344088]  [] xen_poll_irq+0x10/0x20
[27504.344088]  [] xen_lock_spinning+0xa3/0x100
[27504.344088]  [] 
__raw_callee_save_xen_lock_spinning+0x11/0x20
[27504.344088]  [] ? _raw_spin_lock+0x48/0x50
[27504.344088]  [] do_numa_page+0x5a/0x1b0
[27504.344088]  [] handle_mm_fault+0x5ff/0xf00
[27504.344088]  [] ? filldir+0x88/0x100
[27504.344088]  [] __do_page_fault+0x184/0x560
[27504.344088]  [] ? iterate_dir+0x7c/0xe0
[27504.344088]  [] do_page_fault+0x1a/0x70
[27504.344088]  [] page_fault+0x28/0x30
```

-- 
You received this bug notification because you are a member of Ubuntu
Bugs, which is subscribed to Ubuntu.
https://bugs.launchpad.net/bugs/1497428

Title:
  kernel BUG at /build/buildd/linux-3.13.0/mm/page_alloc.c:968

To manage notifications about this bug go to:
https://bugs.launchpad.net/ubuntu/+source/linux/+bug/1497428/+subscriptions

-- 
ubuntu-bugs mailing list
ubuntu-bugs@lists.ubuntu.com
https://lists.ubuntu.com/mailman/listinfo/ubuntu-bugs


[Bug 1497428] Re: kernel BUG at /build/buildd/linux-3.13.0/mm/page_alloc.c:968

2016-02-02 Thread Joseph Salisbury
** Tags removed: kernel-key
** Tags added: kernel-da-key

-- 
You received this bug notification because you are a member of Ubuntu
Bugs, which is subscribed to Ubuntu.
https://bugs.launchpad.net/bugs/1497428

Title:
  kernel BUG at /build/buildd/linux-3.13.0/mm/page_alloc.c:968

To manage notifications about this bug go to:
https://bugs.launchpad.net/ubuntu/+source/linux/+bug/1497428/+subscriptions

-- 
ubuntu-bugs mailing list
ubuntu-bugs@lists.ubuntu.com
https://lists.ubuntu.com/mailman/listinfo/ubuntu-bugs


[Bug 1497428] Re: kernel BUG at /build/buildd/linux-3.13.0/mm/page_alloc.c:968

2016-01-27 Thread Dan Streetman
Matt,

I think it's fine that upstream has demoted the BUG_ON, as I haven't
heard anyone report this with a kernel later than 3.13; I assume
whatever is causing it is fixed in later kernels.

At this point there's not much more I can do, as I can't reproduce it
and don't have much debug info on exactly what/why the zone's pfn range
becomes incorrect.  If you or anyone has any info on how to reproduce
this, or you have a system where it's reproducable, please let me know.

-- 
You received this bug notification because you are a member of Ubuntu
Bugs, which is subscribed to Ubuntu.
https://bugs.launchpad.net/bugs/1497428

Title:
  kernel BUG at /build/buildd/linux-3.13.0/mm/page_alloc.c:968

To manage notifications about this bug go to:
https://bugs.launchpad.net/ubuntu/+source/linux/+bug/1497428/+subscriptions

-- 
ubuntu-bugs mailing list
ubuntu-bugs@lists.ubuntu.com
https://lists.ubuntu.com/mailman/listinfo/ubuntu-bugs


[Bug 1497428] Re: kernel BUG at /build/buildd/linux-3.13.0/mm/page_alloc.c:968

2016-01-11 Thread Matt Wilson
Dan,

This BUG_ON has been demoted to only trigger when DEBUG_VM is set in
upstream:

http://git.kernel.org/cgit/linux/kernel/git/torvalds/linux.git/commit/?id=97ee4ba7cbd30f1858f0d16911e042737c53f2ef

I'm looking into why there's a one page difference between the E820
tables and SRAT. You're right that there seems to be an off-by-one in
one or the other.

-- 
You received this bug notification because you are a member of Ubuntu
Bugs, which is subscribed to Ubuntu.
https://bugs.launchpad.net/bugs/1497428

Title:
  kernel BUG at /build/buildd/linux-3.13.0/mm/page_alloc.c:968

To manage notifications about this bug go to:
https://bugs.launchpad.net/ubuntu/+source/linux/+bug/1497428/+subscriptions

-- 
ubuntu-bugs mailing list
ubuntu-bugs@lists.ubuntu.com
https://lists.ubuntu.com/mailman/listinfo/ubuntu-bugs


[Bug 1497428] Re: kernel BUG at /build/buildd/linux-3.13.0/mm/page_alloc.c:968

2015-12-18 Thread Dan Streetman
kernel module to add debug for this mm BUG().  This module is for kernel
3.13.0-71-generic only.

** Attachment added: "lp1497428.ko"
   
https://bugs.launchpad.net/ubuntu/trusty/+source/linux/+bug/1497428/+attachment/4537000/+files/lp1497428.ko

-- 
You received this bug notification because you are a member of Ubuntu
Bugs, which is subscribed to Ubuntu.
https://bugs.launchpad.net/bugs/1497428

Title:
  kernel BUG at /build/buildd/linux-3.13.0/mm/page_alloc.c:968

To manage notifications about this bug go to:
https://bugs.launchpad.net/ubuntu/+source/linux/+bug/1497428/+subscriptions

-- 
ubuntu-bugs mailing list
ubuntu-bugs@lists.ubuntu.com
https://lists.ubuntu.com/mailman/listinfo/ubuntu-bugs


[Bug 1497428] Re: kernel BUG at /build/buildd/linux-3.13.0/mm/page_alloc.c:968

2015-12-18 Thread Dan Streetman
Can anyone seeing this problem, if you're on the 3.13.0-71-generic
kernel, please load the above attached module?  It will initially check
the node/zone start/end locations for validity, and also will check
every time move_freepages is called, and if it detects the BUG() will be
hit it prints out debug about the current node/zone values - but it
doesn't prevent the BUG() so you'll know when the problem reproduces.

-- 
You received this bug notification because you are a member of Ubuntu
Bugs, which is subscribed to Ubuntu.
https://bugs.launchpad.net/bugs/1497428

Title:
  kernel BUG at /build/buildd/linux-3.13.0/mm/page_alloc.c:968

To manage notifications about this bug go to:
https://bugs.launchpad.net/ubuntu/+source/linux/+bug/1497428/+subscriptions

-- 
ubuntu-bugs mailing list
ubuntu-bugs@lists.ubuntu.com
https://lists.ubuntu.com/mailman/listinfo/ubuntu-bugs


[Bug 1497428] Re: kernel BUG at /build/buildd/linux-3.13.0/mm/page_alloc.c:968

2015-12-18 Thread Dan Streetman
BTW, I've only seen this situation - with a node end pfn not on a
pageblock boundary - happen with the AWS flavors "c4.8xlarge" and
"m4.10xlarge".  If anyone else sees this bug anywhere besides those
Amazon AWS instances, please let me know.

-- 
You received this bug notification because you are a member of Ubuntu
Bugs, which is subscribed to Ubuntu.
https://bugs.launchpad.net/bugs/1497428

Title:
  kernel BUG at /build/buildd/linux-3.13.0/mm/page_alloc.c:968

To manage notifications about this bug go to:
https://bugs.launchpad.net/ubuntu/+source/linux/+bug/1497428/+subscriptions

-- 
ubuntu-bugs mailing list
ubuntu-bugs@lists.ubuntu.com
https://lists.ubuntu.com/mailman/listinfo/ubuntu-bugs


[Bug 1497428] Re: kernel BUG at /build/buildd/linux-3.13.0/mm/page_alloc.c:968

2015-12-10 Thread Dan Streetman
>  I won't pretend to know how numactl interleaves the memory across the nodes,
> but I can't help but think high memory usage on these nodes combined with
> forced interleaving might be why we hit this issue?

The numactl interleaving just causes memory to be allocated from all
nodes on a round-robin basis, I don't think that would cause this, other
than mongod simply using a whole lot of memory.

> After weeks of stress testing with your custom kernel, I have yet to
hit this issue again

so, the custom kernel actually bypasses the BUG() call, and logs debug
instead - have you checked your logs to see if there are any relevant
messages?  You would see output like:

page_zone(start_page) !=page_zone(end_page)

and more debug following it; you can search/grep the logs for
"move_freepages".

-- 
You received this bug notification because you are a member of Ubuntu
Bugs, which is subscribed to Ubuntu.
https://bugs.launchpad.net/bugs/1497428

Title:
  kernel BUG at /build/buildd/linux-3.13.0/mm/page_alloc.c:968

To manage notifications about this bug go to:
https://bugs.launchpad.net/ubuntu/+source/linux/+bug/1497428/+subscriptions

-- 
ubuntu-bugs mailing list
ubuntu-bugs@lists.ubuntu.com
https://lists.ubuntu.com/mailman/listinfo/ubuntu-bugs


[Bug 1497428] Re: kernel BUG at /build/buildd/linux-3.13.0/mm/page_alloc.c:968

2015-12-10 Thread dave.muysson
Dan,

  Not sure if this will help or not, but of the 8+ servers we have using
the r3.large instance type, the only two that have encountered the issue
were running MongoDB on them, launched using the numactl tool with the
--interleave=all option set.

Here's the exact launch command used:

exec start-stop-daemon --start --quiet --chuid mongodb --make-pidfile
--pidfile /var/run/mongodb.pid --exec /usr/bin/numactl --
--interleave=all  /usr/bin/mongod --config /etc/mongodb.conf

  I won't pretend to know how numactl interleaves the memory across the
nodes, but I can't help but think high memory usage on these nodes
combined with forced interleaving might be why we hit this issue?

  After weeks of stress testing with your custom kernel, I have yet to
hit this issue again. The synthetic environment I'm using probably isn't
enough to hit this bug. Hopefully your testing with the c4.8xLarge is
more helpful.

-- 
You received this bug notification because you are a member of Ubuntu
Bugs, which is subscribed to Ubuntu.
https://bugs.launchpad.net/bugs/1497428

Title:
  kernel BUG at /build/buildd/linux-3.13.0/mm/page_alloc.c:968

To manage notifications about this bug go to:
https://bugs.launchpad.net/ubuntu/+source/linux/+bug/1497428/+subscriptions

-- 
ubuntu-bugs mailing list
ubuntu-bugs@lists.ubuntu.com
https://lists.ubuntu.com/mailman/listinfo/ubuntu-bugs


[Bug 1497428] Re: kernel BUG at /build/buildd/linux-3.13.0/mm/page_alloc.c:968

2015-12-09 Thread Dan Streetman
i booted a c4.8xlarge flavor AWS instance and got the same memory/numa
layout as comment 16.  To clarify though, the /proc/iomem output isn't
representative of the actual memory layout; specifically it is:

[0.00] e820: BIOS-provided physical RAM map:
[0.00] BIOS-e820: [mem 0x-0x0009dfff] usable
[0.00] BIOS-e820: [mem 0x0009e000-0x0009] reserved
[0.00] BIOS-e820: [mem 0x000e-0x000f] reserved
[0.00] BIOS-e820: [mem 0x0010-0xefff] usable
[0.00] BIOS-e820: [mem 0xfc00-0x] reserved
[0.00] BIOS-e820: [mem 0x0001-0x000f0fffefff] usable

and the SRAT divides it into 2 nodes as:

[929310.710905] SRAT: Node 0 PXM 0 [mem 0x-0xefff]
[929310.710906] SRAT: Node 0 PXM 0 [mem 0x1-0x778ef]
[929310.710907] SRAT: Node 1 PXM 1 [mem 0x778f0-0xf0fff]

so the node ranges are set up as:

[929310.854161] On node 0 totalpages: 7769757
[929310.854162]   DMA zone: 64 pages used for memmap
[929310.854162]   DMA zone: 21 pages reserved
[929310.854163]   DMA zone: 3997 pages, LIFO batch:0
[929310.854196] mminit::memmap_init Initialising map node 0 zone 0 pfns 1 -> 
4096
[929310.854265]   DMA32 zone: 15296 pages used for memmap
[929310.854266]   DMA32 zone: 978944 pages, LIFO batch:31
[929310.854299] mminit::memmap_init Initialising map node 0 zone 1 pfns 4096 -> 
1048576
[929310.869608]   Normal zone: 106044 pages used for memmap
[929310.869611]   Normal zone: 6786816 pages, LIFO batch:31
[929310.869647] mminit::memmap_init Initialising map node 0 zone 2 pfns 1048576 
-> 7835392
[929310.975013] On node 1 totalpages: 7958783
[929310.975018]   Normal zone: 124356 pages used for memmap
[929310.975019]   Normal zone: 7958783 pages, LIFO batch:31
[929310.975055] mminit::memmap_init Initialising map node 1 zone 2 pfns 7835392 
-> 15794175

node 0 DMA and DMA32 ranges are normal, ending at 0x1000 and 0x10,
respectively.  The Normal zone for node 0 ends at 0x778f00, and the
Normal zone for node 1 ends at 0xf0.  Since PAGE_SHIFT is 12 and
pageblock_order (with this system config) is (21 - 12 = 9):

node 0 Normal zone ends on a pageblock boundary, while node 1 Normal
zone ends 1 page short of a pageblock boundary.

Preliminary note: the SRAT table seems to be incorrect; it spans node 1
all the way to 0xf0fff, but e820 memory, and the node 1 Normal zone,
only reach 0xf0fffefff.

-- 
You received this bug notification because you are a member of Ubuntu
Bugs, which is subscribed to Ubuntu.
https://bugs.launchpad.net/bugs/1497428

Title:
  kernel BUG at /build/buildd/linux-3.13.0/mm/page_alloc.c:968

To manage notifications about this bug go to:
https://bugs.launchpad.net/ubuntu/+source/linux/+bug/1497428/+subscriptions

-- 
ubuntu-bugs mailing list
ubuntu-bugs@lists.ubuntu.com
https://lists.ubuntu.com/mailman/listinfo/ubuntu-bugs


[Bug 1497428] Re: kernel BUG at /build/buildd/linux-3.13.0/mm/page_alloc.c:968

2015-12-07 Thread Chris J Arges
** Tags added: kernel-key

-- 
You received this bug notification because you are a member of Ubuntu
Bugs, which is subscribed to Ubuntu.
https://bugs.launchpad.net/bugs/1497428

Title:
  kernel BUG at /build/buildd/linux-3.13.0/mm/page_alloc.c:968

To manage notifications about this bug go to:
https://bugs.launchpad.net/ubuntu/+source/linux/+bug/1497428/+subscriptions

-- 
ubuntu-bugs mailing list
ubuntu-bugs@lists.ubuntu.com
https://lists.ubuntu.com/mailman/listinfo/ubuntu-bugs


[Bug 1497428] Re: kernel BUG at /build/buildd/linux-3.13.0/mm/page_alloc.c:968

2015-12-07 Thread Joseph Salisbury
** Changed in: linux (Ubuntu)
   Importance: Low => High

** Changed in: linux (Ubuntu Trusty)
   Importance: Undecided => High

-- 
You received this bug notification because you are a member of Ubuntu
Bugs, which is subscribed to Ubuntu.
https://bugs.launchpad.net/bugs/1497428

Title:
  kernel BUG at /build/buildd/linux-3.13.0/mm/page_alloc.c:968

To manage notifications about this bug go to:
https://bugs.launchpad.net/ubuntu/+source/linux/+bug/1497428/+subscriptions

-- 
ubuntu-bugs mailing list
ubuntu-bugs@lists.ubuntu.com
https://lists.ubuntu.com/mailman/listinfo/ubuntu-bugs


[Bug 1497428] Re: kernel BUG at /build/buildd/linux-3.13.0/mm/page_alloc.c:968

2015-12-04 Thread Dan Streetman
To clarify the bug, a bit of background is needed first (specific
numbers apply only to this situation).

The kernel refers to all pages under a single PMD (midlevel page table)
as a "pageblock".  It's the same size as a hugepage, 2M.  In the
function triggering the BUG(), it's expecting that the start and end
pages are inside the same zone, but that isn't the case so the BUG() is
triggered.  One function up, move_freepages_block(), is where the start
and end PFNs are set; the function takes one page and calculates the
start and end PFNs (which are aligned) that contain the provided page.
It then verifies that both PFNs are inside the original page's zone, and
passes the start/end pages to move_freepages().

The problem is that the zone's PFN range is wrong.  In this particular
case, the zone's memory ends in the middle of a pageblock, which is
unusual.  So when move_freepages_block() checks if the end PFN of the
pageblock is inside the zone (i.e. < zone end PFN), it *should* fail,
and cause the function to return.  However, it doesn't fail, meaning the
zone's end PFN is wrong, and when move_freepages() checks the
page_zone() of the start and end pages, they don't match - because the
end page isn't valid - and the BUG() is triggered.

In my testing, if I manually limit memory to end in the middle of a
pageblock, the zone's end PFN is correctly set, so it seems that
something is changing the zone PFN range (specifically the zone's
spanned_pages value) at runtime - or, the particular environment for
this bug is different that my test setup and getting the zone end PFN
wrong somehow.  I'm going to create a debug module that will jprobe
these functions to check for this condition, and then print debug output
and avoid the BUG().


As a workaround for this, if the amount of memory is set so that it ends at a 
multiple of the pageblock size (512 4k pages == 2M), this bug should not 
happen.  On x86, the boot mem= param sets the maximum address, which should 
allow changing the zone's end pfn to be aligned with pageblock; e.g. if the 
dmesg e820 output lists the last line of the memory ranges as:

[0.00] BIOS-e820: [mem 0x0001-0x0003e08f]
usable

then the last valid PFN is 0x3e08f, so the zone end pfn (1 more than
last valid pfn) is 0x3e090, which isn't a multiple of the pageblock
size (2M):

$ echo $[ 0x3e090 % (2 * 1024 * 1024) ]
1048576

In this example case, restricting the last 1M of memory by setting
mem=0x3e080 should work around this bug - although since I can't
reproduce it yet, I've no way to verify the workaround; and it may
simply cause the bug to appear at a different location.

-- 
You received this bug notification because you are a member of Ubuntu
Bugs, which is subscribed to Ubuntu.
https://bugs.launchpad.net/bugs/1497428

Title:
  kernel BUG at /build/buildd/linux-3.13.0/mm/page_alloc.c:968

To manage notifications about this bug go to:
https://bugs.launchpad.net/ubuntu/+source/linux/+bug/1497428/+subscriptions

-- 
ubuntu-bugs mailing list
ubuntu-bugs@lists.ubuntu.com
https://lists.ubuntu.com/mailman/listinfo/ubuntu-bugs


[Bug 1497428] Re: kernel BUG at /build/buildd/linux-3.13.0/mm/page_alloc.c:968

2015-12-04 Thread Dan Streetman
The newer kernel may have some change/fix that prevents this bug, as I
haven't seen any reports of this (from google, at least) on any other
kernel.  Plus, the unusual requirement of the memory having to end at
not a multiple of 2M.

> But the “Node 0 Normal” zone, judging from (start_pfn, start_pfn+spanned) 
> spans 1-f1800. 
> The machine in question is a c4.8xlarge in EC2.

Awesome, I'll set up a vm with that flavor to see if I can reproduce
this, or at least reproduce the problematic zone setup.  Thanks!

-- 
You received this bug notification because you are a member of Ubuntu
Bugs, which is subscribed to Ubuntu.
https://bugs.launchpad.net/bugs/1497428

Title:
  kernel BUG at /build/buildd/linux-3.13.0/mm/page_alloc.c:968

To manage notifications about this bug go to:
https://bugs.launchpad.net/ubuntu/+source/linux/+bug/1497428/+subscriptions

-- 
ubuntu-bugs mailing list
ubuntu-bugs@lists.ubuntu.com
https://lists.ubuntu.com/mailman/listinfo/ubuntu-bugs

[Bug 1497428] Re: kernel BUG at /build/buildd/linux-3.13.0/mm/page_alloc.c:968

2015-12-04 Thread Nelson Elhage
** Attachment added: "/proc/zoneinfo from the same machine"
   
https://bugs.launchpad.net/ubuntu/+source/linux/+bug/1497428/+attachment/4529727/+files/zoneinfo.txt

-- 
You received this bug notification because you are a member of Ubuntu
Bugs, which is subscribed to Ubuntu.
https://bugs.launchpad.net/bugs/1497428

Title:
  kernel BUG at /build/buildd/linux-3.13.0/mm/page_alloc.c:968

To manage notifications about this bug go to:
https://bugs.launchpad.net/ubuntu/+source/linux/+bug/1497428/+subscriptions

-- 
ubuntu-bugs mailing list
ubuntu-bugs@lists.ubuntu.com
https://lists.ubuntu.com/mailman/listinfo/ubuntu-bugs


[Bug 1497428] Re: kernel BUG at /build/buildd/linux-3.13.0/mm/page_alloc.c:968

2015-12-04 Thread Nelson Elhage
Hi @ddstreet, thanks for the update.

We unfortunately weren't able to reproduce this on your test kernel, and
have since moved to a newer kernel version for other reasons.

However, I can confirm that on the affected machine types, and only the
affected machine type, we see a memory range in `/proc/iomem` that ends
off of a multiple of 2M. Again, we do not see this on any other
machines.

I've attached `/proc/iomem` and `/proc/zoneinfo` from an affected machine 
(currently running an LTS backport kernel:
Linux [redacted] 3.19.0-33-generic #38~14.04.1-Ubuntu SMP Fri Nov 6 18:17:28 
UTC 2015 x86_64 x86_64 x86_64 GNU/Linux
)

I'm pretty sure they show the kind of mismatch you're talking about;
iomem shows

1-f0fffefff : System RAM
f0000-f0fff : RAM buffer
f1000-f17ff : System RAM


But the “Node 0 Normal” zone, judging from (start_pfn, start_pfn+spanned) spans 
1-f1800. The machine in question is a c4.8xlarge in EC2.
 


** Attachment added: "iomem.txt"
   
https://bugs.launchpad.net/ubuntu/+source/linux/+bug/1497428/+attachment/4529726/+files/iomem.txt

-- 
You received this bug notification because you are a member of Ubuntu
Bugs, which is subscribed to Ubuntu.
https://bugs.launchpad.net/bugs/1497428

Title:
  kernel BUG at /build/buildd/linux-3.13.0/mm/page_alloc.c:968

To manage notifications about this bug go to:
https://bugs.launchpad.net/ubuntu/+source/linux/+bug/1497428/+subscriptions

-- 
ubuntu-bugs mailing list
ubuntu-bugs@lists.ubuntu.com
https://lists.ubuntu.com/mailman/listinfo/ubuntu-bugs

[Bug 1497428] Re: kernel BUG at /build/buildd/linux-3.13.0/mm/page_alloc.c:968

2015-12-01 Thread Dan Streetman
For reference, here's a pasted sample of the Oops (taken from Diego's
log above):

[415478.493013] [ cut here ]
[415478.496056] kernel BUG at /build/buildd/linux-3.13.0/mm/page_alloc.c:968!
[415478.496056] invalid opcode:  [#1] SMP 
[415478.496056] Modules linked in: dm_crypt syscopyarea sysfillrect sysimgblt 
fb_sys_fops crct10dif_pclmul crc32_pclmul serio_raw ghash_clmulni_intel isofs 
aesni_intel aes_x86_64 glue_helper lrw gf128mul ablk_helper cryptd floppy 
psmouse ixgbevf
[415478.496056] CPU: 1 PID: 11213 Comm: htop Not tainted 3.13.0-48-generic 
#80-Ubuntu
[415478.496056] Hardware name: Xen HVM domU, BIOS 4.2.amazon 05/06/2015
[415478.496056] task: 880037758000 ti: 8803c9dbe000 task.ti: 
8803c9dbe000
[415478.496056] RIP: 0010:[]  [] 
move_freepages+0x104/0x110
[415478.496056] RSP: 0018:8803c9dbfbd0  EFLAGS: 00010006
[415478.496056] RAX: 8803e08fb000 RBX:  RCX: 
0001
[415478.496056] RDX: ea000f827fc0 RSI: ea000f82 RDI: 
8803e08fbf00
[415478.496056] RBP: 8803c9dbfbd8 R08: 8803e08fbf00 R09: 

[415478.496056] R10:  R11: ea000f820920 R12: 
ea000f820900
[415478.496056] R13: 0001 R14:  R15: 
0014
[415478.496056] FS:  7f18e59b2740() GS:8803e042() 
knlGS:
[415478.496056] CS:  0010 DS:  ES:  CR0: 80050033
[415478.496056] CR2: 7f18e59bb000 CR3: 0001456aa000 CR4: 
001406e0
[415478.496056] Stack:
[415478.496056]  81154793 8803c9dbfc50 8115620b 
8803a9b7f400
[415478.496056]  8801576ba128 8803e08fbff0 0001 
ea000f820920
[415478.496056]  8803e08fbf00 00020001  
0011
[415478.496056] Call Trace:
[415478.496056]  [] ? move_freepages_block+0x73/0x80

-- 
You received this bug notification because you are a member of Ubuntu
Bugs, which is subscribed to Ubuntu.
https://bugs.launchpad.net/bugs/1497428

Title:
  kernel BUG at /build/buildd/linux-3.13.0/mm/page_alloc.c:968

To manage notifications about this bug go to:
https://bugs.launchpad.net/ubuntu/+source/linux/+bug/1497428/+subscriptions

-- 
ubuntu-bugs mailing list
ubuntu-bugs@lists.ubuntu.com
https://lists.ubuntu.com/mailman/listinfo/ubuntu-bugs


[Bug 1497428] Re: kernel BUG at /build/buildd/linux-3.13.0/mm/page_alloc.c:968

2015-12-01 Thread Dan Streetman
Diego, thanks, although the log doesn't provide any new info, and it's
doubtful this is related to hugepages.

-- 
You received this bug notification because you are a member of Ubuntu
Bugs, which is subscribed to Ubuntu.
https://bugs.launchpad.net/bugs/1497428

Title:
  kernel BUG at /build/buildd/linux-3.13.0/mm/page_alloc.c:968

To manage notifications about this bug go to:
https://bugs.launchpad.net/ubuntu/+source/linux/+bug/1497428/+subscriptions

-- 
ubuntu-bugs mailing list
ubuntu-bugs@lists.ubuntu.com
https://lists.ubuntu.com/mailman/listinfo/ubuntu-bugs


[Bug 1497428] Re: kernel BUG at /build/buildd/linux-3.13.0/mm/page_alloc.c:968

2015-11-30 Thread Diego Andres
Also, here's some system config that might have any influence on the crash (in 
particular Transparent Huge Page):
(cannot attach more than one file):

/etc/rc.local:

echo never > /sys/kernel/mm/transparent_hugepage/enabled
exit 0


** Attachment added: "sysctl.conf"
   
https://bugs.launchpad.net/ubuntu/+source/linux/+bug/1497428/+attachment/4527462/+files/sysctl.conf

-- 
You received this bug notification because you are a member of Ubuntu
Bugs, which is subscribed to Ubuntu.
https://bugs.launchpad.net/bugs/1497428

Title:
  kernel BUG at /build/buildd/linux-3.13.0/mm/page_alloc.c:968

To manage notifications about this bug go to:
https://bugs.launchpad.net/ubuntu/+source/linux/+bug/1497428/+subscriptions

-- 
ubuntu-bugs mailing list
ubuntu-bugs@lists.ubuntu.com
https://lists.ubuntu.com/mailman/listinfo/ubuntu-bugs


[Bug 1497428] Re: kernel BUG at /build/buildd/linux-3.13.0/mm/page_alloc.c:968

2015-11-30 Thread Diego Andres
Hi, I recently had the same issue in a AWS EC2 r3.large instance.
In attackment you can find the system log. Hope that helps!

** Attachment added: "AWS System Log"
   
https://bugs.launchpad.net/ubuntu/+source/linux/+bug/1497428/+attachment/4527455/+files/kernel-crash.txt

-- 
You received this bug notification because you are a member of Ubuntu
Bugs, which is subscribed to Ubuntu.
https://bugs.launchpad.net/bugs/1497428

Title:
  kernel BUG at /build/buildd/linux-3.13.0/mm/page_alloc.c:968

To manage notifications about this bug go to:
https://bugs.launchpad.net/ubuntu/+source/linux/+bug/1497428/+subscriptions

-- 
ubuntu-bugs mailing list
ubuntu-bugs@lists.ubuntu.com
https://lists.ubuntu.com/mailman/listinfo/ubuntu-bugs


[Bug 1497428] Re: kernel BUG at /build/buildd/linux-3.13.0/mm/page_alloc.c:968

2015-11-06 Thread Nelson Elhage
Hey,

We're also seeing this issue on a production system, and have been
around 1/week for a while now. We may be able to boot that test kernel
for experimentation purposes – would that still be useful?

-- 
You received this bug notification because you are a member of Ubuntu
Bugs, which is subscribed to Ubuntu.
https://bugs.launchpad.net/bugs/1497428

Title:
  kernel BUG at /build/buildd/linux-3.13.0/mm/page_alloc.c:968

To manage notifications about this bug go to:
https://bugs.launchpad.net/ubuntu/+source/linux/+bug/1497428/+subscriptions

-- 
ubuntu-bugs mailing list
ubuntu-bugs@lists.ubuntu.com
https://lists.ubuntu.com/mailman/listinfo/ubuntu-bugs

[Bug 1497428] Re: kernel BUG at /build/buildd/linux-3.13.0/mm/page_alloc.c:968

2015-11-06 Thread Dan Streetman
Yep it would definitely be useful to see a repro with the test/debug
kernel, thanks!

-- 
You received this bug notification because you are a member of Ubuntu
Bugs, which is subscribed to Ubuntu.
https://bugs.launchpad.net/bugs/1497428

Title:
  kernel BUG at /build/buildd/linux-3.13.0/mm/page_alloc.c:968

To manage notifications about this bug go to:
https://bugs.launchpad.net/ubuntu/+source/linux/+bug/1497428/+subscriptions

-- 
ubuntu-bugs mailing list
ubuntu-bugs@lists.ubuntu.com
https://lists.ubuntu.com/mailman/listinfo/ubuntu-bugs


Re: [Bug 1497428] Re: kernel BUG at /build/buildd/linux-3.13.0/mm/page_alloc.c:968

2015-10-13 Thread dave.muysson
Dan,

  I haven’t tried to directly reproduce the bug, but I have a few ideas.
If I can free up some time I’ll see if I can reproduce it.


Dave Muysson | Cloud Architect
dave.muys...@360pi.com  |​ (613) 562-2525 x 510 
 |​ 360pi.com 


> On Oct 13, 2015, at 9:20 AM, Dan Streetman  
> wrote:
> 
> Hi Dave,
> 
> are you able to reproduce the bug?  The trace by itself isn't terribly 
> helpful, all it really says is the pageblock spans zones, which means 
> move_freepages_block() logic for detecting that failed for some reason.  I 
> have a debug kernel ppa here:
> pad.lv/ppa/ddstreet/lp1497428
> 
> that includes additional debug if the problem happens (it also should
> prevent the BUG()).  If you can use that kernel to trigger this and send
> the resulting debug output it would help very much :-)
> 
> when the problem reproduces, in the system log you should see:
> page_zone(start_page) !=page_zone(end_page)
> 
> and more debug following that.  It should not trigger BUG() though, so
> you may need to check the logs periodically.
> 
> Thanks!
> 
> -- 
> You received this bug notification because you are subscribed to the bug
> report.
> https://bugs.launchpad.net/bugs/1497428
> 
> Title:
>  kernel BUG at /build/buildd/linux-3.13.0/mm/page_alloc.c:968
> 
> Status in linux package in Ubuntu:
>  In Progress
> Status in linux source package in Trusty:
>  In Progress
> 
> Bug description:
>  The kernel triggers a BUG when it finds it is in move_freepages() but
>  the start and end pfns for the move are in different zones.
> 
> To manage notifications about this bug go to:
> https://bugs.launchpad.net/ubuntu/+source/linux/+bug/1497428/+subscriptions

-- 
You received this bug notification because you are a member of Ubuntu
Bugs, which is subscribed to Ubuntu.
https://bugs.launchpad.net/bugs/1497428

Title:
  kernel BUG at /build/buildd/linux-3.13.0/mm/page_alloc.c:968

To manage notifications about this bug go to:
https://bugs.launchpad.net/ubuntu/+source/linux/+bug/1497428/+subscriptions

-- 
ubuntu-bugs mailing list
ubuntu-bugs@lists.ubuntu.com
https://lists.ubuntu.com/mailman/listinfo/ubuntu-bugs

[Bug 1497428] Re: kernel BUG at /build/buildd/linux-3.13.0/mm/page_alloc.c:968

2015-10-13 Thread Dan Streetman
Hi Dave,

are you able to reproduce the bug?  The trace by itself isn't terribly helpful, 
all it really says is the pageblock spans zones, which means 
move_freepages_block() logic for detecting that failed for some reason.  I have 
a debug kernel ppa here:
pad.lv/ppa/ddstreet/lp1497428

that includes additional debug if the problem happens (it also should
prevent the BUG()).  If you can use that kernel to trigger this and send
the resulting debug output it would help very much :-)

when the problem reproduces, in the system log you should see:
page_zone(start_page) !=page_zone(end_page)

and more debug following that.  It should not trigger BUG() though, so
you may need to check the logs periodically.

Thanks!

-- 
You received this bug notification because you are a member of Ubuntu
Bugs, which is subscribed to Ubuntu.
https://bugs.launchpad.net/bugs/1497428

Title:
  kernel BUG at /build/buildd/linux-3.13.0/mm/page_alloc.c:968

To manage notifications about this bug go to:
https://bugs.launchpad.net/ubuntu/+source/linux/+bug/1497428/+subscriptions

-- 
ubuntu-bugs mailing list
ubuntu-bugs@lists.ubuntu.com
https://lists.ubuntu.com/mailman/listinfo/ubuntu-bugs


[Bug 1497428] Re: kernel BUG at /build/buildd/linux-3.13.0/mm/page_alloc.c:968

2015-10-08 Thread dave.muysson
Dan, I have run into this issue 4 times over the past few months, on two
separate servers running 3.13. I captured the kernel trace output of
each occurrence and can post them here if it would help. I have attached
the latest one, but there are 3 others I can provide as well.

Environment:
AWS EC2 Virtual Instance: r3.large
Ubuntu lts-trusty 3.13.0-53-generic (and) 3.13.0-45-generic.


** Attachment added: "ServerB-ubuntu-lts-trusty-3.13.0-53.txt"
   
https://bugs.launchpad.net/ubuntu/+source/linux/+bug/1497428/+attachment/4488905/+files/ServerB-ubuntu-lts-trusty-3.13.0-53.txt

-- 
You received this bug notification because you are a member of Ubuntu
Bugs, which is subscribed to Ubuntu.
https://bugs.launchpad.net/bugs/1497428

Title:
  kernel BUG at /build/buildd/linux-3.13.0/mm/page_alloc.c:968

To manage notifications about this bug go to:
https://bugs.launchpad.net/ubuntu/+source/linux/+bug/1497428/+subscriptions

-- 
ubuntu-bugs mailing list
ubuntu-bugs@lists.ubuntu.com
https://lists.ubuntu.com/mailman/listinfo/ubuntu-bugs


[Bug 1497428] Re: kernel BUG at /build/buildd/linux-3.13.0/mm/page_alloc.c:968

2015-09-22 Thread Christopher M. Penalver
Dan Steetman, ah, never heard of STS so my bad on zapping the tag. Would
it be possible to perform an apport-collect on a reference computer this
is reproducible with?

Otherwise, nobody can really contribute to this given the current level
of detail provided.

-- 
You received this bug notification because you are a member of Ubuntu
Bugs, which is subscribed to Ubuntu.
https://bugs.launchpad.net/bugs/1497428

Title:
  kernel BUG at /build/buildd/linux-3.13.0/mm/page_alloc.c:968

To manage notifications about this bug go to:
https://bugs.launchpad.net/ubuntu/+source/linux/+bug/1497428/+subscriptions

-- 
ubuntu-bugs mailing list
ubuntu-bugs@lists.ubuntu.com
https://lists.ubuntu.com/mailman/listinfo/ubuntu-bugs


[Bug 1497428] Re: kernel BUG at /build/buildd/linux-3.13.0/mm/page_alloc.c:968

2015-09-22 Thread Dan Streetman
No, I can't add any details just yet, I don't have direct access to the
failing system, but I'm working with the reporter to debug it.  This bug
is currently just a placeholder so I can provide a debug ppa,
pad.lv/ppa/ddstreet/lp1497428.  It's okay that nobody else can help
debug yet, because I'm debugging it :-)

When I have more details I can share, I will add them to the bug.  It's
quite possible this only requires a backport to trusty from vivid, but I
just don't know yet.

** Also affects: linux-lts-trusty (Ubuntu)
   Importance: Undecided
   Status: New

** No longer affects: linux-lts-trusty (Ubuntu)

-- 
You received this bug notification because you are a member of Ubuntu
Bugs, which is subscribed to Ubuntu.
https://bugs.launchpad.net/bugs/1497428

Title:
  kernel BUG at /build/buildd/linux-3.13.0/mm/page_alloc.c:968

To manage notifications about this bug go to:
https://bugs.launchpad.net/ubuntu/+source/linux/+bug/1497428/+subscriptions

-- 
ubuntu-bugs mailing list
ubuntu-bugs@lists.ubuntu.com
https://lists.ubuntu.com/mailman/listinfo/ubuntu-bugs


[Bug 1497428] Re: kernel BUG at /build/buildd/linux-3.13.0/mm/page_alloc.c:968

2015-09-22 Thread Louis Bouchard
** Also affects: linux (Ubuntu Trusty)
   Importance: Undecided
   Status: New

-- 
You received this bug notification because you are a member of Ubuntu
Bugs, which is subscribed to Ubuntu.
https://bugs.launchpad.net/bugs/1497428

Title:
  kernel BUG at /build/buildd/linux-3.13.0/mm/page_alloc.c:968

To manage notifications about this bug go to:
https://bugs.launchpad.net/ubuntu/+source/linux/+bug/1497428/+subscriptions

-- 
ubuntu-bugs mailing list
ubuntu-bugs@lists.ubuntu.com
https://lists.ubuntu.com/mailman/listinfo/ubuntu-bugs


[Bug 1497428] Re: kernel BUG at /build/buildd/linux-3.13.0/mm/page_alloc.c:968

2015-09-22 Thread Dan Streetman
** Changed in: linux (Ubuntu Trusty)
 Assignee: (unassigned) => Dan Streetman (ddstreet)

** Changed in: linux (Ubuntu Trusty)
   Status: New => In Progress

-- 
You received this bug notification because you are a member of Ubuntu
Bugs, which is subscribed to Ubuntu.
https://bugs.launchpad.net/bugs/1497428

Title:
  kernel BUG at /build/buildd/linux-3.13.0/mm/page_alloc.c:968

To manage notifications about this bug go to:
https://bugs.launchpad.net/ubuntu/+source/linux/+bug/1497428/+subscriptions

-- 
ubuntu-bugs mailing list
ubuntu-bugs@lists.ubuntu.com
https://lists.ubuntu.com/mailman/listinfo/ubuntu-bugs


[Bug 1497428] Re: kernel BUG at /build/buildd/linux-3.13.0/mm/page_alloc.c:968

2015-09-21 Thread Christopher M. Penalver
** Tags removed: sts
** Tags added: needs-apport-collect

-- 
You received this bug notification because you are a member of Ubuntu
Bugs, which is subscribed to Ubuntu.
https://bugs.launchpad.net/bugs/1497428

Title:
  kernel BUG at /build/buildd/linux-3.13.0/mm/page_alloc.c:968

To manage notifications about this bug go to:
https://bugs.launchpad.net/ubuntu/+source/linux/+bug/1497428/+subscriptions

-- 
ubuntu-bugs mailing list
ubuntu-bugs@lists.ubuntu.com
https://lists.ubuntu.com/mailman/listinfo/ubuntu-bugs


[Bug 1497428] Re: kernel BUG at /build/buildd/linux-3.13.0/mm/page_alloc.c:968

2015-09-21 Thread Dan Streetman
Chris, this bug is for a Canonical STS issue I'm debugging.  I'll add
more details as I get them.

** Tags removed: needs-apport-collect
** Tags added: sts

-- 
You received this bug notification because you are a member of Ubuntu
Bugs, which is subscribed to Ubuntu.
https://bugs.launchpad.net/bugs/1497428

Title:
  kernel BUG at /build/buildd/linux-3.13.0/mm/page_alloc.c:968

To manage notifications about this bug go to:
https://bugs.launchpad.net/ubuntu/+source/linux/+bug/1497428/+subscriptions

-- 
ubuntu-bugs mailing list
ubuntu-bugs@lists.ubuntu.com
https://lists.ubuntu.com/mailman/listinfo/ubuntu-bugs


[Bug 1497428] Re: kernel BUG at /build/buildd/linux-3.13.0/mm/page_alloc.c:968

2015-09-21 Thread Dan Streetman
** Changed in: linux (Ubuntu)
   Status: Incomplete => In Progress

** Changed in: linux (Ubuntu)
 Assignee: (unassigned) => Dan Streetman (ddstreet)

-- 
You received this bug notification because you are a member of Ubuntu
Bugs, which is subscribed to Ubuntu.
https://bugs.launchpad.net/bugs/1497428

Title:
  kernel BUG at /build/buildd/linux-3.13.0/mm/page_alloc.c:968

To manage notifications about this bug go to:
https://bugs.launchpad.net/ubuntu/+source/linux/+bug/1497428/+subscriptions

-- 
ubuntu-bugs mailing list
ubuntu-bugs@lists.ubuntu.com
https://lists.ubuntu.com/mailman/listinfo/ubuntu-bugs


[Bug 1497428] Re: kernel BUG at /build/buildd/linux-3.13.0/mm/page_alloc.c:968

2015-09-19 Thread Christopher M. Penalver
** Changed in: linux (Ubuntu)
   Importance: Undecided => Low

** Changed in: linux (Ubuntu)
 Assignee: Dan Streetman (ddstreet) => (unassigned)

-- 
You received this bug notification because you are a member of Ubuntu
Bugs, which is subscribed to Ubuntu.
https://bugs.launchpad.net/bugs/1497428

Title:
  kernel BUG at /build/buildd/linux-3.13.0/mm/page_alloc.c:968

To manage notifications about this bug go to:
https://bugs.launchpad.net/ubuntu/+source/linux/+bug/1497428/+subscriptions

-- 
ubuntu-bugs mailing list
ubuntu-bugs@lists.ubuntu.com
https://lists.ubuntu.com/mailman/listinfo/ubuntu-bugs


[Bug 1497428] Re: kernel BUG at /build/buildd/linux-3.13.0/mm/page_alloc.c:968

2015-09-18 Thread Dan Streetman
** Tags added: trusty

-- 
You received this bug notification because you are a member of Ubuntu
Bugs, which is subscribed to Ubuntu.
https://bugs.launchpad.net/bugs/1497428

Title:
  kernel BUG at /build/buildd/linux-3.13.0/mm/page_alloc.c:968

To manage notifications about this bug go to:
https://bugs.launchpad.net/ubuntu/+source/linux/+bug/1497428/+subscriptions

-- 
ubuntu-bugs mailing list
ubuntu-bugs@lists.ubuntu.com
https://lists.ubuntu.com/mailman/listinfo/ubuntu-bugs