** Tags added: cscc
--
You received this bug notification because you are a member of Ubuntu
Bugs, which is subscribed to Ubuntu.
https://bugs.launchpad.net/bugs/1568729
Title:
divide error: [#1] SMP in task_numa_migrate - handle_mm_fault
To manage notifications about this bug go to:
** Changed in: linux (Ubuntu)
Status: In Progress => Fix Released
--
** Information type changed from Public to Public Security
--
The new Xenial/16.04 kernel should now be in updates.
** Changed in: linux (Ubuntu Xenial)
Status: Fix Committed => Fix Released
--
So jujud exercises the same code path as ceph-osd? Curious.
--
Just ran into this issue on Ubuntu 16.04:
Aug 21 02:49:14 doddering-fransisca kernel: [2532635.918673] divide error:
[#1] SMP
Aug 21 02:49:14 doddering-fransisca kernel: [2532635.935386] Modules linked in:
bridge stp llc bonding nls_iso8859_1 ipmi_ssif ipmi_devintf dcdbas intel_rapl
The required change will be in the 4.4.0-36 kernel (which includes the
4.4.0-35 updates). It is currently in -proposed, waiting for verification
and regression testing to finish. If no problems are found, it is
supposed to be released by Aug 29th.
Had the same divide error and a hanging server myself. Ubuntu 16.04.1
LTS with "linux-image-4.4.0-31-generic". Now running
"linux-image-4.4.0-34-generic".
--
The -31 kernel from Stefan's PPA with the patch has been running stably
for 2 weeks, and the previous version based on -29 ran for 3 weeks. So
once that patch makes its way into the Ubuntu kernel via the 4.4.16 stable
series update, the bug can be closed from my side.
--
The fixes mentioned are also part of the 4.4.16 stable series, which will land
in the next kernel cycle (around August 29th). That should be 4.4.0-35 (or
higher) in Ubuntu kernel versions.
See: https://bugs.launchpad.net/ubuntu/+source/linux/+bug/1607404
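As a rough illustration of the version threshold mentioned above, the following sketch checks whether an Ubuntu 4.4.0 kernel release string is at or above 4.4.0-35. The helper names (parse_release, has_fix) and the FIXED_ABI constant are hypothetical, and the threshold is taken only from the comment above, not from a changelog. Kernels outside the 4.4.0 series return None, because this thread gives no reliable threshold for them.

```python
import re

# Assumption: per the comment above, the fix is expected in 4.4.0-35 or higher.
FIXED_ABI = (4, 4, 0, 35)


def parse_release(release):
    """Turn a release string such as '4.4.0-31-generic' into (4, 4, 0, 31)."""
    match = re.match(r"(\d+)\.(\d+)\.(\d+)-(\d+)", release)
    if match is None:
        raise ValueError("unrecognized kernel release: %r" % release)
    return tuple(int(group) for group in match.groups())


def has_fix(release):
    """Return True/False for Ubuntu 4.4.0 kernels, None for anything else."""
    version = parse_release(release)
    if version[:3] != FIXED_ABI[:3]:
        # Not a 4.4.0-based Ubuntu kernel; the threshold does not apply.
        return None
    return version >= FIXED_ABI
```

On a running system the input would typically come from `uname -r` or Python's platform.release(), for example has_fix(platform.release()).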
--
This just happened two days ago on Trusty 14.04.4 with Linux 4.4.0-31 on one
of our Ceph Jewel OSD servers. It ran fine for 8 days, then the CPU load
suddenly spiked to 600.
The server is a SuperMicro SuperStorage Server SSG-6048R-E1CR36L with the
following specs:
2x Intel Xeon E5-2630 v3 @
But that is not the -31 from the PPA. Could you please try the kernel
from the PPA that Stefan has posted?
--
Same issue with 4.4.0-31-generic, stack trace:
[443830.036000] divide error: [#1] SMP
[443830.036583] Modules linked in: nf_conntrack_netlink xt_multiport xt_CT
xt_mac xt_physdev xt_set ip_set_hash_net ip_set nfnetlink vhost_net vhost
macvtap macvlan xt_REDIRECT nf_nat_redirect xt_mark
I have been running the -29 kernel from
http://people.canonical.com/~smb/lp1568729/ for about two weeks now, so
far without triggering the bug. With the kernel from Tim, the bug could
still be triggered. So it seems there were different patches.
I'll try the -33 kernel from the PPA as well, to see if
Added a linux-lts-xenial variant to the PPA.
--
We had something similar-looking again, filed as Bug #1606098
--
Stefan,
Can you add a Trusty (Xenial HWE) version of this to the PPA? I'm trying to
rebuild this as part of a Trusty PPA and am getting a failure.
--
** Tags added: patch
--
For reference, this is the one patch picked from upstream stable queue
for 4.4.
** Patch added: "sched-fair-fix-cfs_rq-avg-tracking-underflow.patch"
The whole discussion seems to be going back and forth and is rather confusing.
On one side there is the ceph discussion where Stefan Priebe indeed mentions
that he has many more patches in his tree. On the other side there is the LKML
discussion which ends in GregKH getting exactly one patch
According to this message in the ceph thread those three patches are not
sufficient:
https://www.mail-archive.com/ceph-users@lists.ceph.com/msg30390.html
In a follow-up, Stefan Priebe mentions he has about 20 other patches
applied, and that must have contributed to having the problem solved on
A new set of test kernel packages can be found at
http://people.canonical.com/~smb/lp1568729/ (those include backports of
the three patches mentioned above). Could someone check whether that
helps? Thanks.
** Changed in: linux (Ubuntu Xenial)
Importance: Undecided => High
** Changed in:
Not sure which patch(es) Tim had in the test kernel. Following the
various leads from the thread on the ceph mailing list from comment #9,
the three patches to pick might be:
2b8c41d sched/fair: Initiate a new task's util avg to a bounded value
b7fa30c sched/fair: Fix
On a system running 4.4.0-28-generic I get something similar-looking:
foonode kernel: [595908.569972] divide error: [#1] SMP
foonode kernel: [595908.571257] Modules linked in: ip6table_raw ip6table_mangle
nf_conntrack_ipv6 xt_CT xt_connmark xt_mac xt_comment xt_physdev br_netfilter
xt_set
There is one person on the Ceph mailing list who thinks this is fixed in
4.7-rc6
(http://thread.gmane.org/gmane.comp.file-systems.ceph.user/30793/focus=30987).
Unfortunately, I haven't been able to figure out a precise patch set that
can be applied to 4.4 to fix it.
--
dmesg -T
[Fri Jun 3 01:07:11 2016] divide error: [#1] SMP
[Fri Jun 3 01:07:11 2016] Modules linked in: iptable_nat nf_conntrack_ipv4
nf_defrag_ipv4 nf_nat_ipv4 nf_nat 8021q garp mrp binfmt_misc veth vhost_net
vhost macvtap macvlan ebtable_filter ebtables ip6table_filter ip6_tables
And again. This time with upstream kernel (linux-image-4.5.1-040501-generic):
[Fri Apr 15 13:26:56 2016] divide error: [#1] SMP
[Fri Apr 15 13:26:56 2016] Modules linked in: vhost_net vhost macvtap macvlan
ip6table_mangle nfnetlink_queue nfnetlink xt_CLASSIFY xt_CHECKSUM xt_nat
iptable_nat
Here is another call trace:
[Thu Apr 14 13:53:29 2016] divide error: [#1] SMP
[Thu Apr 14 13:53:29 2016] Modules linked in: cpuid arc4 md4 nls_utf8 cifs
vhost_net vhost macvtap macvlan nfnetlink_queue nfnetlink xt_CHECKSUM xt_nat
iptable_nat nf_nat_ipv4 xt_NFQUEUE xt_CLASSIFY ip6table_mangle
Unfortunately, the issue is still present.
Apr 14 00:34:43 cnode17 kernel: [204922.475156] divide error: [#1] SMP
Apr 14 00:34:43 cnode17 kernel: [204922.475185] Modules linked in: cpuid arc4
md4 nls_utf8 cifs vhost_net vhost macvtap macvlan nfnetlink_queue nfnetlink
xt_CHECKSUM xt_nat
Thanks. Will test and report back in a few days.
--
Please try the test kernel at
http://people.canonical.com/~rtg/4.4.0-fair-sched/
wget http://people.canonical.com/~rtg/4.4.0-fair-sched/linux-image-4.4.0-19-generic_4.4.0-19.35_amd64.deb
wget
** Also affects: linux (Ubuntu Xenial)
Importance: Undecided
Status: Confirmed
** Changed in: linux (Ubuntu Xenial)
Status: Confirmed => In Progress
** Changed in: linux (Ubuntu Xenial)
Assignee: (unassigned) => Tim Gardner (timg-tpi)
--
Encountered the same issue on some machines while running QEMU 2.5 on the
lts-xenial kernel in Trusty. The machine died with nearly the same
call trace as above and a very high load. Downgrading to the latest wily
kernel fixed the issue.
--
** Changed in: linux (Ubuntu)
Status: Incomplete => Confirmed
--