I have not yet found a way to reproduce the issue systematically. It does not appear to be load related, as nodes with lower load crash more often than those with high load.
However, I can confirm that the fix released with the Ubuntu 4.4.0-135 kernel does not fix the issue. This morning (17/10/28) we hit the issue again on a node running kernel 4.4.0-1069-aws, which already includes the patch. The AWS kernels have included the patch released in the Ubuntu 4.4.0-135 kernel since 4.4.0-1067-aws.

--
https://bugs.launchpad.net/bugs/1788035
Title: nvme: avoid cqe corruption