Hi,

Quoting Stefan Kooman (ste...@bit.nl):
> Hi,
> 
> TL;DR: we see "used" memory grow indefinitely on our OSD servers,
> until either 1) an OSD process gets killed by the OOM killer,
> or 2) an OSD aborts (probably because malloc cannot provide more RAM). I
> suspect a memory leak in the OSDs.

I got quite a bit of feedback on this thread, thanks for that! I'm pretty
sure we were not hit by a Ceph memory leak but by a leak in the Intel i40e
driver, specifically in Linux kernel 4.13 (Ubuntu Xenial HWE), see [1].

Running a 4.13 kernel with an Intel X710? You will definitely want to update
to 4.13.0-38, where this issue is fixed.
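
For anyone who wants to check a host quickly, here is a minimal sketch in
Python. It assumes the Ubuntu-style release string that "uname -r" prints
(for example "4.13.0-36-generic") and treats every 4.13.0 build with an ABI
number below 38 as needing the update. That cut-off is only my reading of
the fix version above, not a list of affected builds from the bug report.
The helper names (parse_release, needs_update) are made up for this example,
and the script does not check whether the i40e driver is actually loaded.

```python
import platform
import re

# Assumption taken from the post: the fix ships in 4.13.0-38.
FIXED_ABI = 38


def parse_release(release):
    """Split a release such as '4.13.0-36-generic' into ((4, 13, 0), 36).

    Returns None when the string does not have that shape.
    """
    m = re.match(r"^(\d+)\.(\d+)\.(\d+)-(\d+)", release)
    if m is None:
        return None
    major, minor, patch, abi = (int(g) for g in m.groups())
    return (major, minor, patch), abi


def needs_update(release):
    """True only for 4.13.0 kernels whose ABI number is below FIXED_ABI."""
    parsed = parse_release(release)
    if parsed is None:
        return False
    version, abi = parsed
    return version == (4, 13, 0) and abi < FIXED_ABI


if __name__ == "__main__":
    rel = platform.release()  # same string as `uname -r` on Linux
    if needs_update(rel):
        print(rel, "-> 4.13 build older than 4.13.0-38, update recommended")
    else:
        print(rel, "-> not a pre-fix 4.13 build according to this check")
```

Kernels outside the 4.13.0 series are simply reported as "not a pre-fix 4.13
build"; the script makes no claim about whether they are affected.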

We have been running this kernel for a week or so now and memory is "under
control". Now it's time to crank up the bluestore cache again :-).

FYI.

Gr. Stefan

[1]: https://bugs.launchpad.net/ubuntu/+source/linux/+bug/1748408




-- 
| BIT BV  http://www.bit.nl/        Kamer van Koophandel 09090351
| GPG: 0xD14839C6                   +31 318 648 688 / i...@bit.nl