Re: [ceph-users] Node failure -- corrupt memory

2019-11-15 Thread Wido den Hollander
On 11/11/19 2:00 PM, Shawn Iverson wrote:
> Hello Cephers!
>
> I had a node over the weekend go nuts from what appears to have been
> failed/bad memory modules and/or motherboard.
>
> This resulted in several OSDs blocking IO for > 128s (indefinitely).
>
> I was not watching my alerts too

[ceph-users] Node failure -- corrupt memory

2019-11-11 Thread Shawn Iverson
Hello Cephers!

I had a node over the weekend go nuts from what appears to have been failed/bad memory modules and/or motherboard.

This resulted in several OSDs blocking IO for > 128s (indefinitely).

I was not watching my alerts too closely over the weekend, or else I may have caught it early.