Jonas,
I've seen this happening on a weekly basis when I was running 0.61 branch as well, however after switching to 0.67 branch it has stopped. Perhaps you should try upgrading Andrei ----- Original Message ----- From: "Jonas Rottmann (centron GmbH)" <[email protected]> To: "[email protected]" <[email protected]> Sent: Saturday, 28 December, 2013 9:48:12 AM Subject: [ceph-users] One OSD always dieing Hi, One of my OSDs are dieing all the time. I rebooted one after one every node and assured that all has the same kernel version and glibc. I’m using ceph version 0.61.9 (7440dcd135750839fa0f00263f80722ff6f51e90). Dmesg only shows: [ 5745.366041] init: ceph-osd (ceph/3) main process (2510) killed by ABRT signal [ 5745.366235] init: ceph-osd (ceph/3) main process ended, respawning [ 5763.824298] init: ceph-osd (ceph/3) main process (2991) killed by SEGV signal Basically every time this shows up in the logs: 2013-12-28 06:35:08.489431 7fc9eccd5700 -1 osd/ReplicatedPG.cc: In function 'ReplicatedPG::RepGather* ReplicatedPG::trim_object(const hobject_t&)' thread 7fc9eccd5700 time 2013-12-28 06:35:08.487862 osd/ReplicatedPG.cc: 1379: FAILED assert(0) If you need more infos I will send them. Please help ! The whole cluster isn’t working proberbly because of this… _______________________________________________ ceph-users mailing list [email protected] http://lists.ceph.com/listinfo.cgi/ceph-users-ceph.com
_______________________________________________ ceph-users mailing list [email protected] http://lists.ceph.com/listinfo.cgi/ceph-users-ceph.com
