Re: [ceph-users] OSD Marked down unable to restart continuously failing

2020-01-11 Thread Eugen Block
Hi, you say the daemons are locally up and running but restarting fails? Which one is it? Do you see any messages suggesting flapping OSDs? After 5 retries within 10 minutes the OSDs would be marked out. What is the result of your checks for iostat etc.? Anything pointing to a high load on

Re: [ceph-users] OSD Marked down unable to restart continuously failing

2020-01-10 Thread Radhakrishnan2 S
Can someone please help to respond to the below query ? Regards Radha Krishnan S TCS Enterprise Cloud Practice Tata Consultancy Services Cell:- +1 848 466 4870 Mailto: radhakrishnan...@tcs.com Website: http://www.tcs.com Experience certainty. IT

[ceph-users] OSD Marked down unable to restart continuously failing

2020-01-09 Thread Radhakrishnan2 S
Hello Everyone, One of the OSD node out of 16 has 12 OSD's with a bcache as NVMe, locally those osd daemons seem to be up and running, while the ceph osd tree shows them as down. Logs show that OSD's have struck IO for over 4096 sec. I tried checking for iostat, netstat, ceph -w along with