[ceph-users] Re: mgr daemons becoming unresponsive

2019-11-06 Thread Thomas Schneider
Hi, can you please advise which package(s) should be installed? Thanks Am 06.11.2019 um 22:28 schrieb Sage Weil: > My current working theory is that the mgr is getting hung up when it tries > to scrape the device metrics from the mon. The 'tell' mechanism used to > send mon-targetted

[ceph-users] Fwd: Broken: caps osd = "profile rbd-read-only"

2019-11-06 Thread Markus Kienast
In Nautilus (Ubuntu Cloud Archive Train Version) the osd caps profile rbd-read-only seems broken. It is impossible to map a RBD if the user has the following caps: [client.yyy] key = AQBYL8NdHDpnERAAhk8XOKgFNwhUpCo3EMaW3g== caps mgr = "profile rbd" caps mon = "profile

[ceph-users] Re: mds crash loop

2019-11-06 Thread Karsten Nielsen
-Original message- From: Yan, Zheng Sent: Wed 06-11-2019 14:16 Subject:Re: [ceph-users] mds crash loop To: Karsten Nielsen ; CC: ceph-users@ceph.io; > On Wed, Nov 6, 2019 at 4:42 PM Karsten Nielsen wrote: > > > > -Original message- > > From: Yan, Zheng >

[ceph-users] Re: mds crash loop

2019-11-06 Thread Karsten Nielsen
-Original message- From: Yan, Zheng Sent: Wed 06-11-2019 14:16 Subject:Re: [ceph-users] mds crash loop To: Karsten Nielsen ; CC: ceph-users@ceph.io; > On Wed, Nov 6, 2019 at 4:42 PM Karsten Nielsen wrote: > > > > -Original message- > > From: Yan, Zheng >

[ceph-users] Re: mgr daemons becoming unresponsive

2019-11-06 Thread Sage Weil
My current working theory is that the mgr is getting hung up when it tries to scrape the device metrics from the mon. The 'tell' mechanism used to send mon-targetted commands is pretty kludgey/broken in nautilus and earlier. It's been rewritten for octopus, but isn't worth backporting--it

[ceph-users] Re: Slow write speed on 3-node cluster with 6* SATA Harddisks (~ 3.5 MB/s)

2019-11-06 Thread Paul Emmerich
On Wed, Nov 6, 2019 at 5:57 PM Hermann Himmelbauer wrote: > > Dear Vitaliy, dear Paul, > > Changing the block size for "dd" makes a huge difference. > > However, still some things are not fully clear to me: > > As recommended, I tried writing / reading directly to the rbd and this > is blazingly

[ceph-users] Re: Slow write speed on 3-node cluster with 6* SATA Harddisks (~ 3.5 MB/s)

2019-11-06 Thread Hermann Himmelbauer
Dear Vitaliy, dear Paul, Changing the block size for "dd" makes a huge difference. However, still some things are not fully clear to me: As recommended, I tried writing / reading directly to the rbd and this is blazingly fast: fio -ioengine=rbd -name=test -direct=1 -rw=read -bs=4M -iodepth=16

[ceph-users] Re: mgr daemons becoming unresponsive

2019-11-06 Thread Thomas Schneider
Well, even after restarting the MGR service the relevant log is spoiled with this error messages: 2019-11-06 17:46:22.363 7f81ffdcc700  0 auth: could not find secret_id=3865 2019-11-06 17:46:22.363 7f81ffdcc700  0 cephx: verify_authorizer could not get service secret for service mgr secret_id=3865

[ceph-users] Re: mgr daemons becoming unresponsive

2019-11-06 Thread Thomas Schneider
Hi, does anybody get this error messages in MGR log? 2019-11-06 15:41:44.765 7f10db740700  0 auth: could not find secret_id=3863 2019-11-06 15:41:44.765 7f10db740700  0 cephx: verify_authorizer could not get service secret for service mgr secret_id=3863 THX Am 06.11.2019 um 10:43 schrieb

[ceph-users] Re: mds crash loop

2019-11-06 Thread Yan, Zheng
On Wed, Nov 6, 2019 at 4:42 PM Karsten Nielsen wrote: > > -Original message- > From: Yan, Zheng > Sent: Wed 06-11-2019 08:15 > Subject:Re: [ceph-users] mds crash loop > To: Karsten Nielsen ; > CC: ceph-users@ceph.io; > > On Tue, Nov 5, 2019 at 5:29 PM Karsten Nielsen

[ceph-users] Re: mgr daemons becoming unresponsive

2019-11-06 Thread thoralf schulze
hi oliver, On 11/6/19 10:43 AM, Oliver Freyermuth wrote: […] > Did somebody see something similar after running for a week or more with > Nautilus on old and slow hardware? yes, same here: significantly more mgr failovers / compaction jobs with nautilus than with mimic … most likely due to pgs

[ceph-users] Re: mds crash loop

2019-11-06 Thread Karsten Nielsen
-Original message- From: Yan, Zheng Sent: Wed 06-11-2019 08:15 Subject:Re: [ceph-users] mds crash loop To: Karsten Nielsen ; CC: ceph-users@ceph.io; > On Tue, Nov 5, 2019 at 5:29 PM Karsten Nielsen wrote: > > > > Hi, > > > > Last week I upgraded my ceph cluster from