Re: [ceph-users] slow ops for mon slowly increasing

2019-09-20 Thread Kevin Olbrich
OK, looks like clock skew is the problem. I thought this is caused by the reboot but it did not fix itself after some minutes (mon3 was 6 seconds ahead). After forcing time sync from the same server, it seems to be solved now. Kevin Am Fr., 20. Sept. 2019 um 07:33 Uhr schrieb Kevin Olbrich : >

[ceph-users] slow ops for mon slowly increasing

2019-09-19 Thread Kevin Olbrich
Hi! Today some OSDs went down, a temporary problem that was solved easily. The mimic cluster is working and all OSDs are complete, all active+clean. Completely new for me is this: > 25 slow ops, oldest one blocked for 219 sec, mon.mon03 has slow ops The cluster itself looks fine, monitoring for

Re: [ceph-users] Slow OPS

2019-03-21 Thread Brad Hubbard
21 16:51:56.862447", > "age": 376.527241, > "duration": 1.331278, > > Kind regards, > Glen Baars > > -Original Message- > From: Brad Hubbard > Sent: Thursday, 21 March 2019 1:43 PM > To: Glen Baars > Cc: cep

Re: [ceph-users] Slow OPS

2019-03-21 Thread Glen Baars
ursday, 21 March 2019 1:43 PM To: Glen Baars Cc: ceph-users@lists.ceph.com Subject: Re: [ceph-users] Slow OPS Actually, the lag is between "sub_op_committed" and "commit_sent". Is there any pattern to these slow requests? Do they involve the same osd, or set of osds? On Thu, Mar 21, 201

Re: [ceph-users] Slow OPS

2019-03-20 Thread Brad Hubbard
commit > message back to the client? Are these slow ops always the same client? > > > > > Kind regards, > > Glen Baars > > > > -Original Message- > > From: Brad Hubbard > > Sent: Thursday, 21 March 2019 8:23 AM > > To: Glen Baars &g

Re: [ceph-users] Slow OPS

2019-03-20 Thread Brad Hubbard
ow ops always the same client? > > Kind regards, > Glen Baars > > -Original Message----- > From: Brad Hubbard > Sent: Thursday, 21 March 2019 8:23 AM > To: Glen Baars > Cc: ceph-users@lists.ceph.com > Subject: Re: [ceph-users] Slow OPS > > On Thu, Mar 2

Re: [ceph-users] Slow OPS

2019-03-20 Thread Glen Baars
ad Hubbard Sent: Thursday, 21 March 2019 8:23 AM To: Glen Baars Cc: ceph-users@lists.ceph.com Subject: Re: [ceph-users] Slow OPS On Thu, Mar 21, 2019 at 12:11 AM Glen Baars wrote: > > Hello Ceph Users, > > > > Does anyone know what the flag point ‘Started’ is? Is that ceph osd daemon

Re: [ceph-users] Slow OPS

2019-03-20 Thread Brad Hubbard
On Thu, Mar 21, 2019 at 12:11 AM Glen Baars wrote: > > Hello Ceph Users, > > > > Does anyone know what the flag point ‘Started’ is? Is that ceph osd daemon > waiting on the disk subsystem? This is set by "mark_started()" and is roughly set when the pg starts processing the op. Might want to

[ceph-users] Slow OPS

2019-03-20 Thread Glen Baars
Hello Ceph Users, Does anyone know what the flag point 'Started' is? Is that ceph osd daemon waiting on the disk subsystem? Ceph 13.2.4 on centos 7.5 "description": "osd_op(client.1411875.0:422573570 5.18ds0 5:b1ed18e5:::rbd_data.6.cf7f46b8b4567.0046e41a:head [read

Re: [ceph-users] slow ops after cephfs snapshot removal

2018-11-09 Thread Chris Taylor
> On Nov 9, 2018, at 1:38 PM, Gregory Farnum wrote: > >> On Fri, Nov 9, 2018 at 2:24 AM Kenneth Waegeman >> wrote: >> Hi all, >> >> On Mimic 13.2.1, we are seeing blocked ops on cephfs after removing some >> snapshots: >> >> [root@osd001 ~]# ceph -s >>cluster: >> id:

Re: [ceph-users] slow ops after cephfs snapshot removal

2018-11-09 Thread Gregory Farnum
On Fri, Nov 9, 2018 at 2:24 AM Kenneth Waegeman wrote: > Hi all, > > On Mimic 13.2.1, we are seeing blocked ops on cephfs after removing some > snapshots: > > [root@osd001 ~]# ceph -s >cluster: > id: 92bfcf0a-1d39-43b3-b60f-44f01b630e47 > health: HEALTH_WARN > 5

[ceph-users] slow ops after cephfs snapshot removal

2018-11-09 Thread Kenneth Waegeman
Hi all, On Mimic 13.2.1, we are seeing blocked ops on cephfs after removing some snapshots: [root@osd001 ~]# ceph -s   cluster:     id: 92bfcf0a-1d39-43b3-b60f-44f01b630e47     health: HEALTH_WARN     5 slow ops, oldest one blocked for 1162 sec, mon.mds03 has slow ops