Re: [ceph-users] How to avoid deep-scrubbing performance hit?

2014-10-02 Thread Mark Kirkwood
We are also becoming interested in understanding and taming the impact of deep scrubbing. We may start running something similar to the cron tasks mentioned. Looking at these fine examples of bash + awk I wondered if I could do the job using the python rados api. I have attached my initial (un

Re: [ceph-users] How to avoid deep-scrubbing performance hit?

2014-06-11 Thread Dan Van Der Ster
On 10 Jun 2014, at 11:59, Dan Van Der Ster wrote: > One idea I had was to check the behaviour under different disk io schedulers, > trying exploit thread io priorities with cfq. So I have a question for the > developers about using ionice or ioprio_set to lower the IO priorities of the > threa

Re: [ceph-users] How to avoid deep-scrubbing performance hit?

2014-06-10 Thread Craig Lewis
After doing this, I've found that I'm having problems with a few specific PGs. If I set nodeep-scrub, then manually deep-scrub one specific PG, the responsible OSDs get kicked out. I'm starting a new discussion, subject: "I have PGs that I can't deep-scrub" I'll re-test this correlation after I

Re: [ceph-users] How to avoid deep-scrubbing performance hit?

2014-06-10 Thread Dan Van Der Ster
Hi, I’m just starting to get interested in this topic, since today we’ve found that a weekly peak in latency correlates with a bulk (~30) of deep scrubbing PGs. One idea I had was to check the behaviour under different disk io schedulers, trying exploit thread io priorities with cfq. So I have a

Re: [ceph-users] How to avoid deep-scrubbing performance hit?

2014-06-09 Thread Gregory Farnum
On Mon, Jun 9, 2014 at 6:42 PM, Mike Dawson wrote: > Craig, > > I've struggled with the same issue for quite a while. If your i/o is similar > to mine, I believe you are on the right track. For the past month or so, I > have been running this cronjob: > > * * * * * for strPg in `ceph pg dump

Re: [ceph-users] How to avoid deep-scrubbing performance hit?

2014-06-09 Thread Mike Dawson
Craig, I've struggled with the same issue for quite a while. If your i/o is similar to mine, I believe you are on the right track. For the past month or so, I have been running this cronjob: * * * * * for strPg in `ceph pg dump | egrep '^[0-9]\.[0-9a-f]{1,4}' | sort -k20 | awk '{ print

Re: [ceph-users] How to avoid deep-scrubbing performance hit?

2014-06-09 Thread Gregory Farnum
On Mon, Jun 9, 2014 at 3:22 PM, Craig Lewis wrote: > I've correlated a large deep scrubbing operation to cluster stability > problems. > > My primary cluster does a small amount of deep scrubs all the time, spread > out over the whole week. It has no stability problems. > > My secondary cluster d

[ceph-users] How to avoid deep-scrubbing performance hit?

2014-06-09 Thread Craig Lewis
I've correlated a large deep scrubbing operation to cluster stability problems. My primary cluster does a small amount of deep scrubs all the time, spread out over the whole week. It has no stability problems. My secondary cluster doesn't spread them out. It saves them up, and tries to do all o