We are also becoming interested in understanding and taming the impact
of deep scrubbing.
We may start running something similar to the cron tasks mentioned.
Looking at these fine examples of bash + awk I wondered if I could do
the job using the python rados api. I have attached my initial (un
On 10 Jun 2014, at 11:59, Dan Van Der Ster wrote:
> One idea I had was to check the behaviour under different disk io schedulers,
> trying exploit thread io priorities with cfq. So I have a question for the
> developers about using ionice or ioprio_set to lower the IO priorities of the
> threa
After doing this, I've found that I'm having problems with a few
specific PGs. If I set nodeep-scrub, then manually deep-scrub one
specific PG, the responsible OSDs get kicked out. I'm starting a new
discussion, subject: "I have PGs that I can't deep-scrub"
I'll re-test this correlation after I
Hi,
I’m just starting to get interested in this topic, since today we’ve found that
a weekly peak in latency correlates with a bulk (~30) of deep scrubbing PGs.
One idea I had was to check the behaviour under different disk io schedulers,
trying exploit thread io priorities with cfq. So I have a
On Mon, Jun 9, 2014 at 6:42 PM, Mike Dawson wrote:
> Craig,
>
> I've struggled with the same issue for quite a while. If your i/o is similar
> to mine, I believe you are on the right track. For the past month or so, I
> have been running this cronjob:
>
> * * * * * for strPg in `ceph pg dump
Craig,
I've struggled with the same issue for quite a while. If your i/o is
similar to mine, I believe you are on the right track. For the past
month or so, I have been running this cronjob:
* * * * * for strPg in `ceph pg dump | egrep
'^[0-9]\.[0-9a-f]{1,4}' | sort -k20 | awk '{ print
On Mon, Jun 9, 2014 at 3:22 PM, Craig Lewis wrote:
> I've correlated a large deep scrubbing operation to cluster stability
> problems.
>
> My primary cluster does a small amount of deep scrubs all the time, spread
> out over the whole week. It has no stability problems.
>
> My secondary cluster d
I've correlated a large deep scrubbing operation to cluster stability
problems.
My primary cluster does a small amount of deep scrubs all the time, spread
out over the whole week. It has no stability problems.
My secondary cluster doesn't spread them out. It saves them up, and tries
to do all o