On Fri, 7 Mar 2014, Li Wang wrote:
> Sorry, it is (n/3)*(n/3)*(n/3)/Cn3 = n^3/(27*Cn3)
Cn3 is "n choose 3"?
> > > > Last night it occurred to me that this is almost just having
> > > > pgp_num < pg_num, but I think that's not quite right either.
Actually, maybe it is. Basically, say there are X combinations of 3 disks
= n choose 3. Some fraction of these, say Y, are actually used by CRUSH.
If we are to reduce that number, that implies that there are some PGs that
are overlapping on the same set of disks. Which more or less reduces to
the case where pgp_num < pg_num, or the hashpspool flag isn't set, or any
other behavior that makes more than one PG line up on the same disk.
Just using fewer PGs in the system, in fact, would help here. The main
problem is that doing this tends to make the distribution less uniform, so
there is a tradeoff.
There is a reliability model in ceph-tools.git at
https://github.com/ceph/ceph-tools/tree/master/models/reliability
that Mark Kampe built last year. Sadly I haven't looked at it closely so
I'm not sure if it captures this.
sage
--
To unsubscribe from this list: send the line "unsubscribe ceph-devel" in
the body of a message to [email protected]
More majordomo info at http://vger.kernel.org/majordomo-info.html