[Openstack] [Swift] Erasure code durability and overhead in small clusters

Mark Kirkwood Tue, 13 Mar 2018 15:12:04 -0700

Hi,

I'm looking at adding per-region Erasure code policies to our Swiftcluster. Currently I'm experimenting with a small one - 3 hosts perregion (each with 6 devices). Doing some experimentation seems to havehighlighted a subtle relation between desire to minimize overhead anddurability to survive a *host* outage. I'll do some examples below, andfeel free to check my math :-)

For brevity use k = number of data fragments, m = number of parityfragments.

Suppose I use a (k=4, m=2) policy for each region. My overhead is m/k =50% (i.e 1G uses 1,5G on disk). Each of my 3 hosts has 2 fragments, soif I lose a host I still have 4 in total so can reassemble objects :-)

Suppose I use a (k=8. m=2) policy, Now my overhead is m/k = 25% (yay,better than 50%). However now my fragments get spread around like: 3, 3,4, If I lose a host I have at most 7 fragments - not enough toreassemble objects :-(

To me this suggests that a certain minimum number of *hosts* per regionis needed for a given EC policy to be durable in the advent of hostoutage (or destruction). Is this correct - or have a flubbed thecalculations?


regards

Mark


_______________________________________________
Mailing list: http://lists.openstack.org/cgi-bin/mailman/listinfo/openstack
Post to     : openstack@lists.openstack.org
Unsubscribe : http://lists.openstack.org/cgi-bin/mailman/listinfo/openstack

[Openstack] [Swift] Erasure code durability and overhead in small clusters

Reply via email to