https://github.com/medined/accumulo-sharding - I wrote this description a bit ago. Okay, it was nearly a year. Then a few people commented on it (thanks!) and I got distracted. In any case, here it is. It starts like this:
A distributed database typically is thought of as having data spread across multiple servers. But how does the data spread out? That's a question I hope to answer - at least for Accumulo.
