Avinash Lakshman writes:

> Here is why I think it is not a good fit for Cassandra (at least top 3
> reasons that come to mind):
>
> (1) Cassandra strives to make updates very very cheap. With BTree's every
> update is a read modify write.
> (2) BTree on rebalance tend to result in a lot of fragmentation leading to
> poorer performance over time.
> (3) Every write (update) is a random write while in Cassandra as it is today
> it will be a sequential one because of a log structured nature.

SSTables aren't written on every update.  Why would a B-Tree
implementation differ?  It seems like this could reduce the number of
seeks needed during reads (by having fewer B-Tree files than SSTables,
and possibly having a flatter index in each of those).

The compaction phase is used to reduce fragmentation already, right?

Michael Poole

Reply via email to