I think you are doing ok,
I have a CF with the following schema
ColumnFamily: tk_counters
Key Validation Class:
org.apache.cassandra.db.marshal.CompositeType(org.apache.cassandra.db.marshal.UTF8Type,org.apache.cassandra.db.marshal.UUIDType)
Default column value validator:
org.apache.ca
Check your log for messages about rebuilding indices: that might grow your
dataset some.
One thing is for sure: the data import removed all the crap that lasted in
the 0.8.1 cluster (duplicates, thombstones etc). The decrease is fairly
dramatic but not unlogical at all.
2012/3/16 Jeremiah Jordan
" By default Cassandra tries to write to both nodes, always. Writes will
only fail (on a node) if it is down, and even then hinted handoff will
attempt to keep both nodes in sync when the troubled node comes back up.
The point of having two nodes is to have read and write availability in the
face o