Hey all,
Our setup is 5 machines running Cassandra 0.7.0 with 24GB of heap and 1.5TB
of disk each, collocated in a DC. We're doing bulk imports from each of the nodes
with RF = 2 and write consistency ANY (write perf is very important). The
behavior we're seeing is this:
- Nodes often
1. Why 24GB of heap? Do you need a heap this large? A bigger heap can lead to
longer GC cycles, but 15 minutes looks too long.
2. Do you have the row cache enabled?
3. How many column families do you have?
4. Enable GC logging and monitor what the GC is doing to get an idea of why it is
taking so long. You can add