Re: Endless minor compactions after heavy inserts

2011-04-03 Thread aaron morton
With only one data file your reads would use the least amount of IO to find the data. Most people have multiple nodes and probably fewer disks, so each node may have a TB or two of data. How much capacity do your 10 disks give ? Will you be running multiple nodes in production ? Aaron On

Re: Endless minor compactions after heavy inserts

2011-04-03 Thread Sheng Chen
I think if i can keep a single sstable file in a proper size, the hot data/index files may be able to fit into memory at least in some occasions. In my use case, I want to use cassandra for storage of a large amount of log data. There will be multiple nodes, and each node has 10*2TB disks to hold

Re: Endless minor compactions after heavy inserts

2011-04-03 Thread Edward Capriolo
On Sun, Apr 3, 2011 at 1:46 PM, Sheng Chen chensheng2...@gmail.com wrote: I think if i can keep a single sstable file in a proper size, the hot data/index files may be able to fit into memory at least in some occasions. In my use case, I want to use cassandra for storage of a large amount of

Re: Endless minor compactions after heavy inserts

2011-04-01 Thread aaron morton
If you are doing some sort of bulk load you can disable minor compactions by setting the min_compaction_threshold and max_compaction_threshold to 0 . Then once your insert is complete run a major compaction via nodetool before turning the minor compaction back on. You can also reduce the

Re: Endless minor compactions after heavy inserts

2011-04-01 Thread mcasandra
Is there a way to monitor the compactions using nodetools? I don't see it in tpstats. -- View this message in context: http://cassandra-user-incubator-apache-org.3065146.n2.nabble.com/Endless-minor-compactions-after-heavy-inserts-tp6229633p6231672.html Sent from the cassandra-u

Re: Endless minor compactions after heavy inserts

2011-04-01 Thread Jake Luciani
-minor-compactions-after-heavy-inserts-tp6229633p6231672.html Sent from the cassandra-u...@incubator.apache.org mailing list archive at Nabble.com. -- http://twitter.com/tjake

Re: Endless minor compactions after heavy inserts

2011-04-01 Thread Sheng Chen
Thank you very much. The major compaction will merge everything into one big file., which would be very large. Is there any way to control the number or size of files created by major compaction? Or, is there a recommended number or size of files for cassandra to handle? Thanks. I see the

Endless minor compactions after heavy inserts

2011-03-31 Thread Sheng Chen
I've got a single node of cassandra 0.7.4, and I used the java stress tool to insert about 100 million records. The inserts took about 6 hours (45k inserts/sec) but the following minor compactions last for 2 days and the pending compaction jobs are still increasing. From jconsole I can read the