Michael, Robin,

Let us know if the reported live load is increasing and diverging from the on-disk size.
If it is, can you check nodetool cfstats and find an example of a particular CF where Space Used Live has diverged from the on-disk size? Then provide the schema for the CF and any other info that may be handy.

Cheers

-----------------
Aaron Morton
Freelance Developer
@aaronmorton
http://www.thelastpickle.com

On 18/01/2012, at 10:58 PM, Michael Vaknine wrote:

> I did restart the cluster and now it is back to a normal 5GB.
>
> From: R. Verlangen [mailto:ro...@us2.nl]
> Sent: Wednesday, January 18, 2012 11:32 AM
> To: user@cassandra.apache.org
> Subject: Re: nodetool ring question
>
> I also have this problem. My data on nodes grows to roughly 30GB. After a
> restart only 5GB remains. Is a factor of 6 common for Cassandra?
>
> 2012/1/18 aaron morton <aa...@thelastpickle.com>
> Good idea Jeremiah. Are you using compression, Michael?
>
> Scanning through the CF stats this jumps out…
>
> Column Family: Attractions
> SSTable count: 3
> Space used (live): 27542876685
> Space used (total): 1213220387
>
> That's about 25GB of live data but only about 1.1GB total.
>
> Otherwise want to see if a restart fixes it :) Would be interesting to know
> if it's wrong from the start or drifts during streaming or compaction.
>
> Cheers
>
> -----------------
> Aaron Morton
> Freelance Developer
> @aaronmorton
> http://www.thelastpickle.com
>
> On 18/01/2012, at 12:04 PM, Jeremiah Jordan wrote:
>
> There were some nodetool ring load reporting issues with early versions of
> 1.0.X. I don't remember when they were fixed, but that could be your issue.
> Are you using compressed column families? A lot of the issues were with
> those. You might want to update to 1.0.7.
>
> -Jeremiah
>
> On 01/16/2012 04:04 AM, Michael Vaknine wrote:
> Hi,
>
> I have a 4-node cluster running version 1.0.3.
>
> This is what I get when I run nodetool ring:
>
> Address       DC          Rack  Status State  Load     Owns   Token
>                                                               127605887595351923798765477786913079296
> 10.8.193.87   datacenter1 rack1 Up     Normal 46.47 GB 25.00% 0
> 10.5.7.76     datacenter1 rack1 Up     Normal 48.01 GB 25.00% 42535295865117307932921825928971026432
> 10.8.189.197  datacenter1 rack1 Up     Normal 53.7 GB  25.00% 85070591730234615865843651857942052864
> 10.5.3.17     datacenter1 rack1 Up     Normal 43.49 GB 25.00% 127605887595351923798765477786913079296
>
> I have finished running repair on all 4 nodes.
>
> I have less than 10 GB in the /var/lib/cassandra/data/ folders.
>
> My question is: why does nodetool report almost 50 GB on each node?
>
> Thanks
> Michael
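
For anyone who wants to automate the cfstats check suggested at the top of this message, here is a minimal Python sketch. It assumes you have saved the raw output of nodetool cfstats to a text file and that the output uses the same labels quoted in this thread ("Column Family:", "Space used (live):", "Space used (total):"). The names parse_cfstats, diverged and SAMPLE are made up for illustration and are not part of Cassandra. It flags any CF whose live size is larger than its total size, which is the mismatch seen on the Attractions CF; treat that rule as a heuristic rather than an official diagnostic.

```python
import re

# Sample text copied from the cfstats excerpt quoted in this thread.
SAMPLE = """\
Column Family: Attractions
SSTable count: 3
Space used (live): 27542876685
Space used (total): 1213220387
"""


def parse_cfstats(text):
    """Return {cf_name: {"live": bytes, "total": bytes}} from cfstats-style text."""
    stats = {}
    current = None
    for raw in text.splitlines():
        line = raw.strip()
        m = re.match(r"Column Family:\s*(\S+)", line)
        if m:
            current = m.group(1)
            stats[current] = {}
            continue
        m = re.match(r"Space used \((live|total)\):\s*(\d+)", line)
        if m and current is not None:
            stats[current][m.group(1)] = int(m.group(2))
    return stats


def diverged(stats):
    """List (cf_name, live, total) for CFs where live exceeds total."""
    flagged = []
    for name, sizes in sorted(stats.items()):
        if "live" in sizes and "total" in sizes and sizes["live"] > sizes["total"]:
            flagged.append((name, sizes["live"], sizes["total"]))
    return flagged


for name, live, total in diverged(parse_cfstats(SAMPLE)):
    print(f"{name}: live={live} bytes, total={total} bytes")
```

To run it against a real node, replace SAMPLE with the contents of your saved file, e.g. parse_cfstats(open("cfstats.txt").read()), where cfstats.txt is a hypothetical file name. Any CF it flags is a good candidate to post here along with its schema.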