Load reported from node tool ring is the live load, which means SSTables that 
the server has open and will read from during a request. This will include 
tombstones, expired and over written data. 

nodetool ctstats also includes "dead" load, which is sstables that are in use 
but still on disk. 


Cheers
 
-----------------
Aaron Morton
Freelance Developer
@aaronmorton
http://www.thelastpickle.com

On 20/01/2012, at 12:50 AM, Marcel Steinbach wrote:

> 2012/1/19 aaron morton <aa...@thelastpickle.com>:
>> If you have performed any token moves the data will not be deleted until you
>> run nodetool cleanup.
> We did that after adding nodes to the cluster. And then, the cluster
> wasn't balanced either.
> Also, does the "Load" really account for "dead" data, or is it just live data?
> 
>> To get a baseline I would run nodetool compact to do major compaction and
>> purge any tomb stones as others have said.
> We will do that, but I doubt we have 450GB tomb stones on node 2...
> 
>> Cheers
>> 
>> -----------------
>> Aaron Morton
>> Freelance Developer
>> @aaronmorton
>> http://www.thelastpickle.com
>> 
>> On 18/01/2012, at 2:19 PM, Maki Watanabe wrote:
>> 
>> Are there any significant difference of number of sstables on each nodes?
>> 
>> 2012/1/18 Marcel Steinbach <marcel.steinb...@chors.de>:
>> 
>> We are running regular repairs, so I don't think that's the problem.
>> 
>> And the data dir sizes match approx. the load from the nodetool.
>> 
>> Thanks for the advise, though.
>> 
>> 
>> Our keys are digits only, and all contain a few zeros at the same
>> 
>> offsets. I'm not that familiar with the md5 algorithm, but I doubt that it
>> 
>> would generate 'hotspots' for those kind of keys, right?
>> 
>> 
>> On 17.01.2012, at 17:34, Mohit Anchlia wrote:
>> 
>> 
>> Have you tried running repair first on each node? Also, verify using
>> 
>> df -h on the data dirs
>> 
>> 
>> On Tue, Jan 17, 2012 at 7:34 AM, Marcel Steinbach
>> 
>> <marcel.steinb...@chors.de> wrote:
>> 
>> 
>> Hi,
>> 
>> 
>> 
>> we're using RP and have each node assigned the same amount of the token
>> 
>> space. The cluster looks like that:
>> 
>> 
>> 
>> Address         Status State   Load            Owns    Token
>> 
>> 
>> 
>> 205648943402372032879374446248852460236
>> 
>> 
>> 1       Up     Normal  310.83 GB       12.50%
>> 
>>  56775407874461455114148055497453867724
>> 
>> 
>> 2       Up     Normal  470.24 GB       12.50%
>> 
>>  78043055807020109080608968461939380940
>> 
>> 
>> 3       Up     Normal  271.57 GB       12.50%
>> 
>>  99310703739578763047069881426424894156
>> 
>> 
>> 4       Up     Normal  282.61 GB       12.50%
>> 
>>  120578351672137417013530794390910407372
>> 
>> 
>> 5       Up     Normal  248.76 GB       12.50%
>> 
>>  141845999604696070979991707355395920588
>> 
>> 
>> 6       Up     Normal  164.12 GB       12.50%
>> 
>>  163113647537254724946452620319881433804
>> 
>> 
>> 7       Up     Normal  76.23 GB        12.50%
>> 
>>  184381295469813378912913533284366947020
>> 
>> 
>> 8       Up     Normal  19.79 GB        12.50%
>> 
>>  205648943402372032879374446248852460236
>> 
>> 
>> 
>> I was under the impression, the RP would distribute the load more evenly.
>> 
>> 
>> Our row sizes are 0,5-1 KB, hence, we don't store huge rows on a single
>> 
>> node. Should we just move the nodes so that the load is more even
>> 
>> distributed, or is there something off that needs to be fixed first?
>> 
>> 
>> 
>> Thanks
>> 
>> 
>> Marcel
>> 
>> 
>> <hr style="border-color:blue">
>> 
>> 
>> <p>chors GmbH
>> 
>> 
>> <br><hr style="border-color:blue">
>> 
>> 
>> <p>specialists in digital and direct marketing solutions<br>
>> 
>> 
>> Haid-und-Neu-Straße 7<br>
>> 
>> 
>> 76131 Karlsruhe, Germany<br>
>> 
>> 
>> www.chors.com</p>
>> 
>> 
>> <p>Managing Directors: Dr. Volker Hatz, Markus Plattner<br>Amtsgericht
>> 
>> Montabaur, HRB 15029</p>
>> 
>> 
>> <p style="font-size:9px">This e-mail is for the intended recipient only and
>> 
>> may contain confidential or privileged information. If you have received
>> 
>> this e-mail by mistake, please contact us immediately and completely delete
>> 
>> it (and any attachments) and do not forward it or inform any other person of
>> 
>> its contents. If you send us messages by e-mail, we take this as your
>> 
>> authorization to correspond with you by e-mail. E-mail transmission cannot
>> 
>> be guaranteed to be secure or error-free as information could be
>> 
>> intercepted, amended, corrupted, lost, destroyed, arrive late or incomplete,
>> 
>> or contain viruses. Neither chors GmbH nor the sender accept liability for
>> 
>> any errors or omissions in the content of this message which arise as a
>> 
>> result of its e-mail transmission. Please note that all e-mail
>> 
>> communications to and from chors GmbH may be monitored.</p>
>> 
>> 
>> 
>> 
>> 
>> 
>> --
>> w3m
>> 
>> 

Reply via email to