[
https://issues.apache.org/jira/browse/CASSANDRA-7032?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Branimir Lambov updated CASSANDRA-7032:
---------------------------------------
Attachment: TestVNodeAllocation.java
Attached a new version that does not depend on a fixed number of vnodes per
node. It distributes the load proportionally to the number of tokens allocated
to each node and aims to minimize over/under-utilization as a percentage of
the node's load.
The scheme is the same as above; the only change is that, in order to take
varying loads into account, the load of nodes and tokens in the improvement
calculation is normalized by the inverse of the node's size.
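A minimal sketch of this kind of normalization, with hypothetical names rather
than the attached TestVNodeAllocation.java: each node's fair share is taken as
proportional to its token count, and its deviation from that share is measured
as a fraction of the share itself.
{code:java}
import java.util.*;

// Minimal sketch (hypothetical names, not the attached TestVNodeAllocation.java):
// a node's fair share is proportional to its token count, and its
// over/under-utilization is measured relative to that share.
public class NormalizedOwnershipSketch
{
    /** Ring as a sorted map of token -> owning node id; tokensPerNode gives each node's token count. */
    static double maxRelativeImbalance(NavigableMap<Long, Integer> ring, Map<Integer, Integer> tokensPerNode)
    {
        int totalTokens = ring.size();
        Map<Integer, Double> ownership = new HashMap<>();

        long prev = ring.lastKey();   // the first range wraps around the end of the ring
        for (Map.Entry<Long, Integer> e : ring.entrySet())
        {
            ownership.merge(e.getValue(), fraction(prev, e.getKey()), Double::sum);
            prev = e.getKey();
        }

        double worst = 0;
        for (Map.Entry<Integer, Double> e : ownership.entrySet())
        {
            double fairShare = (double) tokensPerNode.get(e.getKey()) / totalTokens;
            // normalize by the node's own share: imbalance as a fraction of its load
            worst = Math.max(worst, Math.abs(e.getValue() - fairShare) / fairShare);
        }
        return worst;
    }

    /** Fraction of the token ring covered by the range (prev, token], treating the span as unsigned. */
    static double fraction(long prev, long token)
    {
        double span = token - prev;
        if (span < 0)
            span += Math.pow(2, 64);
        return span / Math.pow(2, 64);
    }

    public static void main(String[] args)
    {
        NavigableMap<Long, Integer> ring = new TreeMap<>();
        ring.put(Long.MIN_VALUE / 2, 0);  // node 0 holds two tokens
        ring.put(0L, 1);                  // node 1 holds one token
        ring.put(Long.MAX_VALUE / 2, 0);
        System.out.println(maxRelativeImbalance(ring, Map.of(0, 2, 1, 1)));
    }
}
{code}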
The results achieved in asymmetric node configurations are very similar to
those in the homogeneous scenario. The behaviour for fixed vnode counts is
exactly the same as in the previous version.
I am done tinkering with the algorithm and have started work on integrating it
into the codebase.
> Improve vnode allocation
> ------------------------
>
> Key: CASSANDRA-7032
> URL: https://issues.apache.org/jira/browse/CASSANDRA-7032
> Project: Cassandra
> Issue Type: Improvement
> Components: Core
> Reporter: Benedict
> Assignee: Branimir Lambov
> Labels: performance, vnodes
> Fix For: 3.0
>
> Attachments: TestVNodeAllocation.java, TestVNodeAllocation.java,
> TestVNodeAllocation.java, TestVNodeAllocation.java, TestVNodeAllocation.java,
> TestVNodeAllocation.java
>
>
> It's been known for a little while that random vnode allocation causes
> hotspots of ownership. It should be possible to improve dramatically on this
> with deterministic allocation. I have quickly thrown together a simple greedy
> algorithm that allocates vnodes efficiently, gradually repairs hotspots in a
> randomly allocated cluster as more nodes are added, and ensures that token
> ranges are fairly evenly spread between nodes (somewhat tunably so). The
> allocation still permits slight discrepancies in ownership, but these are
> bounded by the inverse of the size of the cluster (as opposed to random
> allocation, which strangely gets worse as the cluster size increases).
> I'm sure there is a decent dynamic programming solution to this that would be
> even better.
> If, on joining the ring, a new node were to run this (or a similar) algorithm
> and then CAS a shared table holding the canonical allocation of token ranges,
> we could get guaranteed bounds on the ownership distribution in a cluster.
> This would also help with CASSANDRA-6696.
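The quoted description sketches the greedy idea only in prose; one plausible
reading of it, with hypothetical names and not the attached algorithm, is to
place each new token so that it splits the currently widest range:
{code:java}
import java.util.*;

// Rough sketch of a greedy placement in the spirit of the description above
// (hypothetical, not the attached algorithm): each new token splits the
// currently widest range, which gradually evens out ownership as nodes join.
public class GreedySplitSketch
{
    static void addNode(NavigableMap<Long, Integer> ring, int nodeId, int numTokens)
    {
        for (int i = 0; i < numTokens; i++)
        {
            if (ring.isEmpty())
            {
                ring.put(0L, nodeId);   // seed the ring with an arbitrary first token
                continue;
            }
            if (ring.size() == 1)
            {
                ring.put(ring.firstKey() + (1L << 63), nodeId);   // antipodal point of the single token
                continue;
            }

            // find the widest range between consecutive tokens (spans are unsigned/circular)
            long bestStart = ring.lastKey(), bestSpan = 0;
            long prev = ring.lastKey();
            for (long t : ring.keySet())
            {
                long span = t - prev;
                if (Long.compareUnsigned(span, bestSpan) > 0)
                {
                    bestStart = prev;
                    bestSpan = span;
                }
                prev = t;
            }

            // split the widest range at its midpoint (wraparound handled by long overflow)
            ring.put(bestStart + (bestSpan >>> 1), nodeId);
        }
    }

    public static void main(String[] args)
    {
        NavigableMap<Long, Integer> ring = new TreeMap<>();
        for (int node = 0; node < 4; node++)
            addNode(ring, node, 8);
        System.out.println(ring);
    }
}
{code}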
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)