[
https://issues.apache.org/jira/browse/IGNITE-3018?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15279802#comment-15279802
]
Taras Ledkov commented on IGNITE-3018:
--------------------------------------
Unfortunately bucket based distribution has a very bad re-balance index.
In the test below the value is the average partitions count that are changed
after each change of the topology.
Test 60 nodes. Old: 11.4; New: 19.8;
Test 100 nodes. Old: 7.6; New: 135.7;
Test 200 nodes. Old: 4.0; New: 100.1;
Test 300 nodes. Old: 2.7; New: 75.1;
Test 400 nodes. Old: 1.9; New: 60.5;
Test 500 nodes. Old: 1.9; New: 52.5;
Test 600 nodes. Old: 1.2; New: 46.0;
So, it looks like we have to use simple sort all hash(partition, node) to
prevent rebalance problem.
> Cache affinity calculation is slow with large nodes number
> ----------------------------------------------------------
>
> Key: IGNITE-3018
> URL: https://issues.apache.org/jira/browse/IGNITE-3018
> Project: Ignite
> Issue Type: Bug
> Components: cache
> Reporter: Semen Boikov
> Assignee: Taras Ledkov
> Priority: Critical
> Fix For: 1.7
>
> Attachments: 003.png, 064.png, 100.png, 128.png, 200.png, 300.png,
> 400.png, 500.png, 600.png
>
>
> With large number of cache server nodes (> 200) RendezvousAffinityFunction
> and FairAffinityFunction work pretty slow .
> For RendezvousAffinityFunction.assignPartitions can take hundredes of
> milliseconds, for FairAffinityFunction it can take seconds.
> For RendezvousAffinityFunction most time is spent in MD5 hash calculation and
> nodes list sorting. As optimization we can try to cache {partion, node} MD5
> hash or try another hash function. Also several minor optimizations are
> possible (avoid unncecessary allocations, only one thread local 'get', etc).
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)