[ 
https://issues.apache.org/jira/browse/IGNITE-23572?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Mirza Aliev reassigned IGNITE-23572:
------------------------------------

    Assignee: Mirza Aliev

> Change rebalance scheduling when data nodes are changed
> -------------------------------------------------------
>
>                 Key: IGNITE-23572
>                 URL: https://issues.apache.org/jira/browse/IGNITE-23572
>             Project: Ignite
>          Issue Type: Improvement
>            Reporter: Mirza Aliev
>            Assignee: Mirza Aliev
>            Priority: Major
>              Labels: ignite-3
>
> h3. Motivation
> [IEP-131|https://cwiki.apache.org/confluence/display/IGNITE/IEP-131%3A+Partition+Majority+Unavailability+Handling]
>  
>  
> In HA mode for scale up situations:
>  # Let’s say we have [A, B, C] for the partition assignments, B and C left.
>  # Raft group was narrowed in force manner to [A], and after that node B 
> returned, we must enhance stable to [A, B]
>  # In terms of DZ.scale up, there wasn't any change in DZ.scale up time 
> window, so data nodes will be the same, so it could mean that we don’t need 
> to schedule new rebalance to enhance stable assignment to [A, B], but we 
> actually do need. (Note that DZ.scale down timer is quite big and wasn’t 
> event passed)
> Proposed enhancements
>  # Data nodes are rewritten on scale up even if they are the same
>  # When we decide if we need to trigger rebalance after data nodes change, we 
> calculate assignments and apply nodes aliveness check filter to those 
> assignments. If we see that the actual stablePartAssignmentsKey differs from 
> the filtered one, we schedule rebalance.
>  # In our example, calculated assignments will be [A, B, C], we will filter 
> them to [A,B] and schedule new rebalance to enhance stablePartAssignmentsKey
> We need to investigate, if this approach must be improved in SC mode as well, 
> or must be done only in HA mode
> h3. Definition of done
> * After resolving this task, we must understand if the proposed approach must 
> be improved in SC mode as well, or must be done only in HA mode
> * Corresponding approach must be implemented.



--
This message was sent by Atlassian Jira
(v8.20.10#820010)

Reply via email to