[
https://issues.apache.org/jira/browse/IGNITE-13193?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17151020#comment-17151020
]
Ignite TC Bot commented on IGNITE-13193:
----------------------------------------
{panel:title=Branch: [pull/7971/head] Base: [master] : Possible Blockers
(1)|borderStyle=dashed|borderColor=#ccc|titleBGColor=#F7D6C1}
{color:#d04437}PDS (Indexing){color} [[tests 0 Exit Code
|https://ci.ignite.apache.org/viewLog.html?buildId=5436543]]
{panel}
{panel:title=Branch: [pull/7971/head] Base: [master] : New Tests
(8)|borderStyle=dashed|borderColor=#ccc|titleBGColor=#D6F7C1}
{color:#00008b}Service Grid{color} [tests 4]
* {color:#013220}IgniteServiceGridTestSuite:
ServiceDeploymentProcessIdSelfTest.requestId[Test event=IgniteBiTuple
[val1=DiscoveryEvent [evtNode=fab90e3a-7a80-42d5-aace-d8bcd082be28, topVer=0,
nodeId8=de225e46, msg=, type=NODE_JOINED, tstamp=1593735466437],
val2=AffinityTopologyVersion [topVer=2720052317725509699, minorTopVer=0]]] -
PASSED{color}
* {color:#013220}IgniteServiceGridTestSuite:
ServiceDeploymentProcessIdSelfTest.topologyVersion[Test event=IgniteBiTuple
[val1=DiscoveryCustomEvent [customMsg=ServiceChangeBatchRequest
[id=9cd49021371-6d964edf-1554-43c4-9610-ba4d1175766c, reqs=SingletonList
[ServiceUndeploymentRequest []]], affTopVer=null, super=DiscoveryEvent
[evtNode=b6e73219-cc3c-4851-b05e-8c9f4d4e04fc, topVer=0, nodeId8=b6e73219,
msg=null, type=DISCOVERY_CUSTOM_EVT, tstamp=1593735466437]],
val2=AffinityTopologyVersion [topVer=-9005077219046006491, minorTopVer=0]]] -
PASSED{color}
* {color:#013220}IgniteServiceGridTestSuite:
ServiceDeploymentProcessIdSelfTest.requestId[Test event=IgniteBiTuple
[val1=DiscoveryCustomEvent [customMsg=ServiceChangeBatchRequest
[id=9cd49021371-6d964edf-1554-43c4-9610-ba4d1175766c, reqs=SingletonList
[ServiceUndeploymentRequest []]], affTopVer=null, super=DiscoveryEvent
[evtNode=b6e73219-cc3c-4851-b05e-8c9f4d4e04fc, topVer=0, nodeId8=b6e73219,
msg=null, type=DISCOVERY_CUSTOM_EVT, tstamp=1593735466437]],
val2=AffinityTopologyVersion [topVer=-9005077219046006491, minorTopVer=0]]] -
PASSED{color}
* {color:#013220}IgniteServiceGridTestSuite:
ServiceDeploymentProcessIdSelfTest.topologyVersion[Test event=IgniteBiTuple
[val1=DiscoveryEvent [evtNode=fab90e3a-7a80-42d5-aace-d8bcd082be28, topVer=0,
nodeId8=de225e46, msg=, type=NODE_JOINED, tstamp=1593735466437],
val2=AffinityTopologyVersion [topVer=2720052317725509699, minorTopVer=0]]] -
PASSED{color}
{color:#00008b}Service Grid (legacy mode){color} [tests 4]
* {color:#013220}IgniteServiceGridTestSuite:
ServiceDeploymentProcessIdSelfTest.topologyVersion[Test event=IgniteBiTuple
[val1=DiscoveryEvent [evtNode=d6e5484b-0c04-4958-af8c-87060631298e, topVer=0,
nodeId8=0b6cb16a, msg=, type=NODE_JOINED, tstamp=1593735530785],
val2=AffinityTopologyVersion [topVer=8840357433858611253, minorTopVer=0]]] -
PASSED{color}
* {color:#013220}IgniteServiceGridTestSuite:
ServiceDeploymentProcessIdSelfTest.requestId[Test event=IgniteBiTuple
[val1=DiscoveryEvent [evtNode=d6e5484b-0c04-4958-af8c-87060631298e, topVer=0,
nodeId8=0b6cb16a, msg=, type=NODE_JOINED, tstamp=1593735530785],
val2=AffinityTopologyVersion [topVer=8840357433858611253, minorTopVer=0]]] -
PASSED{color}
* {color:#013220}IgniteServiceGridTestSuite:
ServiceDeploymentProcessIdSelfTest.topologyVersion[Test event=IgniteBiTuple
[val1=DiscoveryCustomEvent [customMsg=ServiceChangeBatchRequest
[id=d1db4121371-52eedea3-cb4b-4f87-ad96-23a96c33063b, reqs=SingletonList
[ServiceUndeploymentRequest []]], affTopVer=null, super=DiscoveryEvent
[evtNode=39549e86-8440-4fda-aa43-8f588a7e8ac8, topVer=0, nodeId8=39549e86,
msg=null, type=DISCOVERY_CUSTOM_EVT, tstamp=1593735530785]],
val2=AffinityTopologyVersion [topVer=2935041773177708175, minorTopVer=0]]] -
PASSED{color}
* {color:#013220}IgniteServiceGridTestSuite:
ServiceDeploymentProcessIdSelfTest.requestId[Test event=IgniteBiTuple
[val1=DiscoveryCustomEvent [customMsg=ServiceChangeBatchRequest
[id=d1db4121371-52eedea3-cb4b-4f87-ad96-23a96c33063b, reqs=SingletonList
[ServiceUndeploymentRequest []]], affTopVer=null, super=DiscoveryEvent
[evtNode=39549e86-8440-4fda-aa43-8f588a7e8ac8, topVer=0, nodeId8=39549e86,
msg=null, type=DISCOVERY_CUSTOM_EVT, tstamp=1593735530785]],
val2=AffinityTopologyVersion [topVer=2935041773177708175, minorTopVer=0]]] -
PASSED{color}
{panel}
[TeamCity *--> Run :: All*
Results|https://ci.ignite.apache.org/viewLog.html?buildId=5436032&buildTypeId=IgniteTests24Java8_RunAll]
> Implement fallback to full partition rebalancing in case historical supplier
> failed to read all necessary data updates from WAL
> -------------------------------------------------------------------------------------------------------------------------------
>
> Key: IGNITE-13193
> URL: https://issues.apache.org/jira/browse/IGNITE-13193
> Project: Ignite
> Issue Type: Improvement
> Affects Versions: 2.8.1
> Reporter: Vyacheslav Koptilin
> Assignee: Vyacheslav Koptilin
> Priority: Major
> Time Spent: 40m
> Remaining Estimate: 0h
>
> Historical rebalance may fail for several reasons:
> 1) WAL on supplier node is corrupted - the supplier will trigger a failure
> handler in the current implementation.
> 2) After iteration over WAL demander node didn't receive all updates to make
> MOVING partition up-to-date (resulting update counter didn't converge with
> expected update counter of OWNING partition) - demander will silently ignore
> lack of updates in the current implementation.
> Such behavior negatively affects the stability of the cluster: an
> inappropriate state of historical WAL is not a reason to fail a supplier node.
> The more proper way to handle this scenario is:
> - Either try to rebalance partition historically from another supplier
> - Or use full partition rebalance for problem partition
> Once the supplier fails to provide data from part of the WAL, its
> corresponding sequence of checkpoints should be marked as inapplicable for
> historical rebalance in order to prevent further errors.
--
This message was sent by Atlassian Jira
(v8.3.4#803005)