[
https://issues.apache.org/jira/browse/IGNITE-10815?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16728796#comment-16728796
]
ASF GitHub Bot commented on IGNITE-10815:
-----------------------------------------
GitHub user Jokser opened a pull request:
https://github.com/apache/ignite/pull/5746
IGNITE-10815 Fixed coordinator failover in case of exchanges merge and
non-affinity nodes
You can merge this pull request into a Git repository by running:
$ git pull https://github.com/gridgain/apache-ignite ignite-10815
Alternatively you can review and apply these changes as the patch at:
https://github.com/apache/ignite/pull/5746.patch
To close this pull request, make a commit to your master/trunk branch
with (at least) the following in the commit message:
This closes #5746
----
commit 97d4d22f12f0a24060ea1cd7253758065cf77023
Author: Pavel Kovalenko <jokserfn@...>
Date: 2018-12-25T17:25:57Z
IGNITE-10815 WIP
Signed-off-by: Pavel Kovalenko <[email protected]>
commit 141f40b8742d12681b3f41f7ee3dbc3ae2702380
Author: Pavel Kovalenko <jokserfn@...>
Date: 2018-12-25T18:51:11Z
IGNITE-10815 Fix and test.
Signed-off-by: Pavel Kovalenko <[email protected]>
commit 84b3fc09cca1f7133883bd44c64a1466d32d5b53
Author: Pavel Kovalenko <jokserfn@...>
Date: 2018-12-25T18:52:08Z
IGNITE-10815 Cleanup
Signed-off-by: Pavel Kovalenko <[email protected]>
----
> NullPointerException in InitNewCoordinatorFuture.init() leads to cluster hang
> -----------------------------------------------------------------------------
>
> Key: IGNITE-10815
> URL: https://issues.apache.org/jira/browse/IGNITE-10815
> Project: Ignite
> Issue Type: Bug
> Affects Versions: 2.4
> Reporter: Anton Kurbanov
> Assignee: Pavel Kovalenko
> Priority: Critical
> Fix For: 2.8
>
>
> Possible scenario to reproduce:
> 1. Force few consecutive exchange merges and finish.
> 2. Trigger exchange.
> 3. Shutdown coordinator node before sending/receiving full partitions message.
>
> Stacktrace:
> {code:java}
> 2018-12-24 15:54:02,664 sys-#48%gg% ERROR
> org.apache.ignite.internal.processors.cache.distributed.dht.preloader.GridDhtPartitionsExchangeFuture
> - Failed to init new coordinator future: bd74f7ed-6984-4f78-9941-480df673ab77
> java.lang.NullPointerException: null
> at
> org.apache.ignite.internal.processors.cache.distributed.dht.preloader.GridDhtPartitionsExchangeFuture.events(GridDhtPartitionsExchangeFuture.java:534)
> ~[ignite-core-2.4.13.b4.jar:2.4.13.b4]
> at
> org.apache.ignite.internal.processors.cache.CacheAffinitySharedManager$18.applyx(CacheAffinitySharedManager.java:1790)
> ~[ignite-core-2.4.13.b4.jar:2.4.13.b4]
> at
> org.apache.ignite.internal.processors.cache.CacheAffinitySharedManager$18.applyx(CacheAffinitySharedManager.java:1738)
> ~[ignite-core-2.4.13.b4.jar:2.4.13.b4]
> at
> org.apache.ignite.internal.processors.cache.CacheAffinitySharedManager.forAllRegisteredCacheGroups(CacheAffinitySharedManager.java:1107)
> ~[ignite-core-2.4.13.b4.jar:2.4.13.b4]
> at
> org.apache.ignite.internal.processors.cache.CacheAffinitySharedManager.initCoordinatorCaches(CacheAffinitySharedManager.java:1738)
> ~[ignite-core-2.4.13.b4.jar:2.4.13.b4]
> at
> org.apache.ignite.internal.processors.cache.distributed.dht.preloader.InitNewCoordinatorFuture.init(InitNewCoordinatorFuture.java:104)
> ~[ignite-core-2.4.13.b4.jar:2.4.13.b4]
> at
> org.apache.ignite.internal.processors.cache.distributed.dht.preloader.GridDhtPartitionsExchangeFuture$8$1.call(GridDhtPartitionsExchangeFuture.java:3439)
> [ignite-core-2.4.13.b4.jar:2.4.13.b4]
> at
> org.apache.ignite.internal.processors.cache.distributed.dht.preloader.GridDhtPartitionsExchangeFuture$8$1.call(GridDhtPartitionsExchangeFuture.java:3435)
> [ignite-core-2.4.13.b4.jar:2.4.13.b4]
> at
> org.apache.ignite.internal.util.IgniteUtils.wrapThreadLoader(IgniteUtils.java:6720)
> [ignite-core-2.4.13.b4.jar:2.4.13.b4]
> at
> org.apache.ignite.internal.processors.closure.GridClosureProcessor$2.body(GridClosureProcessor.java:967)
> [ignite-core-2.4.13.b4.jar:2.4.13.b4]
> at org.apache.ignite.internal.util.worker.GridWorker.run(GridWorker.java:110)
> [ignite-core-2.4.13.b4.jar:2.4.13.b4]
> at
> java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1149)
> [?:1.8.0_171]
> at
> java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:624)
> [?:1.8.0_171]
> at java.lang.Thread.run(Thread.java:748) [?:1.8.0_171]
> {code}
>
--
This message was sent by Atlassian JIRA
(v7.6.3#76005)