[
https://issues.apache.org/jira/browse/IGNITE-20998?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17792788#comment-17792788
]
Vladislav Pyatkov commented on IGNITE-20998:
--------------------------------------------
Merged 87a7e19de14c6449b4ca0b086f55911d210664b0
> ItDurableFinishTest pauses the messaging service
> ------------------------------------------------
>
> Key: IGNITE-20998
> URL: https://issues.apache.org/jira/browse/IGNITE-20998
> Project: Ignite
> Issue Type: Bug
> Reporter: Vladislav Pyatkov
> Assignee: Kirill Sizov
> Priority: Major
> Attachments: _Integration_Tests_Module_Runner_19752.log.zip
>
> Time Spent: 20m
> Remaining Estimate: 0h
>
> The pattern of using the dropMessages method is incorrect because it may lead
> to a breakdown of the cluster topology.
> {code}
> coordinatorMessaging.dropMessages((s, networkMessage) -> {
> if (networkMessage instanceof TxFinishReplicaRequest) {
> try {
> logger().info("Pausing message handling: {}.", networkMessage);
> transfer.countDown();
> msg.await();
> logger().info("Continue message handling: {}.", networkMessage);
> } catch (InterruptedException e) {
> throw new RuntimeException(e);
> }
> }
> return false;
> });
> {code}
> If the pause exceeds ten seconds, it leads to a lost physical connection
> between nodes.
> {noformat}
> _Integration_Tests_Module_Runner_19752.log:[2023-11-28T07:33:36,238][INFO
> ][%idft_tcpof_0%MessagingService-inbound--0][ItDurableFinishTest] Pausing
> message handling: TxFinishReplicaRequestImpl [commit=true,
> commitTimestampLong=111487013108973569, groupId=6_part_0, groups=HashSet
> [6_part_0], term=111487013074829313, timestampLong=111487013108973571,
> txId=018c14d9-055b-0000-0000-000009a89413].
> _Integration_Tests_Module_Runner_19752.log:[2023-11-28T07:33:49,246][INFO
> ][%idft_tcpof_0%MessagingService-inbound--0][ItDurableFinishTest] Continue
> message handling: TxFinishReplicaRequestImpl [commit=true,
> commitTimestampLong=111487013108973569, groupId=6_part_0, groups=HashSet
> [6_part_0], term=111487013074829313, timestampLong=111487013108973571,
> txId=018c14d9-055b-0000-0000-000009a89413].
> _Integration_Tests_Module_Runner_19752.log:[2023-11-28T07:34:07,607][INFO
> ][%idft_twfc_2%JRaft-FSMCaller-Disruptor-_stripe_4-0][ItDurableFinishTest]
> Pausing message handling: TxCleanupReplicaRequestImpl [commit=true,
> commitTimestampLong=111487015164641280, groupId=6_part_0,
> timestampLong=111487015164772353, txId=018c14d9-7fe4-0000-0000-000011e90007].
> _Integration_Tests_Module_Runner_19752.log:[2023-11-28T07:34:12,049][INFO
> ][%idft_twfc_2%JRaft-FSMCaller-Disruptor-_stripe_4-0][ItDurableFinishTest]
> Continue message handling: TxCleanupReplicaRequestImpl [commit=true,
> commitTimestampLong=111487015164641280, groupId=6_part_0,
> timestampLong=111487015164772353, txId=018c14d9-7fe4-0000-0000-000011e90007].
> {noformat}
--
This message was sent by Atlassian Jira
(v8.20.10#820010)