[
https://issues.apache.org/jira/browse/IGNITE-7152?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16798820#comment-16798820
]
Sergey Chugunov commented on IGNITE-7152:
-----------------------------------------
[~zstan],
As I can see from TC Bot report there are non-flaky failures
*IgniteDiscoveryMassiveNodeFailTest.testMassiveFailSelfKill* and
*IgniteDiscoveryMassiveNodeFailTest.testMassiveFailAndRecovery* that don't fail
in master.
These tests cover connection recovery mechanism so it seems that your change
breaks something in it.
I think we need to come up with another way to fix the issue, either by
modifying connection recovery mechanism to segment node facing read receipt
failures or by improving timeouts handling.
> Failure detection timeout don't work on permanent send message errors causing
> infinite loop
> -------------------------------------------------------------------------------------------
>
> Key: IGNITE-7152
> URL: https://issues.apache.org/jira/browse/IGNITE-7152
> Project: Ignite
> Issue Type: Bug
> Components: general
> Affects Versions: 2.3
> Reporter: Alexandr Kuramshin
> Assignee: Stanilovsky Evgeny
> Priority: Critical
> Fix For: 2.8
>
> Attachments: testFailureDetectionOnReceiptError.patch
>
> Time Spent: 10m
> Remaining Estimate: 0h
>
> Relative to {{RingMessageWorker.sendMessageAcrossRing}} implementation.
> {{IgniteSpiOperationTimeoutHelper}} reinitialized every time the socket
> successfully connected.
> If any of {{IOException, IgniteCheckedException}} occurs upon message send
> the socket will be closed and old {{IgniteSpiOperationTimeoutHelper}} will be
> used to reconnect.
> But after successful reconnect the new one will be created and the cycle
> repeat. With a permanent send message error this causes an infinite loop.
> The only send error which may cause to exit out of the loop and the next node
> failure is {{IgniteSpiOperationTimeoutException, SocketTimeoutException,
> SocketException}}.
--
This message was sent by Atlassian JIRA
(v7.6.3#76005)