Aleksey Plekhanov created IGNITE-27883:
------------------------------------------

             Summary: Flaky TcpCommunicationSpiDropNodesTest.testOneNode 
                 Key: IGNITE-27883
                 URL: https://issues.apache.org/jira/browse/IGNITE-27883
             Project: Ignite
          Issue Type: Bug
            Reporter: Aleksey Plekhanov
            Assignee: Aleksey Plekhanov


Throws an error on TeamCity:
{noformat}
java.lang.AssertionError: Should have exception here.
        at org.junit.Assert.fail(Assert.java:88)
        at 
org.apache.ignite.testframework.junits.JUnitAssertAware.fail(JUnitAssertAware.java:50)
        at 
org.apache.ignite.spi.communication.tcp.TcpCommunicationSpiDropNodesTest.testOneNode(TcpCommunicationSpiDropNodesTest.java:138)
{noformat}
Can be reproduced locally by changing sleep time in test to 1500 for example.

Reason:
When second node join the grid, partiition resend is scheduled (see 
GridCachePartitionExchangeManager#scheduleResendPartitions) with timeout 1500 
ms (see GridCachePartitionExchangeManager#partResendTimeout). Partition resend 
opens TCP communication connection from node0 to all other nodes. If it happens 
before TCP communication for node2 blocked, connection can be established and 
survive idle timeout.
It happens when time between second node join and TCP communication blocking is 
about 1500 ms (including current 1000 ms sleep time).



--
This message was sent by Atlassian Jira
(v8.20.10#820010)

Reply via email to