[ 
https://issues.apache.org/jira/browse/IGNITE-14224?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17302265#comment-17302265
 ] 

Vladimir Goncharov edited comment on IGNITE-14224 at 3/16/21, 11:25 AM:
------------------------------------------------------------------------

Windows 10 Home/Pro

JDK 1.8.251/241 (Oracle/OpenJDK)

Without the fix: exception on spi.openSocket in ServerImpl.sendMessageDirectly.

IgniteSpiException : class org.apache.ignite.spi.IgniteSpiException: Failed to 
send message to address [addr=/127.0.0.1:47500, 
msg=TcpDiscoveryJoinRequestMessage [node=TcpDiscoveryNode 
[id=b712cdf2-5e80-4d05-b45c-a113bdeea34b, consistentId=127.0.0.1:47501, 
addrs=ArrayList [127.0.0.1], sockAddrs=HashSet [/127.0.0.1:47501], 
discPort=47501, order=0, intOrder=0, lastExchangeTime=1615819551929, loc=true, 
ver=2.11.0#20210310-sha1:e90f9666, isClient=false], 
dataPacket=org.apache.ignite.spi.discovery.tcp.internal.DiscoveryDataPacket@33a630fa,
 super=TcpDiscoveryAbstractMessage [sndNodeId=null, 
id=3ec89563871-b712cdf2-5e80-4d05-b45c-a113bdeea34b, verifierNodeId=null, 
topVer=0, pendingIdx=0, failedNodes=null, isClient=false]]]

The connection count doesn't increment and spi.joinTimeout is zero. This leads 
to an infinite cycle of cluster joining.
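
To illustrate why these two conditions together never terminate, here is a minimal sketch of a join retry loop. It is not the code from ServerImpl: the class JoinLoopSketch, the Attempt interface, the countFailures flag and the safetyCap parameter are all hypothetical names introduced for illustration, and the cap exists only so the demo itself terminates. The sketch assumes that a timeout of zero means "no timeout", as the behaviour described above suggests, and uses the limit of 3 failed connections mentioned in this comment.

```java
import java.io.IOException;

public class JoinLoopSketch {
    /** Hypothetical stand-in for one connection attempt to a cluster node. */
    public interface Attempt {
        void run() throws IOException;
    }

    /**
     * Retries the attempt until it succeeds, the failure limit is reached,
     * or a positive timeout expires. A timeout of 0 is treated as "no timeout"
     * (an assumption of this sketch).
     */
    public static String tryJoin(Attempt attempt, int maxFailures, long joinTimeoutMs,
                                 boolean countFailures, int safetyCap) {
        long startNanos = System.nanoTime();
        int failures = 0;

        for (int i = 1; i <= safetyCap; i++) {
            try {
                attempt.run();
                return "JOINED";
            }
            catch (IOException e) {
                // If the failure is not counted, the limit below is never reached.
                if (countFailures)
                    failures++;
            }

            if (failures >= maxFailures)
                return "GAVE_UP after " + i + " attempts";

            long elapsedMs = (System.nanoTime() - startNanos) / 1_000_000L;
            if (joinTimeoutMs > 0 && elapsedMs >= joinTimeoutMs)
                return "TIMED_OUT";
        }

        // Reached only because of the demo cap; a real loop would keep spinning.
        return "STILL_RETRYING";
    }

    public static void main(String[] args) {
        Attempt failing = () -> { throw new IOException("connection refused"); };

        // Failures not counted, no timeout: nothing ever ends the loop.
        System.out.println(tryJoin(failing, 3, 0L, false, 1000)); // STILL_RETRYING

        // Failures counted: the node stops after 3 failed connections.
        System.out.println(tryJoin(failing, 3, 0L, true, 1000)); // GAVE_UP after 3 attempts
    }
}
```

With a zero timeout, the failure counter is the only exit condition, so an exception that bypasses the counter leaves the loop with no way out.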

With the fix: a timeout exception after opening the socket and sending the 
message, as expected, with the joining node correctly stopping after 3 failed 
connections.

Got exception while starting (will rollback startup routine).
Command protocol successfully stopped: TCP binary

JDK 1.8.261/271/281 (Oracle/OpenJDK)

Works both with and without the patch.


was (Author: shadowru):
Java 1.8.251/241 Oracle/Open

W/o fix - exception on spi.openSocket in ServerImpl.sendMessageDirectly.

IgniteSpiException : class org.apache.ignite.spi.IgniteSpiException: Failed to 
send message to address [addr=/127.0.0.1:47500, 
msg=TcpDiscoveryJoinRequestMessage [node=TcpDiscoveryNode 
[id=b712cdf2-5e80-4d05-b45c-a113bdeea34b, consistentId=127.0.0.1:47501, 
addrs=ArrayList [127.0.0.1], sockAddrs=HashSet [/127.0.0.1:47501], 
discPort=47501, order=0, intOrder=0, lastExchangeTime=1615819551929, loc=true, 
ver=2.11.0#20210310-sha1:e90f9666, isClient=false], 
dataPacket=org.apache.ignite.spi.discovery.tcp.internal.DiscoveryDataPacket@33a630fa,
 super=TcpDiscoveryAbstractMessage [sndNodeId=null, 
id=3ec89563871-b712cdf2-5e80-4d05-b45c-a113bdeea34b, verifierNodeId=null, 
topVer=0, pendingIdx=0, failedNodes=null, isClient=false]]]

The connection count doesn't increment and spi.joinTimeout is zero. This leads 
to an infinite cycle of cluster joining.

With the fix: a timeout exception after opening the socket and sending the 
message, as expected, with the joining node correctly stopping after 3 failed 
connections.

Got exception while starting (will rollback startup routine).
Command protocol successfully stopped: TCP binary

Java 1.8.261/271/281 Oracle/Open

Works both with and without the patch.

> Extend logging of client failure handling.
> ------------------------------------------
>
>                 Key: IGNITE-14224
>                 URL: https://issues.apache.org/jira/browse/IGNITE-14224
>             Project: Ignite
>          Issue Type: Improvement
>          Components: clients
>    Affects Versions: 2.9.1
>            Reporter: Stanilovsky Evgeny
>            Assignee: Stanilovsky Evgeny
>            Priority: Major
>             Fix For: 2.11
>
>          Time Spent: 1h 10m
>  Remaining Estimate: 0h
>
> # The "Finished serving remote node connection " log message doesn't contain 
> the node id; it should be added (check 
> org.apache.ignite.spi.discovery.tcp.ServerImpl.SocketReader#body).
> # When a client worker is removed from clientMsgWorkers, 
> org.apache.ignite.internal.util.IgniteUtils#closeQuiet(java.lang.AutoCloseable)
> is used to close the socket, so we don't see any exception if one occurs. 
> closeQuiet should be replaced with 
> org.apache.ignite.internal.util.IgniteUtils#close(java.lang.AutoCloseable, 
> org.apache.ignite.IgniteLogger), which works the same way but logs the 
> exception.
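
The difference between the two close styles named in the second item can be sketched as follows. This is not the IgniteUtils source: CloseSketch is a hypothetical class, java.util.logging.Logger stands in for IgniteLogger, and the boolean return value is added here only so the outcome is easy to check.

```java
import java.io.IOException;
import java.util.logging.Level;
import java.util.logging.Logger;

public class CloseSketch {
    /** Closes the resource and swallows any exception, leaving no trace in the log. */
    public static void closeQuiet(AutoCloseable rsrc) {
        if (rsrc != null) {
            try {
                rsrc.close();
            }
            catch (Exception ignored) {
                // Intentionally empty: this is what hides a socket close failure.
            }
        }
    }

    /**
     * Closes the resource and logs any exception instead of discarding it.
     * Returns true if the close succeeded (return value is an addition of this sketch).
     */
    public static boolean close(AutoCloseable rsrc, Logger log) {
        if (rsrc == null)
            return true;

        try {
            rsrc.close();
            return true;
        }
        catch (Exception e) {
            log.log(Level.WARNING, "Failed to close resource: " + e.getMessage(), e);
            return false;
        }
    }

    public static void main(String[] args) {
        AutoCloseable broken = () -> { throw new IOException("socket already reset"); };
        Logger log = Logger.getLogger("CloseSketch");

        closeQuiet(broken); // Nothing is printed; the failure is invisible.
        close(broken, log); // A warning with the exception is logged.
    }
}
```

Both variants leave the caller's control flow unchanged; the only difference is whether the failure shows up in the log.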



--
This message was sent by Atlassian Jira
(v8.3.4#803005)
