[
https://issues.apache.org/jira/browse/IGNITE-21630?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17822422#comment-17822422
]
Ignite TC Bot commented on IGNITE-21630:
----------------------------------------
{panel:title=Branch: [pull/11260/head] Base: [master] : No blockers found!|borderStyle=dashed|borderColor=#ccc|titleBGColor=#D6F7C1}{panel}
{panel:title=Branch: [pull/11260/head] Base: [master] : No new tests found!|borderStyle=dashed|borderColor=#ccc|titleBGColor=#F7D6C1}{panel}
[TeamCity *--> Run :: All* Results|https://ci2.ignite.apache.org/viewLog.html?buildId=7765945&buildTypeId=IgniteTests24Java8_RunAll]
> Cluster falls apart on topology change when DNS service is unavailable
> -----------------------------------------------------------------------
>
> Key: IGNITE-21630
> URL: https://issues.apache.org/jira/browse/IGNITE-21630
> Project: Ignite
> Issue Type: Bug
> Reporter: Aleksey Plekhanov
> Assignee: Aleksey Plekhanov
> Priority: Major
> Labels: ise
> Time Spent: 10m
> Remaining Estimate: 0h
>
> Requests to the DNS service are performed synchronously by some critical
> discovery threads. The timeout for such requests can't be controlled from Java
> code (see [https://bugs.openjdk.org/browse/JDK-6450279]). When the DNS service
> is unavailable, this leads to node segmentation and the cluster falling apart.
> For example, the stack trace of a {{tcp-disco-msg-worker}} thread performing a
> request to the DNS service:
> {noformat}
> at java.net.InetAddress$2.lookupAllHostAddr(InetAddress.java:929)
> at java.net.InetAddress.getAddressesFromNameService(InetAddress.java:1330)
> at java.net.InetAddress.getAllByName0(InetAddress.java:1283)
> at java.net.InetAddress.getAllByName(InetAddress.java:1199)
> at java.net.InetAddress.getAllByName(InetAddress.java:1127)
> at java.net.InetAddress.getByName(InetAddress.java:1077)
> at java.net.InetSocketAddress.<init>(InetSocketAddress.java:220)
> at org.apache.ignite.internal.util.IgniteUtils.createResolved(IgniteUtils.java:9829)
> at org.apache.ignite.internal.util.IgniteUtils.toSocketAddresses(IgniteUtils.java:9792)
> at org.apache.ignite.internal.util.IgniteUtils.toSocketAddresses(IgniteUtils.java:9770)
> at org.apache.ignite.spi.discovery.tcp.internal.TcpDiscoveryNode.socketAddresses(TcpDiscoveryNode.java:392)
> at org.apache.ignite.spi.discovery.tcp.TcpDiscoverySpi.getNodeAddresses(TcpDiscoverySpi.java:1267)
> at org.apache.ignite.spi.discovery.tcp.ServerImpl.interruptPing(ServerImpl.java:985)
> at org.apache.ignite.spi.discovery.tcp.ServerImpl.access$6800(ServerImpl.java:206)
> at org.apache.ignite.spi.discovery.tcp.ServerImpl$RingMessageWorker.processNodeLeftMessage(ServerImpl.java:5433)
> at org.apache.ignite.spi.discovery.tcp.ServerImpl$RingMessageWorker.processMessage(ServerImpl.java:3221)
> at org.apache.ignite.spi.discovery.tcp.ServerImpl$RingMessageWorker.processMessage(ServerImpl.java:2894)
> {noformat}
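
The trace above shows that constructing an {{InetSocketAddress}} from a host name calls {{InetAddress.getByName}} on the calling thread. One general way to bound that wait is to run the lookup on a helper thread and fall back to an unresolved address on timeout. The sketch below illustrates this idea only; it is not the fix applied in the pull request, and the class name {{BoundedDnsLookup}}, the method {{resolve}}, and the timeout and port values are hypothetical.

```java
import java.net.InetAddress;
import java.net.InetSocketAddress;
import java.util.concurrent.Callable;
import java.util.concurrent.ExecutionException;
import java.util.concurrent.ExecutorService;
import java.util.concurrent.Executors;
import java.util.concurrent.Future;
import java.util.concurrent.TimeUnit;
import java.util.concurrent.TimeoutException;

/** Hypothetical helper (not part of Ignite): bounds the caller's wait on a DNS lookup. */
public class BoundedDnsLookup {
    /** Daemon threads, so a lookup stuck in native code does not keep the JVM alive. */
    private static final ExecutorService RESOLVER = Executors.newCachedThreadPool(r -> {
        Thread t = new Thread(r, "dns-resolver");
        t.setDaemon(true);
        return t;
    });

    /**
     * Resolves {@code host} on a helper thread and waits at most {@code timeoutMs}.
     * On timeout or lookup failure, returns an unresolved address instead of blocking.
     */
    static InetSocketAddress resolve(String host, int port, long timeoutMs) {
        Callable<InetAddress> lookup = () -> InetAddress.getByName(host);
        Future<InetAddress> fut = RESOLVER.submit(lookup);

        try {
            return new InetSocketAddress(fut.get(timeoutMs, TimeUnit.MILLISECONDS), port);
        }
        catch (TimeoutException | ExecutionException e) {
            // The helper thread may stay blocked; only the caller is released.
            fut.cancel(true);
            return InetSocketAddress.createUnresolved(host, port);
        }
        catch (InterruptedException e) {
            Thread.currentThread().interrupt();
            fut.cancel(true);
            return InetSocketAddress.createUnresolved(host, port);
        }
    }

    public static void main(String[] args) {
        // An IP literal needs no DNS request, so this returns immediately.
        InetSocketAddress addr = resolve("127.0.0.1", 47500, 500);
        System.out.println(addr + " unresolved=" + addr.isUnresolved());
    }
}
```

The lookup itself still cannot be interrupted, so a helper thread can remain blocked until the resolver gives up; the sketch only keeps the calling thread responsive. Callers must also be prepared to handle an address for which {{isUnresolved()}} returns true.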