[jira] [Commented] (IGNITE-17542) Test CacheLateAffinityAssignmentTest.testAffinitySimpleNoCacheOnCoordinator2 became flaky after IGNITE-17507

2022-08-19 Thread Vyacheslav Koptilin (Jira)


[ 
https://issues.apache.org/jira/browse/IGNITE-17542?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17581802#comment-17581802
 ] 

Vyacheslav Koptilin commented on IGNITE-17542:
--

Hello [~ivandasch],

I have fixed the PR in accordance with your comments. Could you please take a 
look again?

> Test CacheLateAffinityAssignmentTest.testAffinitySimpleNoCacheOnCoordinator2 
> became flaky after IGNITE-17507
> 
>
> Key: IGNITE-17542
> URL: https://issues.apache.org/jira/browse/IGNITE-17542
> Project: Ignite
>  Issue Type: Bug
>Affects Versions: 2.14
>Reporter: Vyacheslav Koptilin
>Assignee: Vyacheslav Koptilin
>Priority: Major
> Fix For: 2.14
>
>  Time Spent: 40m
>  Remaining Estimate: 0h
>
> The test 
> CacheLateAffinityAssignmentTest.testAffinitySimpleNoCacheOnCoordinator2 
> became flay due to IGNITE-17507.
> The root cause of the issue that _CacheAffinityChangeMessage_ mutates the 
> message outside the _disco-notifier_ thread, and this fact may lead to the 
> following exception:
> {noformat}
> [2022-08-16T21:10:32,133][ERROR][tcp-disco-msg-worker-[0448095b 
> 127.0.0.1:47502]-#5308%distributed.CacheLateAffinityAssignmentTest3%-#98199%distributed.CacheLateAffinityAssignmentTest3%][TestTcpDiscoverySpi]
>  TcpDiscoverSpi's message worker thread failed abnormally. Stopping the node 
> in order to prevent cluster wide instability.
>   org.apache.ignite.IgniteException: Failed to marshal mutable discovery 
> message: CacheAffinityChangeMessage 
> [id=ea31ffaa281-0286b465-6baf-4ad8-9e3b-3f8cb755d1dd, topVer=null, 
> exchId=GridDhtPartitionExchangeId [topVer=AffinityTopologyVersion [topVer=6, 
> minorTopVer=0], discoEvt=null, nodeId=f9f9faf0, evt=NODE_LEFT], 
> partsMsg=GridDhtPartitionsFullMessage [parts=HashMap 
> {-2100569601=GridDhtPartitionFullMap 
> {f57cbb85-44ba-40d1-814e-937f96c3=GridDhtPartitionMap [moving=0, 
> top=AffinityTopologyVersion [topVer=6, minorTopVer=0], updateSeq=111, 
> size=100], 0448095b-02d8-470c-ab90-6a5bcf82=GridDhtPartitionMap 
> [moving=0, top=AffinityTopologyVersion [topVer=6, minorTopVer=0], 
> updateSeq=116, size=100]}, 1251687457=GridDhtPartitionFullMap 
> {f57cbb85-44ba-40d1-814e-937f96c3=GridDhtPartitionMap [moving=0, 
> top=AffinityTopologyVersion [topVer=6, minorTopVer=0], updateSeq=1035, 
> size=1024], 0448095b-02d8-470c-ab90-6a5bcf82=GridDhtPartitionMap 
> [moving=0, top=AffinityTopologyVersion [topVer=4, minorTopVer=0], 
> updateSeq=3, size=0]}}, partCntrs=IgniteDhtPartitionCountersMap [], 
> partCntrs2=null, partHistSuppliers=IgniteDhtPartitionHistorySuppliersMap [], 
> partsToReload=IgniteDhtPartitionsToReloadMap [], 
> topVer=AffinityTopologyVersion [topVer=6, minorTopVer=0], errs=null, 
> resTopVer=null, flags=0, partCnt=2, super=GridDhtPartitionsAbstractMessage 
> [exchId=GridDhtPartitionExchangeId [topVer=AffinityTopologyVersion [topVer=6, 
> minorTopVer=0], discoEvt=null, nodeId=f9f9faf0, evt=NODE_LEFT], 
> lastVer=GridCacheVersion [topVer=0, order=1660673425660, nodeOrder=0, 
> dataCenterId=0], super=GridCacheMessage [msgId=-1, depInfo=null, 
> lastAffChangedTopVer=null, err=null, skipPrepare=false]]], 
> exchangeNeeded=false, stopProc=false]
> at 
> org.apache.ignite.spi.discovery.tcp.ServerImpl$RingMessageWorker.notifyDiscoveryListener(ServerImpl.java:6423)
>  ~[classes/:?]
> at 
> org.apache.ignite.spi.discovery.tcp.ServerImpl$RingMessageWorker.processCustomMessage(ServerImpl.java:6243)
>  ~[classes/:?]
> at 
> org.apache.ignite.spi.discovery.tcp.ServerImpl$RingMessageWorker.processMessage(ServerImpl.java:3260)
>  ~[classes/:?]
> at 
> org.apache.ignite.spi.discovery.tcp.ServerImpl$RingMessageWorker.processMessage(ServerImpl.java:2918)
>  ~[classes/:?]
> at 
> org.apache.ignite.spi.discovery.tcp.ServerImpl$MessageWorker.body(ServerImpl.java:8058)
>  ~[classes/:?]
> at 
> org.apache.ignite.spi.discovery.tcp.ServerImpl$RingMessageWorker.body(ServerImpl.java:3089)
>  [classes/:?]
> at 
> org.apache.ignite.internal.util.worker.GridWorker.run(GridWorker.java:125) 
> [classes/:?]
> at 
> org.apache.ignite.spi.discovery.tcp.ServerImpl$MessageWorkerThread.body(ServerImpl.java:7989)
>  [classes/:?]
> at org.apache.ignite.spi.IgniteSpiThread.run(IgniteSpiThread.java:58) 
> [classes/:?]
>   Caused by: org.apache.ignite.IgniteCheckedException: Failed to serialize 
> object: CacheAffinityChangeMessage 
> [id=ea31ffaa281-0286b465-6baf-4ad8-9e3b-3f8cb755d1dd, topVer=null, 
> exchId=GridDhtPartitionExchangeId [topVer=AffinityTopologyVersion [topVer=6, 
> minorTopVer=0], discoEvt=null, nodeId=f9f9faf0, evt=NODE_LEFT], 
> partsMsg=GridDhtPartitionsFullMessage [parts=HashMap 
> {-2100569601=GridDhtPartitionFullMap 
> 

[jira] [Commented] (IGNITE-17542) Test CacheLateAffinityAssignmentTest.testAffinitySimpleNoCacheOnCoordinator2 became flaky after IGNITE-17507

2022-08-19 Thread Ignite TC Bot (Jira)


[ 
https://issues.apache.org/jira/browse/IGNITE-17542?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17581776#comment-17581776
 ] 

Ignite TC Bot commented on IGNITE-17542:


{panel:title=Branch: [pull/10201/head] Base: [master] : No blockers 
found!|borderStyle=dashed|borderColor=#ccc|titleBGColor=#D6F7C1}{panel}
{panel:title=Branch: [pull/10201/head] Base: [master] : No new tests 
found!|borderStyle=dashed|borderColor=#ccc|titleBGColor=#F7D6C1}{panel}
[TeamCity *-- Run :: All* 
Results|https://ci.ignite.apache.org/viewLog.html?buildId=6737235buildTypeId=IgniteTests24Java8_RunAll]

> Test CacheLateAffinityAssignmentTest.testAffinitySimpleNoCacheOnCoordinator2 
> became flaky after IGNITE-17507
> 
>
> Key: IGNITE-17542
> URL: https://issues.apache.org/jira/browse/IGNITE-17542
> Project: Ignite
>  Issue Type: Bug
>Affects Versions: 2.14
>Reporter: Vyacheslav Koptilin
>Assignee: Vyacheslav Koptilin
>Priority: Major
> Fix For: 2.14
>
>  Time Spent: 40m
>  Remaining Estimate: 0h
>
> The test 
> CacheLateAffinityAssignmentTest.testAffinitySimpleNoCacheOnCoordinator2 
> became flay due to IGNITE-17507.
> The root cause of the issue that _CacheAffinityChangeMessage_ mutates the 
> message outside the _disco-notifier_ thread, and this fact may lead to the 
> following exception:
> {noformat}
> [2022-08-16T21:10:32,133][ERROR][tcp-disco-msg-worker-[0448095b 
> 127.0.0.1:47502]-#5308%distributed.CacheLateAffinityAssignmentTest3%-#98199%distributed.CacheLateAffinityAssignmentTest3%][TestTcpDiscoverySpi]
>  TcpDiscoverSpi's message worker thread failed abnormally. Stopping the node 
> in order to prevent cluster wide instability.
>   org.apache.ignite.IgniteException: Failed to marshal mutable discovery 
> message: CacheAffinityChangeMessage 
> [id=ea31ffaa281-0286b465-6baf-4ad8-9e3b-3f8cb755d1dd, topVer=null, 
> exchId=GridDhtPartitionExchangeId [topVer=AffinityTopologyVersion [topVer=6, 
> minorTopVer=0], discoEvt=null, nodeId=f9f9faf0, evt=NODE_LEFT], 
> partsMsg=GridDhtPartitionsFullMessage [parts=HashMap 
> {-2100569601=GridDhtPartitionFullMap 
> {f57cbb85-44ba-40d1-814e-937f96c3=GridDhtPartitionMap [moving=0, 
> top=AffinityTopologyVersion [topVer=6, minorTopVer=0], updateSeq=111, 
> size=100], 0448095b-02d8-470c-ab90-6a5bcf82=GridDhtPartitionMap 
> [moving=0, top=AffinityTopologyVersion [topVer=6, minorTopVer=0], 
> updateSeq=116, size=100]}, 1251687457=GridDhtPartitionFullMap 
> {f57cbb85-44ba-40d1-814e-937f96c3=GridDhtPartitionMap [moving=0, 
> top=AffinityTopologyVersion [topVer=6, minorTopVer=0], updateSeq=1035, 
> size=1024], 0448095b-02d8-470c-ab90-6a5bcf82=GridDhtPartitionMap 
> [moving=0, top=AffinityTopologyVersion [topVer=4, minorTopVer=0], 
> updateSeq=3, size=0]}}, partCntrs=IgniteDhtPartitionCountersMap [], 
> partCntrs2=null, partHistSuppliers=IgniteDhtPartitionHistorySuppliersMap [], 
> partsToReload=IgniteDhtPartitionsToReloadMap [], 
> topVer=AffinityTopologyVersion [topVer=6, minorTopVer=0], errs=null, 
> resTopVer=null, flags=0, partCnt=2, super=GridDhtPartitionsAbstractMessage 
> [exchId=GridDhtPartitionExchangeId [topVer=AffinityTopologyVersion [topVer=6, 
> minorTopVer=0], discoEvt=null, nodeId=f9f9faf0, evt=NODE_LEFT], 
> lastVer=GridCacheVersion [topVer=0, order=1660673425660, nodeOrder=0, 
> dataCenterId=0], super=GridCacheMessage [msgId=-1, depInfo=null, 
> lastAffChangedTopVer=null, err=null, skipPrepare=false]]], 
> exchangeNeeded=false, stopProc=false]
> at 
> org.apache.ignite.spi.discovery.tcp.ServerImpl$RingMessageWorker.notifyDiscoveryListener(ServerImpl.java:6423)
>  ~[classes/:?]
> at 
> org.apache.ignite.spi.discovery.tcp.ServerImpl$RingMessageWorker.processCustomMessage(ServerImpl.java:6243)
>  ~[classes/:?]
> at 
> org.apache.ignite.spi.discovery.tcp.ServerImpl$RingMessageWorker.processMessage(ServerImpl.java:3260)
>  ~[classes/:?]
> at 
> org.apache.ignite.spi.discovery.tcp.ServerImpl$RingMessageWorker.processMessage(ServerImpl.java:2918)
>  ~[classes/:?]
> at 
> org.apache.ignite.spi.discovery.tcp.ServerImpl$MessageWorker.body(ServerImpl.java:8058)
>  ~[classes/:?]
> at 
> org.apache.ignite.spi.discovery.tcp.ServerImpl$RingMessageWorker.body(ServerImpl.java:3089)
>  [classes/:?]
> at 
> org.apache.ignite.internal.util.worker.GridWorker.run(GridWorker.java:125) 
> [classes/:?]
> at 
> org.apache.ignite.spi.discovery.tcp.ServerImpl$MessageWorkerThread.body(ServerImpl.java:7989)
>  [classes/:?]
> at org.apache.ignite.spi.IgniteSpiThread.run(IgniteSpiThread.java:58) 
> [classes/:?]
>   Caused by: org.apache.ignite.IgniteCheckedException: Failed to serialize 
> object: CacheAffinityChangeMessage 
>