[jira] [Updated] (IGNITE-11762) Test testClientStartCloseServersRestart causes hang of the whole Cache 2 suite in master
[ https://issues.apache.org/jira/browse/IGNITE-11762?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Maxim Muzafarov updated IGNITE-11762:
------------------------------------
    Fix Version/s:     (was: 2.8)

> Test testClientStartCloseServersRestart causes hang of the whole Cache 2 suite in master
> ----------------------------------------------------------------------------------------
>
>                 Key: IGNITE-11762
>                 URL: https://issues.apache.org/jira/browse/IGNITE-11762
>             Project: Ignite
>          Issue Type: Bug
>            Reporter: Ivan Rakov
>            Assignee: Pavel Kovalenko
>            Priority: Major
>
> Attempt to restart server node in test hangs:
> {code:java}
> [2019-04-16 19:56:45,049][WARN ][restart-1][GridCachePartitionExchangeManager] Failed to wait for initial partition map exchange. Possible reasons are:
>   ^-- Transactions in deadlock.
>   ^-- Long running transactions (ignore if this is the case).
>   ^-- Unreleased explicit locks.
> {code}
> The reason is that previous PME (late affinity assignment) still hangs due to pending transaction:
> {code:java}
> [2019-04-16 19:56:23,717][WARN ][exchange-worker-#1039%cache.IgniteClientCacheStartFailoverTest3%][diagnostic] Pending transactions:
> [2019-04-16 19:56:23,718][WARN ][exchange-worker-#1039%cache.IgniteClientCacheStartFailoverTest3%][diagnostic] >>> [txVer=AffinityTopologyVersion [topVer=11, minorTopVer=0], exchWait=true, tx=GridDhtTxLocal [nearNodeId=8559bfe0-3d4a-4090-a457-6df0eba5, nearFutId=1edc7172a61-941f9dde-2b60-4a1f-8213-7d23d738bf33, nearMiniId=1, nearFinFutId=null, nearFinMiniId=0, nearXidVer=GridCacheVersion [topVer=166913752, order=1555433759036, nodeOrder=6], lb=null, super=GridDhtTxLocalAdapter [nearOnOriginatingNode=false, nearNodes=KeySetView [], dhtNodes=KeySetView [9ef33532-0e4a-4561-b57e-042afe10], explicitLock=false, super=IgniteTxLocalAdapter [completedBase=null, sndTransformedVals=false, depEnabled=false, txState=IgniteTxStateImpl [activeCacheIds=[-1062368467], recovery=false, mvccEnabled=true, mvccCachingCacheIds=[], txMap=HashSet []], super=IgniteTxAdapter [xidVer=GridCacheVersion [topVer=166913752, order=1555433759045, nodeOrder=10], writeVer=null, implicit=false, loc=true, threadId=1210, startTime=1555433762847, nodeId=0088e9b8-f859-4d14-8071-6388e473, startVer=GridCacheVersion [topVer=166913752, order=1555433759045, nodeOrder=10], endVer=null, isolation=REPEATABLE_READ, concurrency=PESSIMISTIC, timeout=0, sysInvalidate=false, sys=false, plc=2, commitVer=GridCacheVersion [topVer=166913752, order=1555433759045, nodeOrder=10], finalizing=NONE, invalidParts=null, state=MARKED_ROLLBACK, timedOut=false, topVer=AffinityTopologyVersion [topVer=11, minorTopVer=0], mvccSnapshot=MvccSnapshotResponse [futId=292, crdVer=1555433741506, cntr=395, opCntr=1, txs=[394], cleanupVer=390, tracking=0], skipCompletedVers=false, parentTx=null, duration=20866ms, onePhaseCommit=false], size=0
> {code}
> However, load threads don't start any explicit transactions: they either hang on put()/get() or on clientCache.close().
> Rolling back IGNITE-10799 resolves the issue (however, test remains flaky with ~10% fail rate due to unhandled TransactionSerializationException).

--
This message was sent by Atlassian Jira
(v8.3.4#803005)
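
Archive note: the pending-transaction dump quoted above carries both startTime=1555433762847 (epoch milliseconds) and duration=20866ms, so the two fields can be cross-checked against the log timestamps. The following is only a minimal sketch of that arithmetic. The class and method names (PendingTxTimeline, dumpInstant) are hypothetical and not part of Ignite, and the UTC+3 offset for the log timestamps is an assumption, chosen because it makes the derived time line up with the 19:56:23 diagnostic entries.

```java
import java.time.Duration;
import java.time.Instant;
import java.time.LocalTime;
import java.time.ZoneOffset;

public class PendingTxTimeline {
    /** Adds the tx duration to its start time to get the moment the dump was taken. */
    static Instant dumpInstant(long startTimeMillis, long durationMillis) {
        return Instant.ofEpochMilli(startTimeMillis).plus(Duration.ofMillis(durationMillis));
    }

    /** Renders an instant as wall-clock time at a fixed hour offset from UTC. */
    static LocalTime atOffsetHours(Instant instant, int hours) {
        return instant.atOffset(ZoneOffset.ofHours(hours)).toLocalTime();
    }

    public static void main(String[] args) {
        long startTime = 1555433762847L; // startTime=... field from the dump
        long duration = 20866L;          // duration=20866ms field from the dump

        Instant started = Instant.ofEpochMilli(startTime);
        Instant dumped = dumpInstant(startTime, duration);

        System.out.println("tx started (UTC): " + started); // 2019-04-16T16:56:02.847Z
        System.out.println("dump taken (UTC): " + dumped);  // 2019-04-16T16:56:23.713Z

        // Assumption: the test logs were written in a UTC+3 time zone.
        System.out.println("dump taken (UTC+3): " + atOffsetHours(dumped, 3)); // 19:56:23.713
    }
}
```

Under that assumption the transaction had been open for about 21 seconds when the exchange worker listed it, and the restart-1 thread's "Failed to wait for initial partition map exchange" warning at 19:56:45 followed roughly 21 seconds after the dump.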
[jira] [Updated] (IGNITE-11762) Test testClientStartCloseServersRestart causes hang of the whole Cache 2 suite in master
[ https://issues.apache.org/jira/browse/IGNITE-11762?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Alexey Goncharuk updated IGNITE-11762:
-------------------------------------
    Ignite Flags:   (was: Docs Required)

> Test testClientStartCloseServersRestart causes hang of the whole Cache 2 suite in master
> ----------------------------------------------------------------------------------------
>
>                 Key: IGNITE-11762
>                 URL: https://issues.apache.org/jira/browse/IGNITE-11762
>             Project: Ignite
>          Issue Type: Bug
>            Reporter: Ivan Rakov
>            Assignee: Pavel Kovalenko
>            Priority: Major
>             Fix For: 2.8
>
> Attempt to restart server node in test hangs:
> {code:java}
> [2019-04-16 19:56:45,049][WARN ][restart-1][GridCachePartitionExchangeManager] Failed to wait for initial partition map exchange. Possible reasons are:
>   ^-- Transactions in deadlock.
>   ^-- Long running transactions (ignore if this is the case).
>   ^-- Unreleased explicit locks.
> {code}
> The reason is that previous PME (late affinity assignment) still hangs due to pending transaction:
> {code:java}
> [2019-04-16 19:56:23,717][WARN ][exchange-worker-#1039%cache.IgniteClientCacheStartFailoverTest3%][diagnostic] Pending transactions:
> [2019-04-16 19:56:23,718][WARN ][exchange-worker-#1039%cache.IgniteClientCacheStartFailoverTest3%][diagnostic] >>> [txVer=AffinityTopologyVersion [topVer=11, minorTopVer=0], exchWait=true, tx=GridDhtTxLocal [nearNodeId=8559bfe0-3d4a-4090-a457-6df0eba5, nearFutId=1edc7172a61-941f9dde-2b60-4a1f-8213-7d23d738bf33, nearMiniId=1, nearFinFutId=null, nearFinMiniId=0, nearXidVer=GridCacheVersion [topVer=166913752, order=1555433759036, nodeOrder=6], lb=null, super=GridDhtTxLocalAdapter [nearOnOriginatingNode=false, nearNodes=KeySetView [], dhtNodes=KeySetView [9ef33532-0e4a-4561-b57e-042afe10], explicitLock=false, super=IgniteTxLocalAdapter [completedBase=null, sndTransformedVals=false, depEnabled=false, txState=IgniteTxStateImpl [activeCacheIds=[-1062368467], recovery=false, mvccEnabled=true, mvccCachingCacheIds=[], txMap=HashSet []], super=IgniteTxAdapter [xidVer=GridCacheVersion [topVer=166913752, order=1555433759045, nodeOrder=10], writeVer=null, implicit=false, loc=true, threadId=1210, startTime=1555433762847, nodeId=0088e9b8-f859-4d14-8071-6388e473, startVer=GridCacheVersion [topVer=166913752, order=1555433759045, nodeOrder=10], endVer=null, isolation=REPEATABLE_READ, concurrency=PESSIMISTIC, timeout=0, sysInvalidate=false, sys=false, plc=2, commitVer=GridCacheVersion [topVer=166913752, order=1555433759045, nodeOrder=10], finalizing=NONE, invalidParts=null, state=MARKED_ROLLBACK, timedOut=false, topVer=AffinityTopologyVersion [topVer=11, minorTopVer=0], mvccSnapshot=MvccSnapshotResponse [futId=292, crdVer=1555433741506, cntr=395, opCntr=1, txs=[394], cleanupVer=390, tracking=0], skipCompletedVers=false, parentTx=null, duration=20866ms, onePhaseCommit=false], size=0
> {code}
> However, load threads don't start any explicit transactions: they either hang on put()/get() or on clientCache.close().
> Rolling back IGNITE-10799 resolves the issue (however, test remains flaky with ~10% fail rate due to unhandled TransactionSerializationException).

--
This message was sent by Atlassian JIRA
(v7.6.3#76005)
[jira] [Updated] (IGNITE-11762) Test testClientStartCloseServersRestart causes hang of the whole Cache 2 suite in master
[ https://issues.apache.org/jira/browse/IGNITE-11762?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Ivan Rakov updated IGNITE-11762:
-------------------------------
    Description: 
Attempt to restart server node in test hangs:
{code:java}
[2019-04-16 19:56:45,049][WARN ][restart-1][GridCachePartitionExchangeManager] Failed to wait for initial partition map exchange. Possible reasons are:
  ^-- Transactions in deadlock.
  ^-- Long running transactions (ignore if this is the case).
  ^-- Unreleased explicit locks.
{code}
The reason is that previous PME (late affinity assignment) still hangs due to pending transaction:
{code:java}
[2019-04-16 19:56:23,717][WARN ][exchange-worker-#1039%cache.IgniteClientCacheStartFailoverTest3%][diagnostic] Pending transactions:
[2019-04-16 19:56:23,718][WARN ][exchange-worker-#1039%cache.IgniteClientCacheStartFailoverTest3%][diagnostic] >>> [txVer=AffinityTopologyVersion [topVer=11, minorTopVer=0], exchWait=true, tx=GridDhtTxLocal [nearNodeId=8559bfe0-3d4a-4090-a457-6df0eba5, nearFutId=1edc7172a61-941f9dde-2b60-4a1f-8213-7d23d738bf33, nearMiniId=1, nearFinFutId=null, nearFinMiniId=0, nearXidVer=GridCacheVersion [topVer=166913752, order=1555433759036, nodeOrder=6], lb=null, super=GridDhtTxLocalAdapter [nearOnOriginatingNode=false, nearNodes=KeySetView [], dhtNodes=KeySetView [9ef33532-0e4a-4561-b57e-042afe10], explicitLock=false, super=IgniteTxLocalAdapter [completedBase=null, sndTransformedVals=false, depEnabled=false, txState=IgniteTxStateImpl [activeCacheIds=[-1062368467], recovery=false, mvccEnabled=true, mvccCachingCacheIds=[], txMap=HashSet []], super=IgniteTxAdapter [xidVer=GridCacheVersion [topVer=166913752, order=1555433759045, nodeOrder=10], writeVer=null, implicit=false, loc=true, threadId=1210, startTime=1555433762847, nodeId=0088e9b8-f859-4d14-8071-6388e473, startVer=GridCacheVersion [topVer=166913752, order=1555433759045, nodeOrder=10], endVer=null, isolation=REPEATABLE_READ, concurrency=PESSIMISTIC, timeout=0, sysInvalidate=false, sys=false, plc=2, commitVer=GridCacheVersion [topVer=166913752, order=1555433759045, nodeOrder=10], finalizing=NONE, invalidParts=null, state=MARKED_ROLLBACK, timedOut=false, topVer=AffinityTopologyVersion [topVer=11, minorTopVer=0], mvccSnapshot=MvccSnapshotResponse [futId=292, crdVer=1555433741506, cntr=395, opCntr=1, txs=[394], cleanupVer=390, tracking=0], skipCompletedVers=false, parentTx=null, duration=20866ms, onePhaseCommit=false], size=0
{code}
However, load threads don't start any explicit transactions: they either hang on put()/get() or on clientCache.close().
Rolling back IGNITE-10799 resolves the issue (however, test remains flaky with ~10% fail rate due to unhandled TransactionSerializationException).

  was:
Attempt to restart server node in test hangs:
{code:java}
[2019-04-16 19:56:45,049][WARN ][restart-1][GridCachePartitionExchangeManager] Failed to wait for initial partition map exchange. Possible reasons are:
  ^-- Transactions in deadlock.
  ^-- Long running transactions (ignore if this is the case).
  ^-- Unreleased explicit locks.
{code}
The reason is that previous PME (late affinity assignment) still hangs due to pending transaction:
{code:java}
[2019-04-16 19:56:23,717][WARN ][exchange-worker-#1039%cache.IgniteClientCacheStartFailoverTest3%][diagnostic] Pending transactions:
[2019-04-16 19:56:23,718][WARN ][exchange-worker-#1039%cache.IgniteClientCacheStartFailoverTest3%][diagnostic] >>> [txVer=AffinityTopologyVersion [topVer=11, minorTopVer=0], exchWait=true, tx=GridDhtTxLocal [nearNodeId=8559bfe0-3d4a-4090-a457-6df0eba5, nearFutId=1edc7172a61-941f9dde-2b60-4a1f-8213-7d23d738bf33, nearMiniId=1, nearFinFutId=null, nearFinMiniId=0, nearXidVer=GridCacheVersion [topVer=166913752, order=1555433759036, nodeOrder=6], lb=null, super=GridDhtTxLocalAdapter [nearOnOriginatingNode=false, nearNodes=KeySetView [], dhtNodes=KeySetView [9ef33532-0e4a-4561-b57e-042afe10], explicitLock=false, super=IgniteTxLocalAdapter [completedBase=null, sndTransformedVals=false, depEnabled=false, txState=IgniteTxStateImpl [activeCacheIds=[-1062368467], recovery=false, mvccEnabled=true, mvccCachingCacheIds=[], txMap=HashSet []], super=IgniteTxAdapter [xidVer=GridCacheVersion [topVer=166913752, order=1555433759045, nodeOrder=10], writeVer=null, implicit=false, loc=true, threadId=1210, startTime=1555433762847, nodeId=0088e9b8-f859-4d14-8071-6388e473, startVer=GridCacheVersion [topVer=166913752, order=1555433759045, nodeOrder=10], endVer=null, isolation=REPEATABLE_READ, concurrency=PESSIMISTIC, timeout=0, sysInvalidate=false, sys=false, plc=2, commitVer=GridCacheVersion [topVer=166913752, order=1555433759045, nodeOrder=10], finalizing=NONE, invalidParts=null, state=MARKED_ROLLBACK, timedOut=false, topVer=AffinityTopologyVersion [topVer=11, minorTopVer=0], mvccSnapshot=MvccSnapshotResponse [futId=292, crdVer=1555433741506,