Ivan Rakov created IGNITE-11762:
-----------------------------------
Summary: Test testClientStartCloseServersRestart causes hang of
the whole Cache 2 suite in master
Key: IGNITE-11762
URL: https://issues.apache.org/jira/browse/IGNITE-11762
Project: Ignite
Issue Type: Bug
Reporter: Ivan Rakov
Assignee: Pavel Kovalenko
Fix For: 2.8
Attempt to restart server node in test hangs:
{code:java}
[2019-04-16 19:56:45,049][WARN ][restart-1][GridCachePartitionExchangeManager]
Failed to wait for initial partition map exchange. Possible reasons are:
^-- Transactions in deadlock.
^-- Long running transactions (ignore if this is the case).
^-- Unreleased explicit locks.
{code}
The reason is that previous PME (late affinity assignment) still hangs due to
pending transaction:
{code:java}
[2019-04-16 19:56:23,717][WARN
][exchange-worker-#1039%cache.IgniteClientCacheStartFailoverTest3%][diagnostic]
Pending transactions:
[2019-04-16 19:56:23,718][WARN
][exchange-worker-#1039%cache.IgniteClientCacheStartFailoverTest3%][diagnostic]
>>> [txVer=AffinityTopologyVersion [topVer=11, minorTopVer=0], exchWait=true,
tx=GridDhtTxLocal [nearNodeId=8559bfe0-3d4a-4090-a457-6df0eba00005,
nearFutId=1edc7172a61-941f9dde-2b60-4a1f-8213-7d23d738bf33, nearMiniId=1,
nearFinFutId=null, nearFinMiniId=0, nearXidVer=GridCacheVersion
[topVer=166913752, order=1555433759036, nodeOrder=6], lb=null,
super=GridDhtTxLocalAdapter [nearOnOriginatingNode=false, nearNodes=KeySetView
[], dhtNodes=KeySetView [9ef33532-0e4a-4561-b57e-042afe100000],
explicitLock=false, super=IgniteTxLocalAdapter [completedBase=null,
sndTransformedVals=false, depEnabled=false, txState=IgniteTxStateImpl
[activeCacheIds=[-1062368467], recovery=false, mvccEnabled=true,
mvccCachingCacheIds=[], txMap=HashSet []], super=IgniteTxAdapter
[xidVer=GridCacheVersion [topVer=166913752, order=1555433759045, nodeOrder=10],
writeVer=null, implicit=false, loc=true, threadId=1210,
startTime=1555433762847, nodeId=0088e9b8-f859-4d14-8071-6388e4700003,
startVer=GridCacheVersion [topVer=166913752, order=1555433759045,
nodeOrder=10], endVer=null, isolation=REPEATABLE_READ, concurrency=PESSIMISTIC,
timeout=0, sysInvalidate=false, sys=false, plc=2, commitVer=GridCacheVersion
[topVer=166913752, order=1555433759045, nodeOrder=10], finalizing=NONE,
invalidParts=null, state=MARKED_ROLLBACK, timedOut=false,
topVer=AffinityTopologyVersion [topVer=11, minorTopVer=0],
mvccSnapshot=MvccSnapshotResponse [futId=292, crdVer=1555433741506, cntr=395,
opCntr=1, txs=[394], cleanupVer=390, tracking=0], skipCompletedVers=false,
parentTx=null, duration=20866ms, onePhaseCommit=false], size=0]]]]
{code}
However, load threads don't start any explicit transactions: they either hang
on put()/get() or on clientCache.close().
Rolling back IGNITE-10799 resolves the issue (however, test remains flaky with
~10% fail rate due to unhandled TransactionSerializationException).
--
This message was sent by Atlassian JIRA
(v7.6.3#76005)