[jira] [Updated] (IGNITE-11762) Test testClientStartCloseServersRestart causes hang of the whole Cache 2 suite in master

2019-10-10 Thread Maxim Muzafarov (Jira)


 [ 
https://issues.apache.org/jira/browse/IGNITE-11762?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Maxim Muzafarov updated IGNITE-11762:
-
Fix Version/s: (was: 2.8)

> Test testClientStartCloseServersRestart causes hang of the whole Cache 2 
> suite in master
> 
>
> Key: IGNITE-11762
> URL: https://issues.apache.org/jira/browse/IGNITE-11762
> Project: Ignite
>  Issue Type: Bug
>Reporter: Ivan Rakov
>Assignee: Pavel Kovalenko
>Priority: Major
>
> Attempt to restart server node in test hangs:
> {code:java}
> [2019-04-16 19:56:45,049][WARN 
> ][restart-1][GridCachePartitionExchangeManager] Failed to wait for initial 
> partition map exchange. Possible reasons are:
> ^-- Transactions in deadlock.
> ^-- Long running transactions (ignore if this is the case).
> ^-- Unreleased explicit locks.
> {code}
> The reason is that previous PME (late affinity assignment) still hangs due to 
> pending transaction:
> {code:java}
> [2019-04-16 19:56:23,717][WARN 
> ][exchange-worker-#1039%cache.IgniteClientCacheStartFailoverTest3%][diagnostic]
>  Pending transactions:
> [2019-04-16 19:56:23,718][WARN 
> ][exchange-worker-#1039%cache.IgniteClientCacheStartFailoverTest3%][diagnostic]
>  >>> [txVer=AffinityTopologyVersion [topVer=11, minorTopVer=0], 
> exchWait=true, tx=GridDhtTxLocal 
> [nearNodeId=8559bfe0-3d4a-4090-a457-6df0eba5, 
> nearFutId=1edc7172a61-941f9dde-2b60-4a1f-8213-7d23d738bf33, nearMiniId=1, 
> nearFinFutId=null, nearFinMiniId=0, nearXidVer=GridCacheVersion 
> [topVer=166913752, order=1555433759036, nodeOrder=6], lb=null, 
> super=GridDhtTxLocalAdapter [nearOnOriginatingNode=false, 
> nearNodes=KeySetView [], dhtNodes=KeySetView 
> [9ef33532-0e4a-4561-b57e-042afe10], explicitLock=false, 
> super=IgniteTxLocalAdapter [completedBase=null, sndTransformedVals=false, 
> depEnabled=false, txState=IgniteTxStateImpl [activeCacheIds=[-1062368467], 
> recovery=false, mvccEnabled=true, mvccCachingCacheIds=[], txMap=HashSet []], 
> super=IgniteTxAdapter [xidVer=GridCacheVersion [topVer=166913752, 
> order=1555433759045, nodeOrder=10], writeVer=null, implicit=false, loc=true, 
> threadId=1210, startTime=1555433762847, 
> nodeId=0088e9b8-f859-4d14-8071-6388e473, startVer=GridCacheVersion 
> [topVer=166913752, order=1555433759045, nodeOrder=10], endVer=null, 
> isolation=REPEATABLE_READ, concurrency=PESSIMISTIC, timeout=0, 
> sysInvalidate=false, sys=false, plc=2, commitVer=GridCacheVersion 
> [topVer=166913752, order=1555433759045, nodeOrder=10], finalizing=NONE, 
> invalidParts=null, state=MARKED_ROLLBACK, timedOut=false, 
> topVer=AffinityTopologyVersion [topVer=11, minorTopVer=0], 
> mvccSnapshot=MvccSnapshotResponse [futId=292, crdVer=1555433741506, cntr=395, 
> opCntr=1, txs=[394], cleanupVer=390, tracking=0], skipCompletedVers=false, 
> parentTx=null, duration=20866ms, onePhaseCommit=false], size=0
> {code}
> However, load threads don't start any explicit transactions: they either hang 
> on put()/get() or on clientCache.close().
> Rolling back IGNITE-10799 resolves the issue (however, test remains flaky 
> with ~10% fail rate due to unhandled TransactionSerializationException).
>  



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Updated] (IGNITE-11762) Test testClientStartCloseServersRestart causes hang of the whole Cache 2 suite in master

2019-04-22 Thread Alexey Goncharuk (JIRA)


 [ 
https://issues.apache.org/jira/browse/IGNITE-11762?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Alexey Goncharuk updated IGNITE-11762:
--
Ignite Flags:   (was: Docs Required)

> Test testClientStartCloseServersRestart causes hang of the whole Cache 2 
> suite in master
> 
>
> Key: IGNITE-11762
> URL: https://issues.apache.org/jira/browse/IGNITE-11762
> Project: Ignite
>  Issue Type: Bug
>Reporter: Ivan Rakov
>Assignee: Pavel Kovalenko
>Priority: Major
> Fix For: 2.8
>
>
> Attempt to restart server node in test hangs:
> {code:java}
> [2019-04-16 19:56:45,049][WARN 
> ][restart-1][GridCachePartitionExchangeManager] Failed to wait for initial 
> partition map exchange. Possible reasons are:
> ^-- Transactions in deadlock.
> ^-- Long running transactions (ignore if this is the case).
> ^-- Unreleased explicit locks.
> {code}
> The reason is that previous PME (late affinity assignment) still hangs due to 
> pending transaction:
> {code:java}
> [2019-04-16 19:56:23,717][WARN 
> ][exchange-worker-#1039%cache.IgniteClientCacheStartFailoverTest3%][diagnostic]
>  Pending transactions:
> [2019-04-16 19:56:23,718][WARN 
> ][exchange-worker-#1039%cache.IgniteClientCacheStartFailoverTest3%][diagnostic]
>  >>> [txVer=AffinityTopologyVersion [topVer=11, minorTopVer=0], 
> exchWait=true, tx=GridDhtTxLocal 
> [nearNodeId=8559bfe0-3d4a-4090-a457-6df0eba5, 
> nearFutId=1edc7172a61-941f9dde-2b60-4a1f-8213-7d23d738bf33, nearMiniId=1, 
> nearFinFutId=null, nearFinMiniId=0, nearXidVer=GridCacheVersion 
> [topVer=166913752, order=1555433759036, nodeOrder=6], lb=null, 
> super=GridDhtTxLocalAdapter [nearOnOriginatingNode=false, 
> nearNodes=KeySetView [], dhtNodes=KeySetView 
> [9ef33532-0e4a-4561-b57e-042afe10], explicitLock=false, 
> super=IgniteTxLocalAdapter [completedBase=null, sndTransformedVals=false, 
> depEnabled=false, txState=IgniteTxStateImpl [activeCacheIds=[-1062368467], 
> recovery=false, mvccEnabled=true, mvccCachingCacheIds=[], txMap=HashSet []], 
> super=IgniteTxAdapter [xidVer=GridCacheVersion [topVer=166913752, 
> order=1555433759045, nodeOrder=10], writeVer=null, implicit=false, loc=true, 
> threadId=1210, startTime=1555433762847, 
> nodeId=0088e9b8-f859-4d14-8071-6388e473, startVer=GridCacheVersion 
> [topVer=166913752, order=1555433759045, nodeOrder=10], endVer=null, 
> isolation=REPEATABLE_READ, concurrency=PESSIMISTIC, timeout=0, 
> sysInvalidate=false, sys=false, plc=2, commitVer=GridCacheVersion 
> [topVer=166913752, order=1555433759045, nodeOrder=10], finalizing=NONE, 
> invalidParts=null, state=MARKED_ROLLBACK, timedOut=false, 
> topVer=AffinityTopologyVersion [topVer=11, minorTopVer=0], 
> mvccSnapshot=MvccSnapshotResponse [futId=292, crdVer=1555433741506, cntr=395, 
> opCntr=1, txs=[394], cleanupVer=390, tracking=0], skipCompletedVers=false, 
> parentTx=null, duration=20866ms, onePhaseCommit=false], size=0
> {code}
> However, load threads don't start any explicit transactions: they either hang 
> on put()/get() or on clientCache.close().
> Rolling back IGNITE-10799 resolves the issue (however, test remains flaky 
> with ~10% fail rate due to unhandled TransactionSerializationException).
>  



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)


[jira] [Updated] (IGNITE-11762) Test testClientStartCloseServersRestart causes hang of the whole Cache 2 suite in master

2019-04-16 Thread Ivan Rakov (JIRA)


 [ 
https://issues.apache.org/jira/browse/IGNITE-11762?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Ivan Rakov updated IGNITE-11762:

Description: 
Attempt to restart server node in test hangs:
{code:java}
[2019-04-16 19:56:45,049][WARN ][restart-1][GridCachePartitionExchangeManager] 
Failed to wait for initial partition map exchange. Possible reasons are:
^-- Transactions in deadlock.
^-- Long running transactions (ignore if this is the case).
^-- Unreleased explicit locks.
{code}
The reason is that previous PME (late affinity assignment) still hangs due to 
pending transaction:
{code:java}
[2019-04-16 19:56:23,717][WARN 
][exchange-worker-#1039%cache.IgniteClientCacheStartFailoverTest3%][diagnostic] 
Pending transactions:
[2019-04-16 19:56:23,718][WARN 
][exchange-worker-#1039%cache.IgniteClientCacheStartFailoverTest3%][diagnostic] 
>>> [txVer=AffinityTopologyVersion [topVer=11, minorTopVer=0], exchWait=true, 
tx=GridDhtTxLocal [nearNodeId=8559bfe0-3d4a-4090-a457-6df0eba5, 
nearFutId=1edc7172a61-941f9dde-2b60-4a1f-8213-7d23d738bf33, nearMiniId=1, 
nearFinFutId=null, nearFinMiniId=0, nearXidVer=GridCacheVersion 
[topVer=166913752, order=1555433759036, nodeOrder=6], lb=null, 
super=GridDhtTxLocalAdapter [nearOnOriginatingNode=false, nearNodes=KeySetView 
[], dhtNodes=KeySetView [9ef33532-0e4a-4561-b57e-042afe10], 
explicitLock=false, super=IgniteTxLocalAdapter [completedBase=null, 
sndTransformedVals=false, depEnabled=false, txState=IgniteTxStateImpl 
[activeCacheIds=[-1062368467], recovery=false, mvccEnabled=true, 
mvccCachingCacheIds=[], txMap=HashSet []], super=IgniteTxAdapter 
[xidVer=GridCacheVersion [topVer=166913752, order=1555433759045, nodeOrder=10], 
writeVer=null, implicit=false, loc=true, threadId=1210, 
startTime=1555433762847, nodeId=0088e9b8-f859-4d14-8071-6388e473, 
startVer=GridCacheVersion [topVer=166913752, order=1555433759045, 
nodeOrder=10], endVer=null, isolation=REPEATABLE_READ, concurrency=PESSIMISTIC, 
timeout=0, sysInvalidate=false, sys=false, plc=2, commitVer=GridCacheVersion 
[topVer=166913752, order=1555433759045, nodeOrder=10], finalizing=NONE, 
invalidParts=null, state=MARKED_ROLLBACK, timedOut=false, 
topVer=AffinityTopologyVersion [topVer=11, minorTopVer=0], 
mvccSnapshot=MvccSnapshotResponse [futId=292, crdVer=1555433741506, cntr=395, 
opCntr=1, txs=[394], cleanupVer=390, tracking=0], skipCompletedVers=false, 
parentTx=null, duration=20866ms, onePhaseCommit=false], size=0

{code}
However, load threads don't start any explicit transactions: they either hang 
on put()/get() or on clientCache.close().

Rolling back IGNITE-10799 resolves the issue (however, test remains flaky with 
~10% fail rate due to unhandled TransactionSerializationException).

 

  was:
Attempt to restart server node in test hangs:

 
{code:java}
[2019-04-16 19:56:45,049][WARN ][restart-1][GridCachePartitionExchangeManager] 
Failed to wait for initial partition map exchange. Possible reasons are:
^-- Transactions in deadlock.
^-- Long running transactions (ignore if this is the case).
^-- Unreleased explicit locks.
{code}
The reason is that previous PME (late affinity assignment) still hangs due to 
pending transaction:

 

 
{code:java}
[2019-04-16 19:56:23,717][WARN 
][exchange-worker-#1039%cache.IgniteClientCacheStartFailoverTest3%][diagnostic] 
Pending transactions:
[2019-04-16 19:56:23,718][WARN 
][exchange-worker-#1039%cache.IgniteClientCacheStartFailoverTest3%][diagnostic] 
>>> [txVer=AffinityTopologyVersion [topVer=11, minorTopVer=0], exchWait=true, 
tx=GridDhtTxLocal [nearNodeId=8559bfe0-3d4a-4090-a457-6df0eba5, 
nearFutId=1edc7172a61-941f9dde-2b60-4a1f-8213-7d23d738bf33, nearMiniId=1, 
nearFinFutId=null, nearFinMiniId=0, nearXidVer=GridCacheVersion 
[topVer=166913752, order=1555433759036, nodeOrder=6], lb=null, 
super=GridDhtTxLocalAdapter [nearOnOriginatingNode=false, nearNodes=KeySetView 
[], dhtNodes=KeySetView [9ef33532-0e4a-4561-b57e-042afe10], 
explicitLock=false, super=IgniteTxLocalAdapter [completedBase=null, 
sndTransformedVals=false, depEnabled=false, txState=IgniteTxStateImpl 
[activeCacheIds=[-1062368467], recovery=false, mvccEnabled=true, 
mvccCachingCacheIds=[], txMap=HashSet []], super=IgniteTxAdapter 
[xidVer=GridCacheVersion [topVer=166913752, order=1555433759045, nodeOrder=10], 
writeVer=null, implicit=false, loc=true, threadId=1210, 
startTime=1555433762847, nodeId=0088e9b8-f859-4d14-8071-6388e473, 
startVer=GridCacheVersion [topVer=166913752, order=1555433759045, 
nodeOrder=10], endVer=null, isolation=REPEATABLE_READ, concurrency=PESSIMISTIC, 
timeout=0, sysInvalidate=false, sys=false, plc=2, commitVer=GridCacheVersion 
[topVer=166913752, order=1555433759045, nodeOrder=10], finalizing=NONE, 
invalidParts=null, state=MARKED_ROLLBACK, timedOut=false, 
topVer=AffinityTopologyVersion [topVer=11, minorTopVer=0], 
mvccSnapshot=MvccSnapshotResponse [futId=292, crdVer=1555433741506,