[jira] [Commented] (IGNITE-8863) Tx rollback can cause remote tx hang
[ https://issues.apache.org/jira/browse/IGNITE-8863?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16541422#comment-16541422 ] ASF GitHub Bot commented on IGNITE-8863: Github user asfgit closed the pull request at: https://github.com/apache/ignite/pull/4262 > Tx rollback can cause remote tx hang > > > Key: IGNITE-8863 > URL: https://issues.apache.org/jira/browse/IGNITE-8863 > Project: Ignite > Issue Type: Bug >Reporter: Alexei Scherbakov >Assignee: Alexei Scherbakov >Priority: Major > Fix For: 2.7 > > Attachments: Ignite_Tests_2.4_Java_8_Cache_5_1434.log.zip > > > {noformat} > [16:33:56]W: [org.apache.ignite:ignite-core] [2018-06-08 > 13:33:56,931][WARN ][sys-#66696%client%][GridNearTxLocal] The transaction was > forcibly rolled back because a timeout is reached: > GridNearTxLocal[xid=e198a9fd361--0857-6387--0004, > xidVersion=GridCacheVersion [topVer=139944839, order=1528464836894, > nodeOrder=4], concurrency=PESSIMISTIC, isolation=REPEATABLE_READ, > state=MARKED_ROLLBACK, invalidate=false, rollbackOnly=true, > nodeId=3c8d85b2-4eb9-46b2-8bd1-6f18f542fc7a, timeout=1, duration=11] > [16:35:55]W: [org.apache.ignite:ignite-core] [2018-06-08 > 13:35:55,056][WARN > ][grid-timeout-worker-#66394%transactions.TxRollbackOnTimeoutTest0%][diagnostic] > Found long running transaction [startTime=13:33:56.931, > curTime=13:35:55.054, tx=GridDhtTxRemote > [nearNodeId=3c8d85b2-4eb9-46b2-8bd1-6f18f542fc7a, > rmtFutId=af940d0e361-79c59341-3292-46e4-92ce-5c4ef4eddef8, > nearXidVer=GridCacheVersion [topVer=139944839, order=1528464836894, > nodeOrder=4], storeWriteThrough=false, super=GridDistributedTxRemoteAdapter > [explicitVers=null, started=true, commitAllowed=0, > txState=IgniteTxRemoteSingleStateImpl [entry=IgniteTxEntry > [key=KeyCacheObjectImpl [part=1, val=1, hasValBytes=true], cacheId=3556498, > txKey=IgniteTxKey [key=KeyCacheObjectImpl [part=1, val=1, hasValBytes=true], > cacheId=3556498], val=[op=CREATE, val=CacheObjectImpl [val=null, > 
hasValBytes=true]], prevVal=[op=NOOP, val=null], oldVal=[op=NOOP, val=null], > entryProcessorsCol=null, ttl=-1, conflictExpireTime=-1, conflictVer=null, > explicitVer=null, dhtVer=null, filters=[], filtersPassed=false, > filtersSet=false, entry=GridDhtCacheEntry [rdrs=[], part=1, > super=GridDistributedCacheEntry [super=GridCacheMapEntry > [key=KeyCacheObjectImpl [part=1, val=1, hasValBytes=true], > val=CacheObjectImpl [val=null, hasValBytes=true], startVer=1528464836879, > ver=GridCacheVersion [topVer=139944839, order=1528464836863, nodeOrder=2], > hash=1, extras=GridCacheMvccEntryExtras [mvcc=GridCacheMvcc [locs=null, > rmts=[GridCacheMvccCandidate [nodeId=97ee44cd-73c9-4e79-95df-e1a03481, > ver=GridCacheVersion [topVer=139944839, order=1528464836897, nodeOrder=2], > threadId=75880, id=2310313, topVer=AffinityTopologyVersion [topVer=-1, > minorTopVer=0], reentry=null, > otherNodeId=3c8d85b2-4eb9-46b2-8bd1-6f18f542fc7a, otherVer=null, > mappedDhtNodes=null, mappedNearNodes=null, ownerVer=null, serOrder=null, > key=KeyCacheObjectImpl [part=1, val=1, hasValBytes=true], > masks=local=0|owner=0|ready=0|reentry=0|used=0|tx=1|single_implicit=0|dht_local=0|near_local=0|removed=0|read=0, > prevVer=null, nextVer=null], GridCacheMvccCandidate > [nodeId=97ee44cd-73c9-4e79-95df-e1a03481, ver=GridCacheVersion > [topVer=139944839, order=1528464836900, nodeOrder=2], threadId=75875, > id=2310317, topVer=AffinityTopologyVersion [topVer=-1, minorTopVer=0], > reentry=null, otherNodeId=3c8d85b2-4eb9-46b2-8bd1-6f18f542fc7a, > otherVer=null, mappedDhtNodes=null, mappedNearNodes=null, ownerVer=null, > serOrder=null, key=KeyCacheObjectImpl [part=1, val=1, hasValBytes=true], > masks=local=0|owner=1|ready=0|reentry=0|used=1|tx=1|single_implicit=0|dht_local=0|near_local=0|removed=0|read=0, > prevVer=null, nextVer=null, flags=2]]], prepared=1, locked=false, > nodeId=null, locMapped=false, expiryPlc=null, transferExpiryPlc=false, > flags=0, partUpdateCntr=0, serReadVer=null, xidVer=null]], > 
skipCompletedVers=false, super=IgniteTxAdapter [xidVer=GridCacheVersion > [topVer=139944839, order=1528464836897, nodeOrder=2], > writeVer=GridCacheVersion [topVer=139944839, order=1528464836898, > nodeOrder=2], implicit=false, loc=false, threadId=75880, > startTime=1528464836931, nodeId=97ee44cd-73c9-4e79-95df-e1a03481, > startVer=GridCacheVersion [topVer=139944839, order=1528464836864, > nodeOrder=1], endVer=null, isolation=REPEATABLE_READ, > concurrency=PESSIMISTIC, timeout=1, sysInvalidate=false, sys=false, plc=2, > commitVer=null, finalizing=NONE, invalidParts=null, state=PREPARED, > timedOut=false, topVer=AffinityTopologyVersion [topVer=4, minorTopVer=0], > duration=118123ms,
[jira] [Commented] (IGNITE-8863) Tx rollback can cause remote tx hang
[ https://issues.apache.org/jira/browse/IGNITE-8863?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16541350#comment-16541350 ]

Igor Seliverstov commented on IGNITE-8863:

[~ascherbakov], have looked at the changes, looks OK to me.
[jira] [Commented] (IGNITE-8863) Tx rollback can cause remote tx hang
[ https://issues.apache.org/jira/browse/IGNITE-8863?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16525229#comment-16525229 ]

Alexei Scherbakov commented on IGNITE-8863:

It looks like the TC history for the Cache5 suite has recently become very bad (see [1]), mainly due to CacheSerializableTransactionsTest and PartitionsExchangeOnDiscoveryHistoryOverflowTest. I've muted PartitionsExchangeOnDiscoveryHistoryOverflowTest because it produces an OOM on almost every run; someone should push the PR from IGNITE-8869 to prevent the OOM. I think my changes are not related to the failures, because they started to happen before this commit.

[~agoncharuk], please review.

[1] https://ci.ignite.apache.org/viewType.html?buildTypeId=IgniteTests24Java8_Cache5_IgniteTests24Java8=%3Cdefault%3E=buildTypeStatusDiv
[jira] [Commented] (IGNITE-8863) Tx rollback can cause remote tx hang
[ https://issues.apache.org/jira/browse/IGNITE-8863?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16525170#comment-16525170 ]

Alexei Scherbakov commented on IGNITE-8863:

There is a suspicious failure in CacheSerializableTransactionsTest#testReadWriteTransactionsNoDeadlock that leads to a Cache5 suite hang. I see a hanging transaction:

{noformat}
[00:23:01]W: [org.apache.ignite:ignite-core] [2018-06-26 21:23:01,176][WARN ][grid-timeout-worker-#55%cache.CacheSerializableTransactionsTest0%][diagnostic] Found long running transaction [startTime=21:21:27.406, curTime=21:23:01.174, tx=GridNearTxLocal [mappings=IgniteTxMappingsImpl [], nearLocallyMapped=false, colocatedLocallyMapped=false, needCheckBackup=null, hasRemoteLocks=false, trackTimeout=false, lb=null, thread=test-thread-22, mappings=IgniteTxMappingsImpl [], super=GridDhtTxLocalAdapter [nearOnOriginatingNode=false, nearNodes=[], dhtNodes=[], explicitLock=false, super=IgniteTxLocalAdapter [completedBase=null, sndTransformedVals=false, depEnabled=false, txState=IgniteTxStateImpl [activeCacheIds=[1544803905], recovery=false, txMap=[IgniteTxEntry [key=KeyCacheObjectImpl [part=27, val=27, hasValBytes=false], cacheId=1544803905, txKey=IgniteTxKey [key=KeyCacheObjectImpl [part=27, val=27, hasValBytes=false], cacheId=1544803905], val=[op=READ, val=null], prevVal=[op=READ, val=null], oldVal=[op=NOOP, val=null], entryProcessorsCol=null, ttl=-1, conflictExpireTime=-1, conflictVer=null, explicitVer=null, dhtVer=null, filters=null, filtersPassed=false, filtersSet=true, entry=GridDhtDetachedCacheEntry [super=GridDistributedCacheEntry [super=GridCacheMapEntry [key=KeyCacheObjectImpl [part=27, val=27, hasValBytes=false], val=null, ver=GridCacheVersion [topVer=0, order=0, nodeOrder=0], hash=27, extras=null, flags=0]]], prepared=0, locked=false, nodeId=null, locMapped=false, expiryPlc=null, transferExpiryPlc=false, flags=0, partUpdateCntr=0, serReadVer=GridCacheVersion [topVer=0, order=0, nodeOrder=0],
xidVer=GridCacheVersion [topVer=141525768, order=1530052929764, nodeOrder=1]], IgniteTxEntry [key=KeyCacheObjectImpl [part=8, val=8, hasValBytes=false], cacheId=1544803905, txKey=IgniteTxKey [key=KeyCacheObjectImpl [part=8, val=8, hasValBytes=false], cacheId=1544803905], val=[op=CREATE, val=UserCacheObjectImpl [val=8, hasValBytes=false]], prevVal=[op=CREATE, val=UserCacheObjectImpl [val=8, hasValBytes=false]], oldVal=[op=NOOP, val=null], entryProcessorsCol=null, ttl=-1, conflictExpireTime=-1, conflictVer=null, explicitVer=null, dhtVer=null, filters=[], filtersPassed=false, filtersSet=true, entry=GridDhtDetachedCacheEntry [super=GridDistributedCacheEntry [super=GridCacheMapEntry [key=KeyCacheObjectImpl [part=8, val=8, hasValBytes=false], val=null, ver=GridCacheVersion [topVer=0, order=0, nodeOrder=0], hash=8, extras=null, flags=0]]], prepared=0, locked=false, nodeId=null, locMapped=false, expiryPlc=null, transferExpiryPlc=false, flags=0, partUpdateCntr=0, serReadVer=null, xidVer=GridCacheVersion [topVer=141525768, order=1530052929764, nodeOrder=1]], IgniteTxEntry [key=KeyCacheObjectImpl [part=24, val=24, hasValBytes=false], cacheId=1544803905, txKey=IgniteTxKey [key=KeyCacheObjectImpl [part=24, val=24, hasValBytes=false], cacheId=1544803905], val=[op=READ, val=null], prevVal=[op=NOOP, val=null], oldVal=[op=NOOP, val=null], entryProcessorsCol=null, ttl=-1, conflictExpireTime=-1, conflictVer=null, explicitVer=null, dhtVer=null, filters=null, filtersPassed=false, filtersSet=true, entry=GridDhtDetachedCacheEntry [super=GridDistributedCacheEntry [super=GridCacheMapEntry [key=KeyCacheObjectImpl [part=24, val=24, hasValBytes=false], val=null, ver=GridCacheVersion [topVer=0, order=0, nodeOrder=0], hash=24, extras=null, flags=0]]], prepared=0, locked=false, nodeId=null, locMapped=false, expiryPlc=null, transferExpiryPlc=false, flags=0, partUpdateCntr=0, serReadVer=null, xidVer=GridCacheVersion [topVer=141525768, order=1530052929764, nodeOrder=1]], IgniteTxEntry 
[key=KeyCacheObjectImpl [part=29, val=29, hasValBytes=false], cacheId=1544803905, txKey=IgniteTxKey [key=KeyCacheObjectImpl [part=29, val=29, hasValBytes=false], cacheId=1544803905], val=[op=READ, val=null], prevVal=[op=READ, val=null], oldVal=[op=NOOP, val=null], entryProcessorsCol=null, ttl=-1, conflictExpireTime=-1, conflictVer=null, explicitVer=null, dhtVer=null, filters=null, filtersPassed=false, filtersSet=true, entry=GridDhtDetachedCacheEntry [super=GridDistributedCacheEntry [super=GridCacheMapEntry [key=KeyCacheObjectImpl [part=29, val=29, hasValBytes=false], val=null, ver=GridCacheVersion [topVer=0, order=0, nodeOrder=0], hash=29, extras=null, flags=0]]], prepared=0, locked=false, nodeId=null, locMapped=false, expiryPlc=null, transferExpiryPlc=false, flags=0, partUpdateCntr=0, serReadVer=GridCacheVersion [topVer=0, order=0, nodeOrder=0], xidVer=GridCacheVersion
[jira] [Commented] (IGNITE-8863) Tx rollback can cause remote tx hang
[ https://issues.apache.org/jira/browse/IGNITE-8863?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16523961#comment-16523961 ]

Alexei Scherbakov commented on IGNITE-8863:

There are races between the messages that create a new tx and the rollback messages, which lead to hanging primary and backup transactions. Because the fix is urgent, additional tests covering such scenarios will be introduced later, in IGNITE-8509.
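For readers unfamiliar with this kind of race, here is a minimal, JDK-only sketch of one common way to make a node tolerate a rollback message that arrives before the message that creates the tx. It is not the change made in the pull request for this issue and does not use Ignite's internal classes. The class `RemoteTxRegistry`, its methods, and the use of a plain `long` as the tx version are all hypothetical names chosen for illustration.

```java
import java.util.Map;
import java.util.concurrent.ConcurrentHashMap;

/** Hypothetical registry of remote transactions kept by a primary or backup node. */
class RemoteTxRegistry {
    /** Illustrative states only; a real tx has many more. */
    enum State { PREPARED, ROLLED_BACK }

    /** Tx version -> state. A ROLLED_BACK entry also serves as a marker for an early rollback. */
    private final Map<Long, State> txs = new ConcurrentHashMap<>();

    /**
     * Handles the message that creates the tx.
     * Returns false if a rollback for this version was already seen, so the caller
     * must not start a tx that nobody would ever finish.
     */
    boolean onPrepare(long ver) {
        // putIfAbsent is atomic: the tx is registered only if no entry exists yet.
        State prev = txs.putIfAbsent(ver, State.PREPARED);
        return prev != State.ROLLED_BACK;
    }

    /** Handles the rollback message, whether or not the tx has been created yet. */
    void onRollback(long ver) {
        // Overwrites PREPARED, or leaves a marker if the creating message is still in flight.
        txs.put(ver, State.ROLLED_BACK);
    }

    /** Returns the recorded state, or null if the version was never seen. */
    State state(long ver) {
        return txs.get(ver);
    }
}
```

With this sketch, calling `onRollback(v)` followed by `onPrepare(v)` returns `false` and leaves the entry in `ROLLED_BACK`, instead of leaving a `PREPARED` tx behind. The sketch never evicts markers; a real implementation would need to bound or expire them.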
[jira] [Commented] (IGNITE-8863) Tx rollback can cause remote tx hang
[ https://issues.apache.org/jira/browse/IGNITE-8863?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16523952#comment-16523952 ]

ASF GitHub Bot commented on IGNITE-8863:

GitHub user ascherbakoff opened a pull request:

    https://github.com/apache/ignite/pull/4262

    IGNITE-8863 Tx rollback can cause remote tx hang.

You can merge this pull request into a Git repository by running:

    $ git pull https://github.com/gridgain/apache-ignite ignite-8863

Alternatively you can review and apply these changes as the patch at:

    https://github.com/apache/ignite/pull/4262.patch

To close this pull request, make a commit to your master/trunk branch with (at least) the following in the commit message:

    This closes #4262

commit cf01e782f19f84f466d35377e58039262361a8fa
Author: Aleksei Scherbakov
Date: 2018-06-26T16:36:04Z

    IGNITE-8863 Tx rollback can cause remote tx hang.