[jira] [Commented] (IGNITE-8863) Tx rollback can cause remote tx hang

2018-07-12 Thread ASF GitHub Bot (JIRA)


[ 
https://issues.apache.org/jira/browse/IGNITE-8863?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16541422#comment-16541422
 ] 

ASF GitHub Bot commented on IGNITE-8863:


Github user asfgit closed the pull request at:

https://github.com/apache/ignite/pull/4262


> Tx rollback can cause remote tx hang
> 
>
> Key: IGNITE-8863
> URL: https://issues.apache.org/jira/browse/IGNITE-8863
> Project: Ignite
>  Issue Type: Bug
>Reporter: Alexei Scherbakov
>Assignee: Alexei Scherbakov
>Priority: Major
> Fix For: 2.7
>
> Attachments: Ignite_Tests_2.4_Java_8_Cache_5_1434.log.zip
>
>
> {noformat}
> [16:33:56]W: [org.apache.ignite:ignite-core] [2018-06-08 
> 13:33:56,931][WARN ][sys-#66696%client%][GridNearTxLocal] The transaction was 
> forcibly rolled back because a timeout is reached: 
> GridNearTxLocal[xid=e198a9fd361--0857-6387--0004, 
> xidVersion=GridCacheVersion [topVer=139944839, order=1528464836894, 
> nodeOrder=4], concurrency=PESSIMISTIC, isolation=REPEATABLE_READ, 
> state=MARKED_ROLLBACK, invalidate=false, rollbackOnly=true, 
> nodeId=3c8d85b2-4eb9-46b2-8bd1-6f18f542fc7a, timeout=1, duration=11]
> [16:35:55]W: [org.apache.ignite:ignite-core] [2018-06-08 
> 13:35:55,056][WARN 
> ][grid-timeout-worker-#66394%transactions.TxRollbackOnTimeoutTest0%][diagnostic]
>  Found long running transaction [startTime=13:33:56.931, 
> curTime=13:35:55.054, tx=GridDhtTxRemote 
> [nearNodeId=3c8d85b2-4eb9-46b2-8bd1-6f18f542fc7a, 
> rmtFutId=af940d0e361-79c59341-3292-46e4-92ce-5c4ef4eddef8, 
> nearXidVer=GridCacheVersion [topVer=139944839, order=1528464836894, 
> nodeOrder=4], storeWriteThrough=false, super=GridDistributedTxRemoteAdapter 
> [explicitVers=null, started=true, commitAllowed=0, 
> txState=IgniteTxRemoteSingleStateImpl [entry=IgniteTxEntry 
> [key=KeyCacheObjectImpl [part=1, val=1, hasValBytes=true], cacheId=3556498, 
> txKey=IgniteTxKey [key=KeyCacheObjectImpl [part=1, val=1, hasValBytes=true], 
> cacheId=3556498], val=[op=CREATE, val=CacheObjectImpl [val=null, 
> hasValBytes=true]], prevVal=[op=NOOP, val=null], oldVal=[op=NOOP, val=null], 
> entryProcessorsCol=null, ttl=-1, conflictExpireTime=-1, conflictVer=null, 
> explicitVer=null, dhtVer=null, filters=[], filtersPassed=false, 
> filtersSet=false, entry=GridDhtCacheEntry [rdrs=[], part=1, 
> super=GridDistributedCacheEntry [super=GridCacheMapEntry 
> [key=KeyCacheObjectImpl [part=1, val=1, hasValBytes=true], 
> val=CacheObjectImpl [val=null, hasValBytes=true], startVer=1528464836879, 
> ver=GridCacheVersion [topVer=139944839, order=1528464836863, nodeOrder=2], 
> hash=1, extras=GridCacheMvccEntryExtras [mvcc=GridCacheMvcc [locs=null, 
> rmts=[GridCacheMvccCandidate [nodeId=97ee44cd-73c9-4e79-95df-e1a03481, 
> ver=GridCacheVersion [topVer=139944839, order=1528464836897, nodeOrder=2], 
> threadId=75880, id=2310313, topVer=AffinityTopologyVersion [topVer=-1, 
> minorTopVer=0], reentry=null, 
> otherNodeId=3c8d85b2-4eb9-46b2-8bd1-6f18f542fc7a, otherVer=null, 
> mappedDhtNodes=null, mappedNearNodes=null, ownerVer=null, serOrder=null, 
> key=KeyCacheObjectImpl [part=1, val=1, hasValBytes=true], 
> masks=local=0|owner=0|ready=0|reentry=0|used=0|tx=1|single_implicit=0|dht_local=0|near_local=0|removed=0|read=0,
>  prevVer=null, nextVer=null], GridCacheMvccCandidate 
> [nodeId=97ee44cd-73c9-4e79-95df-e1a03481, ver=GridCacheVersion 
> [topVer=139944839, order=1528464836900, nodeOrder=2], threadId=75875, 
> id=2310317, topVer=AffinityTopologyVersion [topVer=-1, minorTopVer=0], 
> reentry=null, otherNodeId=3c8d85b2-4eb9-46b2-8bd1-6f18f542fc7a, 
> otherVer=null, mappedDhtNodes=null, mappedNearNodes=null, ownerVer=null, 
> serOrder=null, key=KeyCacheObjectImpl [part=1, val=1, hasValBytes=true], 
> masks=local=0|owner=1|ready=0|reentry=0|used=1|tx=1|single_implicit=0|dht_local=0|near_local=0|removed=0|read=0,
>  prevVer=null, nextVer=null, flags=2]]], prepared=1, locked=false, 
> nodeId=null, locMapped=false, expiryPlc=null, transferExpiryPlc=false, 
> flags=0, partUpdateCntr=0, serReadVer=null, xidVer=null]], 
> skipCompletedVers=false, super=IgniteTxAdapter [xidVer=GridCacheVersion 
> [topVer=139944839, order=1528464836897, nodeOrder=2], 
> writeVer=GridCacheVersion [topVer=139944839, order=1528464836898, 
> nodeOrder=2], implicit=false, loc=false, threadId=75880, 
> startTime=1528464836931, nodeId=97ee44cd-73c9-4e79-95df-e1a03481, 
> startVer=GridCacheVersion [topVer=139944839, order=1528464836864, 
> nodeOrder=1], endVer=null, isolation=REPEATABLE_READ, 
> concurrency=PESSIMISTIC, timeout=1, sysInvalidate=false, sys=false, plc=2, 
> commitVer=null, finalizing=NONE, invalidParts=null, state=PREPARED, 
> timedOut=false, topVer=AffinityTopologyVersion [topVer=4, minorTopVer=0], 
> duration=118123ms, 

[jira] [Commented] (IGNITE-8863) Tx rollback can cause remote tx hang

2018-07-12 Thread Igor Seliverstov (JIRA)


[ 
https://issues.apache.org/jira/browse/IGNITE-8863?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16541350#comment-16541350
 ] 

Igor Seliverstov commented on IGNITE-8863:
--

[~ascherbakov], have looked at the changes, looks OK to me.

> Tx rollback can cause remote tx hang
> 
>
> Key: IGNITE-8863
> URL: https://issues.apache.org/jira/browse/IGNITE-8863
> Project: Ignite
>  Issue Type: Bug
>Reporter: Alexei Scherbakov
>Assignee: Alexei Scherbakov
>Priority: Major
> Fix For: 2.7
>
> Attachments: Ignite_Tests_2.4_Java_8_Cache_5_1434.log.zip
>
>
> {noformat}
> [16:33:56]W: [org.apache.ignite:ignite-core] [2018-06-08 
> 13:33:56,931][WARN ][sys-#66696%client%][GridNearTxLocal] The transaction was 
> forcibly rolled back because a timeout is reached: 
> GridNearTxLocal[xid=e198a9fd361--0857-6387--0004, 
> xidVersion=GridCacheVersion [topVer=139944839, order=1528464836894, 
> nodeOrder=4], concurrency=PESSIMISTIC, isolation=REPEATABLE_READ, 
> state=MARKED_ROLLBACK, invalidate=false, rollbackOnly=true, 
> nodeId=3c8d85b2-4eb9-46b2-8bd1-6f18f542fc7a, timeout=1, duration=11]
> [16:35:55]W: [org.apache.ignite:ignite-core] [2018-06-08 
> 13:35:55,056][WARN 
> ][grid-timeout-worker-#66394%transactions.TxRollbackOnTimeoutTest0%][diagnostic]
>  Found long running transaction [startTime=13:33:56.931, 
> curTime=13:35:55.054, tx=GridDhtTxRemote 
> [nearNodeId=3c8d85b2-4eb9-46b2-8bd1-6f18f542fc7a, 
> rmtFutId=af940d0e361-79c59341-3292-46e4-92ce-5c4ef4eddef8, 
> nearXidVer=GridCacheVersion [topVer=139944839, order=1528464836894, 
> nodeOrder=4], storeWriteThrough=false, super=GridDistributedTxRemoteAdapter 
> [explicitVers=null, started=true, commitAllowed=0, 
> txState=IgniteTxRemoteSingleStateImpl [entry=IgniteTxEntry 
> [key=KeyCacheObjectImpl [part=1, val=1, hasValBytes=true], cacheId=3556498, 
> txKey=IgniteTxKey [key=KeyCacheObjectImpl [part=1, val=1, hasValBytes=true], 
> cacheId=3556498], val=[op=CREATE, val=CacheObjectImpl [val=null, 
> hasValBytes=true]], prevVal=[op=NOOP, val=null], oldVal=[op=NOOP, val=null], 
> entryProcessorsCol=null, ttl=-1, conflictExpireTime=-1, conflictVer=null, 
> explicitVer=null, dhtVer=null, filters=[], filtersPassed=false, 
> filtersSet=false, entry=GridDhtCacheEntry [rdrs=[], part=1, 
> super=GridDistributedCacheEntry [super=GridCacheMapEntry 
> [key=KeyCacheObjectImpl [part=1, val=1, hasValBytes=true], 
> val=CacheObjectImpl [val=null, hasValBytes=true], startVer=1528464836879, 
> ver=GridCacheVersion [topVer=139944839, order=1528464836863, nodeOrder=2], 
> hash=1, extras=GridCacheMvccEntryExtras [mvcc=GridCacheMvcc [locs=null, 
> rmts=[GridCacheMvccCandidate [nodeId=97ee44cd-73c9-4e79-95df-e1a03481, 
> ver=GridCacheVersion [topVer=139944839, order=1528464836897, nodeOrder=2], 
> threadId=75880, id=2310313, topVer=AffinityTopologyVersion [topVer=-1, 
> minorTopVer=0], reentry=null, 
> otherNodeId=3c8d85b2-4eb9-46b2-8bd1-6f18f542fc7a, otherVer=null, 
> mappedDhtNodes=null, mappedNearNodes=null, ownerVer=null, serOrder=null, 
> key=KeyCacheObjectImpl [part=1, val=1, hasValBytes=true], 
> masks=local=0|owner=0|ready=0|reentry=0|used=0|tx=1|single_implicit=0|dht_local=0|near_local=0|removed=0|read=0,
>  prevVer=null, nextVer=null], GridCacheMvccCandidate 
> [nodeId=97ee44cd-73c9-4e79-95df-e1a03481, ver=GridCacheVersion 
> [topVer=139944839, order=1528464836900, nodeOrder=2], threadId=75875, 
> id=2310317, topVer=AffinityTopologyVersion [topVer=-1, minorTopVer=0], 
> reentry=null, otherNodeId=3c8d85b2-4eb9-46b2-8bd1-6f18f542fc7a, 
> otherVer=null, mappedDhtNodes=null, mappedNearNodes=null, ownerVer=null, 
> serOrder=null, key=KeyCacheObjectImpl [part=1, val=1, hasValBytes=true], 
> masks=local=0|owner=1|ready=0|reentry=0|used=1|tx=1|single_implicit=0|dht_local=0|near_local=0|removed=0|read=0,
>  prevVer=null, nextVer=null, flags=2]]], prepared=1, locked=false, 
> nodeId=null, locMapped=false, expiryPlc=null, transferExpiryPlc=false, 
> flags=0, partUpdateCntr=0, serReadVer=null, xidVer=null]], 
> skipCompletedVers=false, super=IgniteTxAdapter [xidVer=GridCacheVersion 
> [topVer=139944839, order=1528464836897, nodeOrder=2], 
> writeVer=GridCacheVersion [topVer=139944839, order=1528464836898, 
> nodeOrder=2], implicit=false, loc=false, threadId=75880, 
> startTime=1528464836931, nodeId=97ee44cd-73c9-4e79-95df-e1a03481, 
> startVer=GridCacheVersion [topVer=139944839, order=1528464836864, 
> nodeOrder=1], endVer=null, isolation=REPEATABLE_READ, 
> concurrency=PESSIMISTIC, timeout=1, sysInvalidate=false, sys=false, plc=2, 
> commitVer=null, finalizing=NONE, invalidParts=null, state=PREPARED, 
> timedOut=false, topVer=AffinityTopologyVersion [topVer=4, minorTopVer=0], 
> duration=118123ms, onePhaseCommit=false
> 

[jira] [Commented] (IGNITE-8863) Tx rollback can cause remote tx hang

2018-06-27 Thread Alexei Scherbakov (JIRA)


[ 
https://issues.apache.org/jira/browse/IGNITE-8863?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16525229#comment-16525229
 ] 

Alexei Scherbakov commented on IGNITE-8863:
---

It's look like TC history for suite cache5 recently become very bad (see [1]) 
mainly due to CacheSerializableTransactionsTest and 
PartitionsExchangeOnDiscoveryHistoryOverflowTest.

I've muted PartitionsExchangeOnDiscoveryHistoryOverflowTest because it produces 
OOM almost each run and someone should push PR from IGNITE-8869 to prevent OOM.

I think my changes are not related to failures because these failures has 
started to happen before this commit.

[~agoncharuk], please review.

[1] 
https://ci.ignite.apache.org/viewType.html?buildTypeId=IgniteTests24Java8_Cache5_IgniteTests24Java8=%3Cdefault%3E=buildTypeStatusDiv

> Tx rollback can cause remote tx hang
> 
>
> Key: IGNITE-8863
> URL: https://issues.apache.org/jira/browse/IGNITE-8863
> Project: Ignite
>  Issue Type: Bug
>Reporter: Alexei Scherbakov
>Assignee: Alexei Scherbakov
>Priority: Major
> Fix For: 2.7
>
> Attachments: Ignite_Tests_2.4_Java_8_Cache_5_1434.log.zip
>
>
> {noformat}
> [16:33:56]W: [org.apache.ignite:ignite-core] [2018-06-08 
> 13:33:56,931][WARN ][sys-#66696%client%][GridNearTxLocal] The transaction was 
> forcibly rolled back because a timeout is reached: 
> GridNearTxLocal[xid=e198a9fd361--0857-6387--0004, 
> xidVersion=GridCacheVersion [topVer=139944839, order=1528464836894, 
> nodeOrder=4], concurrency=PESSIMISTIC, isolation=REPEATABLE_READ, 
> state=MARKED_ROLLBACK, invalidate=false, rollbackOnly=true, 
> nodeId=3c8d85b2-4eb9-46b2-8bd1-6f18f542fc7a, timeout=1, duration=11]
> [16:35:55]W: [org.apache.ignite:ignite-core] [2018-06-08 
> 13:35:55,056][WARN 
> ][grid-timeout-worker-#66394%transactions.TxRollbackOnTimeoutTest0%][diagnostic]
>  Found long running transaction [startTime=13:33:56.931, 
> curTime=13:35:55.054, tx=GridDhtTxRemote 
> [nearNodeId=3c8d85b2-4eb9-46b2-8bd1-6f18f542fc7a, 
> rmtFutId=af940d0e361-79c59341-3292-46e4-92ce-5c4ef4eddef8, 
> nearXidVer=GridCacheVersion [topVer=139944839, order=1528464836894, 
> nodeOrder=4], storeWriteThrough=false, super=GridDistributedTxRemoteAdapter 
> [explicitVers=null, started=true, commitAllowed=0, 
> txState=IgniteTxRemoteSingleStateImpl [entry=IgniteTxEntry 
> [key=KeyCacheObjectImpl [part=1, val=1, hasValBytes=true], cacheId=3556498, 
> txKey=IgniteTxKey [key=KeyCacheObjectImpl [part=1, val=1, hasValBytes=true], 
> cacheId=3556498], val=[op=CREATE, val=CacheObjectImpl [val=null, 
> hasValBytes=true]], prevVal=[op=NOOP, val=null], oldVal=[op=NOOP, val=null], 
> entryProcessorsCol=null, ttl=-1, conflictExpireTime=-1, conflictVer=null, 
> explicitVer=null, dhtVer=null, filters=[], filtersPassed=false, 
> filtersSet=false, entry=GridDhtCacheEntry [rdrs=[], part=1, 
> super=GridDistributedCacheEntry [super=GridCacheMapEntry 
> [key=KeyCacheObjectImpl [part=1, val=1, hasValBytes=true], 
> val=CacheObjectImpl [val=null, hasValBytes=true], startVer=1528464836879, 
> ver=GridCacheVersion [topVer=139944839, order=1528464836863, nodeOrder=2], 
> hash=1, extras=GridCacheMvccEntryExtras [mvcc=GridCacheMvcc [locs=null, 
> rmts=[GridCacheMvccCandidate [nodeId=97ee44cd-73c9-4e79-95df-e1a03481, 
> ver=GridCacheVersion [topVer=139944839, order=1528464836897, nodeOrder=2], 
> threadId=75880, id=2310313, topVer=AffinityTopologyVersion [topVer=-1, 
> minorTopVer=0], reentry=null, 
> otherNodeId=3c8d85b2-4eb9-46b2-8bd1-6f18f542fc7a, otherVer=null, 
> mappedDhtNodes=null, mappedNearNodes=null, ownerVer=null, serOrder=null, 
> key=KeyCacheObjectImpl [part=1, val=1, hasValBytes=true], 
> masks=local=0|owner=0|ready=0|reentry=0|used=0|tx=1|single_implicit=0|dht_local=0|near_local=0|removed=0|read=0,
>  prevVer=null, nextVer=null], GridCacheMvccCandidate 
> [nodeId=97ee44cd-73c9-4e79-95df-e1a03481, ver=GridCacheVersion 
> [topVer=139944839, order=1528464836900, nodeOrder=2], threadId=75875, 
> id=2310317, topVer=AffinityTopologyVersion [topVer=-1, minorTopVer=0], 
> reentry=null, otherNodeId=3c8d85b2-4eb9-46b2-8bd1-6f18f542fc7a, 
> otherVer=null, mappedDhtNodes=null, mappedNearNodes=null, ownerVer=null, 
> serOrder=null, key=KeyCacheObjectImpl [part=1, val=1, hasValBytes=true], 
> masks=local=0|owner=1|ready=0|reentry=0|used=1|tx=1|single_implicit=0|dht_local=0|near_local=0|removed=0|read=0,
>  prevVer=null, nextVer=null, flags=2]]], prepared=1, locked=false, 
> nodeId=null, locMapped=false, expiryPlc=null, transferExpiryPlc=false, 
> flags=0, partUpdateCntr=0, serReadVer=null, xidVer=null]], 
> skipCompletedVers=false, super=IgniteTxAdapter [xidVer=GridCacheVersion 
> [topVer=139944839, order=1528464836897, nodeOrder=2], 
> writeVer=GridCacheVersion 

[jira] [Commented] (IGNITE-8863) Tx rollback can cause remote tx hang

2018-06-27 Thread Alexei Scherbakov (JIRA)


[ 
https://issues.apache.org/jira/browse/IGNITE-8863?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16525170#comment-16525170
 ] 

Alexei Scherbakov commented on IGNITE-8863:
---

There is suspicious failure in  
CacheSerializableTransactionsTest#testReadWriteTransactionsNoDeadlock leading 
to Cache5 suite hang:

I see hanging transaction:
{noformat}
[00:23:01]W: [org.apache.ignite:ignite-core] [2018-06-26 
21:23:01,176][WARN 
][grid-timeout-worker-#55%cache.CacheSerializableTransactionsTest0%][diagnostic]
 Found long running transaction [startTime=21:21:27.406, curTime=21:23:01.174, 
tx=GridNearTxLocal [mappings=IgniteTxMappingsImpl [], nearLocallyMapped=false, 
colocatedLocallyMapped=false, needCheckBackup=null, hasRemoteLocks=false, 
trackTimeout=false, lb=null, thread=test-thread-22, 
mappings=IgniteTxMappingsImpl [], super=GridDhtTxLocalAdapter 
[nearOnOriginatingNode=false, nearNodes=[], dhtNodes=[], explicitLock=false, 
super=IgniteTxLocalAdapter [completedBase=null, sndTransformedVals=false, 
depEnabled=false, txState=IgniteTxStateImpl [activeCacheIds=[1544803905], 
recovery=false, txMap=[IgniteTxEntry [key=KeyCacheObjectImpl [part=27, val=27, 
hasValBytes=false], cacheId=1544803905, txKey=IgniteTxKey 
[key=KeyCacheObjectImpl [part=27, val=27, hasValBytes=false], 
cacheId=1544803905], val=[op=READ, val=null], prevVal=[op=READ, val=null], 
oldVal=[op=NOOP, val=null], entryProcessorsCol=null, ttl=-1, 
conflictExpireTime=-1, conflictVer=null, explicitVer=null, dhtVer=null, 
filters=null, filtersPassed=false, filtersSet=true, 
entry=GridDhtDetachedCacheEntry [super=GridDistributedCacheEntry 
[super=GridCacheMapEntry [key=KeyCacheObjectImpl [part=27, val=27, 
hasValBytes=false], val=null, ver=GridCacheVersion [topVer=0, order=0, 
nodeOrder=0], hash=27, extras=null, flags=0]]], prepared=0, locked=false, 
nodeId=null, locMapped=false, expiryPlc=null, transferExpiryPlc=false, flags=0, 
partUpdateCntr=0, serReadVer=GridCacheVersion [topVer=0, order=0, nodeOrder=0], 
xidVer=GridCacheVersion [topVer=141525768, order=1530052929764, nodeOrder=1]], 
IgniteTxEntry [key=KeyCacheObjectImpl [part=8, val=8, hasValBytes=false], 
cacheId=1544803905, txKey=IgniteTxKey [key=KeyCacheObjectImpl [part=8, val=8, 
hasValBytes=false], cacheId=1544803905], val=[op=CREATE, 
val=UserCacheObjectImpl [val=8, hasValBytes=false]], prevVal=[op=CREATE, 
val=UserCacheObjectImpl [val=8, hasValBytes=false]], oldVal=[op=NOOP, 
val=null], entryProcessorsCol=null, ttl=-1, conflictExpireTime=-1, 
conflictVer=null, explicitVer=null, dhtVer=null, filters=[], 
filtersPassed=false, filtersSet=true, entry=GridDhtDetachedCacheEntry 
[super=GridDistributedCacheEntry [super=GridCacheMapEntry 
[key=KeyCacheObjectImpl [part=8, val=8, hasValBytes=false], val=null, 
ver=GridCacheVersion [topVer=0, order=0, nodeOrder=0], hash=8, extras=null, 
flags=0]]], prepared=0, locked=false, nodeId=null, locMapped=false, 
expiryPlc=null, transferExpiryPlc=false, flags=0, partUpdateCntr=0, 
serReadVer=null, xidVer=GridCacheVersion [topVer=141525768, 
order=1530052929764, nodeOrder=1]], IgniteTxEntry [key=KeyCacheObjectImpl 
[part=24, val=24, hasValBytes=false], cacheId=1544803905, txKey=IgniteTxKey 
[key=KeyCacheObjectImpl [part=24, val=24, hasValBytes=false], 
cacheId=1544803905], val=[op=READ, val=null], prevVal=[op=NOOP, val=null], 
oldVal=[op=NOOP, val=null], entryProcessorsCol=null, ttl=-1, 
conflictExpireTime=-1, conflictVer=null, explicitVer=null, dhtVer=null, 
filters=null, filtersPassed=false, filtersSet=true, 
entry=GridDhtDetachedCacheEntry [super=GridDistributedCacheEntry 
[super=GridCacheMapEntry [key=KeyCacheObjectImpl [part=24, val=24, 
hasValBytes=false], val=null, ver=GridCacheVersion [topVer=0, order=0, 
nodeOrder=0], hash=24, extras=null, flags=0]]], prepared=0, locked=false, 
nodeId=null, locMapped=false, expiryPlc=null, transferExpiryPlc=false, flags=0, 
partUpdateCntr=0, serReadVer=null, xidVer=GridCacheVersion [topVer=141525768, 
order=1530052929764, nodeOrder=1]], IgniteTxEntry [key=KeyCacheObjectImpl 
[part=29, val=29, hasValBytes=false], cacheId=1544803905, txKey=IgniteTxKey 
[key=KeyCacheObjectImpl [part=29, val=29, hasValBytes=false], 
cacheId=1544803905], val=[op=READ, val=null], prevVal=[op=READ, val=null], 
oldVal=[op=NOOP, val=null], entryProcessorsCol=null, ttl=-1, 
conflictExpireTime=-1, conflictVer=null, explicitVer=null, dhtVer=null, 
filters=null, filtersPassed=false, filtersSet=true, 
entry=GridDhtDetachedCacheEntry [super=GridDistributedCacheEntry 
[super=GridCacheMapEntry [key=KeyCacheObjectImpl [part=29, val=29, 
hasValBytes=false], val=null, ver=GridCacheVersion [topVer=0, order=0, 
nodeOrder=0], hash=29, extras=null, flags=0]]], prepared=0, locked=false, 
nodeId=null, locMapped=false, expiryPlc=null, transferExpiryPlc=false, flags=0, 
partUpdateCntr=0, serReadVer=GridCacheVersion [topVer=0, order=0, nodeOrder=0], 
xidVer=GridCacheVersion 

[jira] [Commented] (IGNITE-8863) Tx rollback can cause remote tx hang

2018-06-26 Thread Alexei Scherbakov (JIRA)


[ 
https://issues.apache.org/jira/browse/IGNITE-8863?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16523961#comment-16523961
 ] 

Alexei Scherbakov commented on IGNITE-8863:
---

There are races between message which create new tx and rollback messages 
leading to hanging primary and backup transactions.

Additional tests covering such scenarios will be introduced later in 
IGNITE-8509 due to urgency of the fix.

> Tx rollback can cause remote tx hang
> 
>
> Key: IGNITE-8863
> URL: https://issues.apache.org/jira/browse/IGNITE-8863
> Project: Ignite
>  Issue Type: Bug
>Reporter: Alexei Scherbakov
>Assignee: Alexei Scherbakov
>Priority: Major
> Fix For: 2.7
>
>
> {noformat}
> [16:33:56]W: [org.apache.ignite:ignite-core] [2018-06-08 
> 13:33:56,931][WARN ][sys-#66696%client%][GridNearTxLocal] The transaction was 
> forcibly rolled back because a timeout is reached: 
> GridNearTxLocal[xid=e198a9fd361--0857-6387--0004, 
> xidVersion=GridCacheVersion [topVer=139944839, order=1528464836894, 
> nodeOrder=4], concurrency=PESSIMISTIC, isolation=REPEATABLE_READ, 
> state=MARKED_ROLLBACK, invalidate=false, rollbackOnly=true, 
> nodeId=3c8d85b2-4eb9-46b2-8bd1-6f18f542fc7a, timeout=1, duration=11]
> [16:35:55]W: [org.apache.ignite:ignite-core] [2018-06-08 
> 13:35:55,056][WARN 
> ][grid-timeout-worker-#66394%transactions.TxRollbackOnTimeoutTest0%][diagnostic]
>  Found long running transaction [startTime=13:33:56.931, 
> curTime=13:35:55.054, tx=GridDhtTxRemote 
> [nearNodeId=3c8d85b2-4eb9-46b2-8bd1-6f18f542fc7a, 
> rmtFutId=af940d0e361-79c59341-3292-46e4-92ce-5c4ef4eddef8, 
> nearXidVer=GridCacheVersion [topVer=139944839, order=1528464836894, 
> nodeOrder=4], storeWriteThrough=false, super=GridDistributedTxRemoteAdapter 
> [explicitVers=null, started=true, commitAllowed=0, 
> txState=IgniteTxRemoteSingleStateImpl [entry=IgniteTxEntry 
> [key=KeyCacheObjectImpl [part=1, val=1, hasValBytes=true], cacheId=3556498, 
> txKey=IgniteTxKey [key=KeyCacheObjectImpl [part=1, val=1, hasValBytes=true], 
> cacheId=3556498], val=[op=CREATE, val=CacheObjectImpl [val=null, 
> hasValBytes=true]], prevVal=[op=NOOP, val=null], oldVal=[op=NOOP, val=null], 
> entryProcessorsCol=null, ttl=-1, conflictExpireTime=-1, conflictVer=null, 
> explicitVer=null, dhtVer=null, filters=[], filtersPassed=false, 
> filtersSet=false, entry=GridDhtCacheEntry [rdrs=[], part=1, 
> super=GridDistributedCacheEntry [super=GridCacheMapEntry 
> [key=KeyCacheObjectImpl [part=1, val=1, hasValBytes=true], 
> val=CacheObjectImpl [val=null, hasValBytes=true], startVer=1528464836879, 
> ver=GridCacheVersion [topVer=139944839, order=1528464836863, nodeOrder=2], 
> hash=1, extras=GridCacheMvccEntryExtras [mvcc=GridCacheMvcc [locs=null, 
> rmts=[GridCacheMvccCandidate [nodeId=97ee44cd-73c9-4e79-95df-e1a03481, 
> ver=GridCacheVersion [topVer=139944839, order=1528464836897, nodeOrder=2], 
> threadId=75880, id=2310313, topVer=AffinityTopologyVersion [topVer=-1, 
> minorTopVer=0], reentry=null, 
> otherNodeId=3c8d85b2-4eb9-46b2-8bd1-6f18f542fc7a, otherVer=null, 
> mappedDhtNodes=null, mappedNearNodes=null, ownerVer=null, serOrder=null, 
> key=KeyCacheObjectImpl [part=1, val=1, hasValBytes=true], 
> masks=local=0|owner=0|ready=0|reentry=0|used=0|tx=1|single_implicit=0|dht_local=0|near_local=0|removed=0|read=0,
>  prevVer=null, nextVer=null], GridCacheMvccCandidate 
> [nodeId=97ee44cd-73c9-4e79-95df-e1a03481, ver=GridCacheVersion 
> [topVer=139944839, order=1528464836900, nodeOrder=2], threadId=75875, 
> id=2310317, topVer=AffinityTopologyVersion [topVer=-1, minorTopVer=0], 
> reentry=null, otherNodeId=3c8d85b2-4eb9-46b2-8bd1-6f18f542fc7a, 
> otherVer=null, mappedDhtNodes=null, mappedNearNodes=null, ownerVer=null, 
> serOrder=null, key=KeyCacheObjectImpl [part=1, val=1, hasValBytes=true], 
> masks=local=0|owner=1|ready=0|reentry=0|used=1|tx=1|single_implicit=0|dht_local=0|near_local=0|removed=0|read=0,
>  prevVer=null, nextVer=null, flags=2]]], prepared=1, locked=false, 
> nodeId=null, locMapped=false, expiryPlc=null, transferExpiryPlc=false, 
> flags=0, partUpdateCntr=0, serReadVer=null, xidVer=null]], 
> skipCompletedVers=false, super=IgniteTxAdapter [xidVer=GridCacheVersion 
> [topVer=139944839, order=1528464836897, nodeOrder=2], 
> writeVer=GridCacheVersion [topVer=139944839, order=1528464836898, 
> nodeOrder=2], implicit=false, loc=false, threadId=75880, 
> startTime=1528464836931, nodeId=97ee44cd-73c9-4e79-95df-e1a03481, 
> startVer=GridCacheVersion [topVer=139944839, order=1528464836864, 
> nodeOrder=1], endVer=null, isolation=REPEATABLE_READ, 
> concurrency=PESSIMISTIC, timeout=1, sysInvalidate=false, sys=false, plc=2, 
> commitVer=null, finalizing=NONE, invalidParts=null, state=PREPARED, 
> timedOut=false, 

[jira] [Commented] (IGNITE-8863) Tx rollback can cause remote tx hang

2018-06-26 Thread ASF GitHub Bot (JIRA)


[ 
https://issues.apache.org/jira/browse/IGNITE-8863?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16523952#comment-16523952
 ] 

ASF GitHub Bot commented on IGNITE-8863:


GitHub user ascherbakoff opened a pull request:

https://github.com/apache/ignite/pull/4262

IGNITE-8863 Tx rollback can cause remote tx hang.



You can merge this pull request into a Git repository by running:

$ git pull https://github.com/gridgain/apache-ignite ignite-8863

Alternatively you can review and apply these changes as the patch at:

https://github.com/apache/ignite/pull/4262.patch

To close this pull request, make a commit to your master/trunk branch
with (at least) the following in the commit message:

This closes #4262


commit cf01e782f19f84f466d35377e58039262361a8fa
Author: Aleksei Scherbakov 
Date:   2018-06-26T16:36:04Z

IGNITE-8863 Tx rollback can cause remote tx hang.




> Tx rollback can cause remote tx hang
> 
>
> Key: IGNITE-8863
> URL: https://issues.apache.org/jira/browse/IGNITE-8863
> Project: Ignite
>  Issue Type: Bug
>Reporter: Alexei Scherbakov
>Assignee: Alexei Scherbakov
>Priority: Major
> Fix For: 2.7
>
>
> {noformat}
> [16:33:56]W: [org.apache.ignite:ignite-core] [2018-06-08 
> 13:33:56,931][WARN ][sys-#66696%client%][GridNearTxLocal] The transaction was 
> forcibly rolled back because a timeout is reached: 
> GridNearTxLocal[xid=e198a9fd361--0857-6387--0004, 
> xidVersion=GridCacheVersion [topVer=139944839, order=1528464836894, 
> nodeOrder=4], concurrency=PESSIMISTIC, isolation=REPEATABLE_READ, 
> state=MARKED_ROLLBACK, invalidate=false, rollbackOnly=true, 
> nodeId=3c8d85b2-4eb9-46b2-8bd1-6f18f542fc7a, timeout=1, duration=11]
> [16:35:55]W: [org.apache.ignite:ignite-core] [2018-06-08 
> 13:35:55,056][WARN 
> ][grid-timeout-worker-#66394%transactions.TxRollbackOnTimeoutTest0%][diagnostic]
>  Found long running transaction [startTime=13:33:56.931, 
> curTime=13:35:55.054, tx=GridDhtTxRemote 
> [nearNodeId=3c8d85b2-4eb9-46b2-8bd1-6f18f542fc7a, 
> rmtFutId=af940d0e361-79c59341-3292-46e4-92ce-5c4ef4eddef8, 
> nearXidVer=GridCacheVersion [topVer=139944839, order=1528464836894, 
> nodeOrder=4], storeWriteThrough=false, super=GridDistributedTxRemoteAdapter 
> [explicitVers=null, started=true, commitAllowed=0, 
> txState=IgniteTxRemoteSingleStateImpl [entry=IgniteTxEntry 
> [key=KeyCacheObjectImpl [part=1, val=1, hasValBytes=true], cacheId=3556498, 
> txKey=IgniteTxKey [key=KeyCacheObjectImpl [part=1, val=1, hasValBytes=true], 
> cacheId=3556498], val=[op=CREATE, val=CacheObjectImpl [val=null, 
> hasValBytes=true]], prevVal=[op=NOOP, val=null], oldVal=[op=NOOP, val=null], 
> entryProcessorsCol=null, ttl=-1, conflictExpireTime=-1, conflictVer=null, 
> explicitVer=null, dhtVer=null, filters=[], filtersPassed=false, 
> filtersSet=false, entry=GridDhtCacheEntry [rdrs=[], part=1, 
> super=GridDistributedCacheEntry [super=GridCacheMapEntry 
> [key=KeyCacheObjectImpl [part=1, val=1, hasValBytes=true], 
> val=CacheObjectImpl [val=null, hasValBytes=true], startVer=1528464836879, 
> ver=GridCacheVersion [topVer=139944839, order=1528464836863, nodeOrder=2], 
> hash=1, extras=GridCacheMvccEntryExtras [mvcc=GridCacheMvcc [locs=null, 
> rmts=[GridCacheMvccCandidate [nodeId=97ee44cd-73c9-4e79-95df-e1a03481, 
> ver=GridCacheVersion [topVer=139944839, order=1528464836897, nodeOrder=2], 
> threadId=75880, id=2310313, topVer=AffinityTopologyVersion [topVer=-1, 
> minorTopVer=0], reentry=null, 
> otherNodeId=3c8d85b2-4eb9-46b2-8bd1-6f18f542fc7a, otherVer=null, 
> mappedDhtNodes=null, mappedNearNodes=null, ownerVer=null, serOrder=null, 
> key=KeyCacheObjectImpl [part=1, val=1, hasValBytes=true], 
> masks=local=0|owner=0|ready=0|reentry=0|used=0|tx=1|single_implicit=0|dht_local=0|near_local=0|removed=0|read=0,
>  prevVer=null, nextVer=null], GridCacheMvccCandidate 
> [nodeId=97ee44cd-73c9-4e79-95df-e1a03481, ver=GridCacheVersion 
> [topVer=139944839, order=1528464836900, nodeOrder=2], threadId=75875, 
> id=2310317, topVer=AffinityTopologyVersion [topVer=-1, minorTopVer=0], 
> reentry=null, otherNodeId=3c8d85b2-4eb9-46b2-8bd1-6f18f542fc7a, 
> otherVer=null, mappedDhtNodes=null, mappedNearNodes=null, ownerVer=null, 
> serOrder=null, key=KeyCacheObjectImpl [part=1, val=1, hasValBytes=true], 
> masks=local=0|owner=1|ready=0|reentry=0|used=1|tx=1|single_implicit=0|dht_local=0|near_local=0|removed=0|read=0,
>  prevVer=null, nextVer=null, flags=2]]], prepared=1, locked=false, 
> nodeId=null, locMapped=false, expiryPlc=null, transferExpiryPlc=false, 
> flags=0, partUpdateCntr=0, serReadVer=null, xidVer=null]], 
> skipCompletedVers=false, super=IgniteTxAdapter [xidVer=GridCacheVersion 
> [topVer=139944839, order=1528464836897,