[jira] [Commented] (GEODE-5056) ParallelGatewaySenderOperationsDUnitTest.testParallelPropagationSenderStartAfterStop_Scenario2 intermittently fail
[ https://issues.apache.org/jira/browse/GEODE-5056?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16475067#comment-16475067 ] ASF subversion and git services commented on GEODE-5056: Commit 6b54ae977abcf575513d21c608c6ce0e36014e32 in geode's branch refs/heads/support/9.5 from Xiaojian Zhou [ https://gitbox.apache.org/repos/asf?p=geode.git;h=6b54ae9 ] GEODE-5056: when found the dropped events at primary sender, send (#1794) QueueRemovalMessage for it (cherry picked from commit f7bb77c89a3d19673e8929275fc6c407a4b382bd) > ParallelGatewaySenderOperationsDUnitTest.testParallelPropagationSenderStartAfterStop_Scenario2 > intermittently fail > --- > > Key: GEODE-5056 > URL: https://issues.apache.org/jira/browse/GEODE-5056 > Project: Geode > Issue Type: Bug > Components: wan >Reporter: xiaojian zhou >Assignee: xiaojian zhou >Priority: Major > Labels: pull-request-available > Fix For: 1.6.0 > > Time Spent: 20m > Remaining Estimate: 0h > > After fixe GEODE-4942, I found there's at least one race condition is not > covered. > > [vm6] [debug 2018/04/11 16:47:35.189 PDT Processor2> tid=110] WAN: On primary bucket 57, setting the seq number as 1357 > > [vm7] [info 2018/04/11 16:47:35.150 PDT > tid=19] Started ParallelGatewaySender\{id=ln,remoteDsId=2,isRunning =true} > > [vm7] [debug 2018/04/11 16:47:35.189 PDT 10.118.19.25(27489):32781 shared ordered uid=7 port=59148> tid=95] WAN: > On secondary bucket 57, setting the seq number as 1357 > [vm7] [debug 2018/04/11 16:47:35.190 PDT 10.118.19.25(27489):32781 shared ordered uid=7 port=59148> tid=95] Key : > > 1357 > [vm6] [debug 2018/04/11 16:47:35.190 PDT Processor2> tid=110] register dropped event for primary queue. BucketId is > 57, shadowKey is 1357, prQ is /ln_PARALLEL_GATEWAY_SENDER_QUEUE > > - Note: vm6's sender is restarted and cleanup the map, before the > QueueRemvalMessage is sent out for the map. > [vm6] [info 2018/04/11 16:47:35.249 PDT > tid=19] Started ParallelGatewaySender\{id=ln,remoteDsId=2,isRunning =true} > [vm6] [debug 2018/04/11 16:47:35.437 PDT GatewaySender_ln_0> tid=118] BatchRemovalThread about to query the batch > removal map \{/ln_PARALLEL_GATEWAY_SENDER_QUEUE={96=[1396], 2=[1402], > 83=[1383], 6=[1406], 71=[1371], 87=[1387], 73=[1373], 90=[1390], 77=[1377], > 94=[1394]}} > [vm6] [debug 2018/04/11 16:47:35.753 PDT GatewaySender_ln_0> tid=118] BatchRemovalThread about to query the batch > removal map {/ln_PARALLEL_GATEWAY_SENDER_QUEUE={49=[1449], 65=[1465], > 83=[1483], 53=[1453], 71=[1471], 87=[1487], *57=[1457]*, 73=[1473], > 77=[1477], 62=[1462]}} > shadowKey 1457 was created after the sender is restarted > > [vm6] [debug 2018/04/11 16:47:35.438 PDT GatewaySender_ln_0> tid=118] Sending (ParallelQueueRemovalMessage@2344969b > processorId=0 sender=10.118.19.25(27489):32781) to 3 peers > ([10.118.19.25(27492):32783@4(GEODE 1.6.0), > 10.118.19.25(27485):32779@1(GEODE 1.6.0), > 10.118.19.25(27482):32778@2(GEODE 1.6.0)]) via tcp/ip > [vm7] [debug 2018/04/11 16:47:35.439 PDT 10.118.19.25(27489):32781 shared unordered uid=4 port=59119> tid=52] > Received message 'ParallelQueueRemovalMessage@11583f5b processorId=0 > sender=10.118.19.25(27489):32781' from <10.118.19.25(27489):32781> > > i.e. the dropped key was in the map, but before sending a QueueRemovalMessage > the sender is closed and cleared the map. -- This message was sent by Atlassian JIRA (v7.6.3#76005)
[jira] [Commented] (GEODE-5056) ParallelGatewaySenderOperationsDUnitTest.testParallelPropagationSenderStartAfterStop_Scenario2 intermittently fail
[ https://issues.apache.org/jira/browse/GEODE-5056?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16461424#comment-16461424 ] ASF subversion and git services commented on GEODE-5056: Commit 6cd24a877c6a19a1243d9582dd7b1f7edbe2859c in geode's branch refs/heads/develop from Xiaojian Zhou [ https://gitbox.apache.org/repos/asf?p=geode.git;h=6cd24a8 ] GEODE-5056: when found the dropped events at primary sender, send (#1794) QueueRemovalMessage for it (cherry picked from commit f7bb77c89a3d19673e8929275fc6c407a4b382bd) > ParallelGatewaySenderOperationsDUnitTest.testParallelPropagationSenderStartAfterStop_Scenario2 > intermittently fail > --- > > Key: GEODE-5056 > URL: https://issues.apache.org/jira/browse/GEODE-5056 > Project: Geode > Issue Type: Bug > Components: wan >Reporter: xiaojian zhou >Assignee: xiaojian zhou >Priority: Major > Labels: pull-request-available > Fix For: 1.6.0 > > Time Spent: 20m > Remaining Estimate: 0h > > After fixe GEODE-4942, I found there's at least one race condition is not > covered. > > [vm6] [debug 2018/04/11 16:47:35.189 PDT Processor2> tid=110] WAN: On primary bucket 57, setting the seq number as 1357 > > [vm7] [info 2018/04/11 16:47:35.150 PDT > tid=19] Started ParallelGatewaySender\{id=ln,remoteDsId=2,isRunning =true} > > [vm7] [debug 2018/04/11 16:47:35.189 PDT 10.118.19.25(27489):32781 shared ordered uid=7 port=59148> tid=95] WAN: > On secondary bucket 57, setting the seq number as 1357 > [vm7] [debug 2018/04/11 16:47:35.190 PDT 10.118.19.25(27489):32781 shared ordered uid=7 port=59148> tid=95] Key : > > 1357 > [vm6] [debug 2018/04/11 16:47:35.190 PDT Processor2> tid=110] register dropped event for primary queue. BucketId is > 57, shadowKey is 1357, prQ is /ln_PARALLEL_GATEWAY_SENDER_QUEUE > > - Note: vm6's sender is restarted and cleanup the map, before the > QueueRemvalMessage is sent out for the map. > [vm6] [info 2018/04/11 16:47:35.249 PDT > tid=19] Started ParallelGatewaySender\{id=ln,remoteDsId=2,isRunning =true} > [vm6] [debug 2018/04/11 16:47:35.437 PDT GatewaySender_ln_0> tid=118] BatchRemovalThread about to query the batch > removal map \{/ln_PARALLEL_GATEWAY_SENDER_QUEUE={96=[1396], 2=[1402], > 83=[1383], 6=[1406], 71=[1371], 87=[1387], 73=[1373], 90=[1390], 77=[1377], > 94=[1394]}} > [vm6] [debug 2018/04/11 16:47:35.753 PDT GatewaySender_ln_0> tid=118] BatchRemovalThread about to query the batch > removal map {/ln_PARALLEL_GATEWAY_SENDER_QUEUE={49=[1449], 65=[1465], > 83=[1483], 53=[1453], 71=[1471], 87=[1487], *57=[1457]*, 73=[1473], > 77=[1477], 62=[1462]}} > shadowKey 1457 was created after the sender is restarted > > [vm6] [debug 2018/04/11 16:47:35.438 PDT GatewaySender_ln_0> tid=118] Sending (ParallelQueueRemovalMessage@2344969b > processorId=0 sender=10.118.19.25(27489):32781) to 3 peers > ([10.118.19.25(27492):32783@4(GEODE 1.6.0), > 10.118.19.25(27485):32779@1(GEODE 1.6.0), > 10.118.19.25(27482):32778@2(GEODE 1.6.0)]) via tcp/ip > [vm7] [debug 2018/04/11 16:47:35.439 PDT 10.118.19.25(27489):32781 shared unordered uid=4 port=59119> tid=52] > Received message 'ParallelQueueRemovalMessage@11583f5b processorId=0 > sender=10.118.19.25(27489):32781' from <10.118.19.25(27489):32781> > > i.e. the dropped key was in the map, but before sending a QueueRemovalMessage > the sender is closed and cleared the map. -- This message was sent by Atlassian JIRA (v7.6.3#76005)
[jira] [Commented] (GEODE-5056) ParallelGatewaySenderOperationsDUnitTest.testParallelPropagationSenderStartAfterStop_Scenario2 intermittently fail
[ https://issues.apache.org/jira/browse/GEODE-5056?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16461408#comment-16461408 ] ASF subversion and git services commented on GEODE-5056: Commit 6cd24a877c6a19a1243d9582dd7b1f7edbe2859c in geode's branch refs/heads/master from Xiaojian Zhou [ https://gitbox.apache.org/repos/asf?p=geode.git;h=6cd24a8 ] GEODE-5056: when found the dropped events at primary sender, send (#1794) QueueRemovalMessage for it (cherry picked from commit f7bb77c89a3d19673e8929275fc6c407a4b382bd) > ParallelGatewaySenderOperationsDUnitTest.testParallelPropagationSenderStartAfterStop_Scenario2 > intermittently fail > --- > > Key: GEODE-5056 > URL: https://issues.apache.org/jira/browse/GEODE-5056 > Project: Geode > Issue Type: Bug > Components: wan >Reporter: xiaojian zhou >Assignee: xiaojian zhou >Priority: Major > Labels: pull-request-available > Fix For: 1.6.0 > > Time Spent: 20m > Remaining Estimate: 0h > > After fixe GEODE-4942, I found there's at least one race condition is not > covered. > > [vm6] [debug 2018/04/11 16:47:35.189 PDT Processor2> tid=110] WAN: On primary bucket 57, setting the seq number as 1357 > > [vm7] [info 2018/04/11 16:47:35.150 PDT > tid=19] Started ParallelGatewaySender\{id=ln,remoteDsId=2,isRunning =true} > > [vm7] [debug 2018/04/11 16:47:35.189 PDT 10.118.19.25(27489):32781 shared ordered uid=7 port=59148> tid=95] WAN: > On secondary bucket 57, setting the seq number as 1357 > [vm7] [debug 2018/04/11 16:47:35.190 PDT 10.118.19.25(27489):32781 shared ordered uid=7 port=59148> tid=95] Key : > > 1357 > [vm6] [debug 2018/04/11 16:47:35.190 PDT Processor2> tid=110] register dropped event for primary queue. BucketId is > 57, shadowKey is 1357, prQ is /ln_PARALLEL_GATEWAY_SENDER_QUEUE > > - Note: vm6's sender is restarted and cleanup the map, before the > QueueRemvalMessage is sent out for the map. > [vm6] [info 2018/04/11 16:47:35.249 PDT > tid=19] Started ParallelGatewaySender\{id=ln,remoteDsId=2,isRunning =true} > [vm6] [debug 2018/04/11 16:47:35.437 PDT GatewaySender_ln_0> tid=118] BatchRemovalThread about to query the batch > removal map \{/ln_PARALLEL_GATEWAY_SENDER_QUEUE={96=[1396], 2=[1402], > 83=[1383], 6=[1406], 71=[1371], 87=[1387], 73=[1373], 90=[1390], 77=[1377], > 94=[1394]}} > [vm6] [debug 2018/04/11 16:47:35.753 PDT GatewaySender_ln_0> tid=118] BatchRemovalThread about to query the batch > removal map {/ln_PARALLEL_GATEWAY_SENDER_QUEUE={49=[1449], 65=[1465], > 83=[1483], 53=[1453], 71=[1471], 87=[1487], *57=[1457]*, 73=[1473], > 77=[1477], 62=[1462]}} > shadowKey 1457 was created after the sender is restarted > > [vm6] [debug 2018/04/11 16:47:35.438 PDT GatewaySender_ln_0> tid=118] Sending (ParallelQueueRemovalMessage@2344969b > processorId=0 sender=10.118.19.25(27489):32781) to 3 peers > ([10.118.19.25(27492):32783@4(GEODE 1.6.0), > 10.118.19.25(27485):32779@1(GEODE 1.6.0), > 10.118.19.25(27482):32778@2(GEODE 1.6.0)]) via tcp/ip > [vm7] [debug 2018/04/11 16:47:35.439 PDT 10.118.19.25(27489):32781 shared unordered uid=4 port=59119> tid=52] > Received message 'ParallelQueueRemovalMessage@11583f5b processorId=0 > sender=10.118.19.25(27489):32781' from <10.118.19.25(27489):32781> > > i.e. the dropped key was in the map, but before sending a QueueRemovalMessage > the sender is closed and cleared the map. -- This message was sent by Atlassian JIRA (v7.6.3#76005)
[jira] [Commented] (GEODE-5056) ParallelGatewaySenderOperationsDUnitTest.testParallelPropagationSenderStartAfterStop_Scenario2 intermittently fail
[ https://issues.apache.org/jira/browse/GEODE-5056?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16444387#comment-16444387 ] ASF subversion and git services commented on GEODE-5056: Commit 6cd24a877c6a19a1243d9582dd7b1f7edbe2859c in geode's branch refs/heads/release/1.6.0 from Xiaojian Zhou [ https://gitbox.apache.org/repos/asf?p=geode.git;h=6cd24a8 ] GEODE-5056: when found the dropped events at primary sender, send (#1794) QueueRemovalMessage for it (cherry picked from commit f7bb77c89a3d19673e8929275fc6c407a4b382bd) > ParallelGatewaySenderOperationsDUnitTest.testParallelPropagationSenderStartAfterStop_Scenario2 > intermittently fail > --- > > Key: GEODE-5056 > URL: https://issues.apache.org/jira/browse/GEODE-5056 > Project: Geode > Issue Type: Bug > Components: wan >Reporter: xiaojian zhou >Assignee: xiaojian zhou >Priority: Major > Labels: pull-request-available > Fix For: 1.6.0 > > Time Spent: 20m > Remaining Estimate: 0h > > After fixe GEODE-4942, I found there's at least one race condition is not > covered. > > [vm6] [debug 2018/04/11 16:47:35.189 PDT Processor2> tid=110] WAN: On primary bucket 57, setting the seq number as 1357 > > [vm7] [info 2018/04/11 16:47:35.150 PDT > tid=19] Started ParallelGatewaySender\{id=ln,remoteDsId=2,isRunning =true} > > [vm7] [debug 2018/04/11 16:47:35.189 PDT 10.118.19.25(27489):32781 shared ordered uid=7 port=59148> tid=95] WAN: > On secondary bucket 57, setting the seq number as 1357 > [vm7] [debug 2018/04/11 16:47:35.190 PDT 10.118.19.25(27489):32781 shared ordered uid=7 port=59148> tid=95] Key : > > 1357 > [vm6] [debug 2018/04/11 16:47:35.190 PDT Processor2> tid=110] register dropped event for primary queue. BucketId is > 57, shadowKey is 1357, prQ is /ln_PARALLEL_GATEWAY_SENDER_QUEUE > > - Note: vm6's sender is restarted and cleanup the map, before the > QueueRemvalMessage is sent out for the map. > [vm6] [info 2018/04/11 16:47:35.249 PDT > tid=19] Started ParallelGatewaySender\{id=ln,remoteDsId=2,isRunning =true} > [vm6] [debug 2018/04/11 16:47:35.437 PDT GatewaySender_ln_0> tid=118] BatchRemovalThread about to query the batch > removal map \{/ln_PARALLEL_GATEWAY_SENDER_QUEUE={96=[1396], 2=[1402], > 83=[1383], 6=[1406], 71=[1371], 87=[1387], 73=[1373], 90=[1390], 77=[1377], > 94=[1394]}} > [vm6] [debug 2018/04/11 16:47:35.753 PDT GatewaySender_ln_0> tid=118] BatchRemovalThread about to query the batch > removal map {/ln_PARALLEL_GATEWAY_SENDER_QUEUE={49=[1449], 65=[1465], > 83=[1483], 53=[1453], 71=[1471], 87=[1487], *57=[1457]*, 73=[1473], > 77=[1477], 62=[1462]}} > shadowKey 1457 was created after the sender is restarted > > [vm6] [debug 2018/04/11 16:47:35.438 PDT GatewaySender_ln_0> tid=118] Sending (ParallelQueueRemovalMessage@2344969b > processorId=0 sender=10.118.19.25(27489):32781) to 3 peers > ([10.118.19.25(27492):32783@4(GEODE 1.6.0), > 10.118.19.25(27485):32779@1(GEODE 1.6.0), > 10.118.19.25(27482):32778@2(GEODE 1.6.0)]) via tcp/ip > [vm7] [debug 2018/04/11 16:47:35.439 PDT 10.118.19.25(27489):32781 shared unordered uid=4 port=59119> tid=52] > Received message 'ParallelQueueRemovalMessage@11583f5b processorId=0 > sender=10.118.19.25(27489):32781' from <10.118.19.25(27489):32781> > > i.e. the dropped key was in the map, but before sending a QueueRemovalMessage > the sender is closed and cleared the map. -- This message was sent by Atlassian JIRA (v7.6.3#76005)
[jira] [Commented] (GEODE-5056) ParallelGatewaySenderOperationsDUnitTest.testParallelPropagationSenderStartAfterStop_Scenario2 intermittently fail
[ https://issues.apache.org/jira/browse/GEODE-5056?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16437593#comment-16437593 ] ASF subversion and git services commented on GEODE-5056: Commit f7bb77c89a3d19673e8929275fc6c407a4b382bd in geode's branch refs/heads/develop from Xiaojian Zhou [ https://gitbox.apache.org/repos/asf?p=geode.git;h=f7bb77c ] GEODE-5056: when found the dropped events at primary sender, send (#1794) QueueRemovalMessage for it > ParallelGatewaySenderOperationsDUnitTest.testParallelPropagationSenderStartAfterStop_Scenario2 > intermittently fail > --- > > Key: GEODE-5056 > URL: https://issues.apache.org/jira/browse/GEODE-5056 > Project: Geode > Issue Type: Bug > Components: wan >Reporter: xiaojian zhou >Assignee: xiaojian zhou >Priority: Major > Labels: pull-request-available > Time Spent: 20m > Remaining Estimate: 0h > > After fixe GEODE-4942, I found there's at least one race condition is not > covered. > > [vm6] [debug 2018/04/11 16:47:35.189 PDT Processor2> tid=110] WAN: On primary bucket 57, setting the seq number as 1357 > > [vm7] [info 2018/04/11 16:47:35.150 PDT > tid=19] Started ParallelGatewaySender\{id=ln,remoteDsId=2,isRunning =true} > > [vm7] [debug 2018/04/11 16:47:35.189 PDT 10.118.19.25(27489):32781 shared ordered uid=7 port=59148> tid=95] WAN: > On secondary bucket 57, setting the seq number as 1357 > [vm7] [debug 2018/04/11 16:47:35.190 PDT 10.118.19.25(27489):32781 shared ordered uid=7 port=59148> tid=95] Key : > > 1357 > [vm6] [debug 2018/04/11 16:47:35.190 PDT Processor2> tid=110] register dropped event for primary queue. BucketId is > 57, shadowKey is 1357, prQ is /ln_PARALLEL_GATEWAY_SENDER_QUEUE > > - Note: vm6's sender is restarted and cleanup the map, before the > QueueRemvalMessage is sent out for the map. > [vm6] [info 2018/04/11 16:47:35.249 PDT > tid=19] Started ParallelGatewaySender\{id=ln,remoteDsId=2,isRunning =true} > [vm6] [debug 2018/04/11 16:47:35.437 PDT GatewaySender_ln_0> tid=118] BatchRemovalThread about to query the batch > removal map \{/ln_PARALLEL_GATEWAY_SENDER_QUEUE={96=[1396], 2=[1402], > 83=[1383], 6=[1406], 71=[1371], 87=[1387], 73=[1373], 90=[1390], 77=[1377], > 94=[1394]}} > [vm6] [debug 2018/04/11 16:47:35.753 PDT GatewaySender_ln_0> tid=118] BatchRemovalThread about to query the batch > removal map {/ln_PARALLEL_GATEWAY_SENDER_QUEUE={49=[1449], 65=[1465], > 83=[1483], 53=[1453], 71=[1471], 87=[1487], *57=[1457]*, 73=[1473], > 77=[1477], 62=[1462]}} > shadowKey 1457 was created after the sender is restarted > > [vm6] [debug 2018/04/11 16:47:35.438 PDT GatewaySender_ln_0> tid=118] Sending (ParallelQueueRemovalMessage@2344969b > processorId=0 sender=10.118.19.25(27489):32781) to 3 peers > ([10.118.19.25(27492):32783@4(GEODE 1.6.0), > 10.118.19.25(27485):32779@1(GEODE 1.6.0), > 10.118.19.25(27482):32778@2(GEODE 1.6.0)]) via tcp/ip > [vm7] [debug 2018/04/11 16:47:35.439 PDT 10.118.19.25(27489):32781 shared unordered uid=4 port=59119> tid=52] > Received message 'ParallelQueueRemovalMessage@11583f5b processorId=0 > sender=10.118.19.25(27489):32781' from <10.118.19.25(27489):32781> > > i.e. the dropped key was in the map, but before sending a QueueRemovalMessage > the sender is closed and cleared the map. -- This message was sent by Atlassian JIRA (v7.6.3#76005)
[jira] [Commented] (GEODE-5056) ParallelGatewaySenderOperationsDUnitTest.testParallelPropagationSenderStartAfterStop_Scenario2 intermittently fail
[ https://issues.apache.org/jira/browse/GEODE-5056?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16437549#comment-16437549 ] ASF subversion and git services commented on GEODE-5056: Commit bf6864bb362f774ac2acfa2d002c83df4ca33707 in geode's branch refs/heads/feature/GEODE-5056 from zhouxh [ https://gitbox.apache.org/repos/asf?p=geode.git;h=bf6864b ] GEODE-5056: when found the dropped events at primary sender, send QueueRemovalMessage for it > ParallelGatewaySenderOperationsDUnitTest.testParallelPropagationSenderStartAfterStop_Scenario2 > intermittently fail > --- > > Key: GEODE-5056 > URL: https://issues.apache.org/jira/browse/GEODE-5056 > Project: Geode > Issue Type: Bug > Components: wan >Reporter: xiaojian zhou >Assignee: xiaojian zhou >Priority: Major > Labels: pull-request-available > Time Spent: 10m > Remaining Estimate: 0h > > After fixe GEODE-4942, I found there's at least one race condition is not > covered. > > [vm6] [debug 2018/04/11 16:47:35.189 PDT Processor2> tid=110] WAN: On primary bucket 57, setting the seq number as 1357 > > [vm7] [info 2018/04/11 16:47:35.150 PDT > tid=19] Started ParallelGatewaySender\{id=ln,remoteDsId=2,isRunning =true} > > [vm7] [debug 2018/04/11 16:47:35.189 PDT 10.118.19.25(27489):32781 shared ordered uid=7 port=59148> tid=95] WAN: > On secondary bucket 57, setting the seq number as 1357 > [vm7] [debug 2018/04/11 16:47:35.190 PDT 10.118.19.25(27489):32781 shared ordered uid=7 port=59148> tid=95] Key : > > 1357 > [vm6] [debug 2018/04/11 16:47:35.190 PDT Processor2> tid=110] register dropped event for primary queue. BucketId is > 57, shadowKey is 1357, prQ is /ln_PARALLEL_GATEWAY_SENDER_QUEUE > > - Note: vm6's sender is restarted and cleanup the map, before the > QueueRemvalMessage is sent out for the map. > [vm6] [info 2018/04/11 16:47:35.249 PDT > tid=19] Started ParallelGatewaySender\{id=ln,remoteDsId=2,isRunning =true} > [vm6] [debug 2018/04/11 16:47:35.437 PDT GatewaySender_ln_0> tid=118] BatchRemovalThread about to query the batch > removal map \{/ln_PARALLEL_GATEWAY_SENDER_QUEUE={96=[1396], 2=[1402], > 83=[1383], 6=[1406], 71=[1371], 87=[1387], 73=[1373], 90=[1390], 77=[1377], > 94=[1394]}} > [vm6] [debug 2018/04/11 16:47:35.753 PDT GatewaySender_ln_0> tid=118] BatchRemovalThread about to query the batch > removal map {/ln_PARALLEL_GATEWAY_SENDER_QUEUE={49=[1449], 65=[1465], > 83=[1483], 53=[1453], 71=[1471], 87=[1487], *57=[1457]*, 73=[1473], > 77=[1477], 62=[1462]}} > shadowKey 1457 was created after the sender is restarted > > [vm6] [debug 2018/04/11 16:47:35.438 PDT GatewaySender_ln_0> tid=118] Sending (ParallelQueueRemovalMessage@2344969b > processorId=0 sender=10.118.19.25(27489):32781) to 3 peers > ([10.118.19.25(27492):32783@4(GEODE 1.6.0), > 10.118.19.25(27485):32779@1(GEODE 1.6.0), > 10.118.19.25(27482):32778@2(GEODE 1.6.0)]) via tcp/ip > [vm7] [debug 2018/04/11 16:47:35.439 PDT 10.118.19.25(27489):32781 shared unordered uid=4 port=59119> tid=52] > Received message 'ParallelQueueRemovalMessage@11583f5b processorId=0 > sender=10.118.19.25(27489):32781' from <10.118.19.25(27489):32781> > > i.e. the dropped key was in the map, but before sending a QueueRemovalMessage > the sender is closed and cleared the map. -- This message was sent by Atlassian JIRA (v7.6.3#76005)
[jira] [Commented] (GEODE-5056) ParallelGatewaySenderOperationsDUnitTest.testParallelPropagationSenderStartAfterStop_Scenario2 intermittently fail
[ https://issues.apache.org/jira/browse/GEODE-5056?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16436647#comment-16436647 ] ASF subversion and git services commented on GEODE-5056: Commit 2b31ba8e3c1fe08d33e7fa2361dbe6ce2fedd687 in geode's branch refs/heads/feature/GEODE-5056 from zhouxh [ https://gitbox.apache.org/repos/asf?p=geode.git;h=2b31ba8 ] GEODE-5056: when found the dropped events at primary sender, send QueueRemovalMessage for it > ParallelGatewaySenderOperationsDUnitTest.testParallelPropagationSenderStartAfterStop_Scenario2 > intermittently fail > --- > > Key: GEODE-5056 > URL: https://issues.apache.org/jira/browse/GEODE-5056 > Project: Geode > Issue Type: Bug > Components: wan >Reporter: xiaojian zhou >Assignee: xiaojian zhou >Priority: Major > Labels: pull-request-available > Time Spent: 10m > Remaining Estimate: 0h > > After fixe GEODE-4942, I found there's at least one race condition is not > covered. > > [vm6] [debug 2018/04/11 16:47:35.189 PDT Processor2> tid=110] WAN: On primary bucket 57, setting the seq number as 1357 > > [vm7] [info 2018/04/11 16:47:35.150 PDT > tid=19] Started ParallelGatewaySender\{id=ln,remoteDsId=2,isRunning =true} > > [vm7] [debug 2018/04/11 16:47:35.189 PDT 10.118.19.25(27489):32781 shared ordered uid=7 port=59148> tid=95] WAN: > On secondary bucket 57, setting the seq number as 1357 > [vm7] [debug 2018/04/11 16:47:35.190 PDT 10.118.19.25(27489):32781 shared ordered uid=7 port=59148> tid=95] Key : > > 1357 > [vm6] [debug 2018/04/11 16:47:35.190 PDT Processor2> tid=110] register dropped event for primary queue. BucketId is > 57, shadowKey is 1357, prQ is /ln_PARALLEL_GATEWAY_SENDER_QUEUE > > - Note: vm6's sender is restarted and cleanup the map, before the > QueueRemvalMessage is sent out for the map. > [vm6] [info 2018/04/11 16:47:35.249 PDT > tid=19] Started ParallelGatewaySender\{id=ln,remoteDsId=2,isRunning =true} > [vm6] [debug 2018/04/11 16:47:35.437 PDT GatewaySender_ln_0> tid=118] BatchRemovalThread about to query the batch > removal map \{/ln_PARALLEL_GATEWAY_SENDER_QUEUE={96=[1396], 2=[1402], > 83=[1383], 6=[1406], 71=[1371], 87=[1387], 73=[1373], 90=[1390], 77=[1377], > 94=[1394]}} > [vm6] [debug 2018/04/11 16:47:35.753 PDT GatewaySender_ln_0> tid=118] BatchRemovalThread about to query the batch > removal map {/ln_PARALLEL_GATEWAY_SENDER_QUEUE={49=[1449], 65=[1465], > 83=[1483], 53=[1453], 71=[1471], 87=[1487], *57=[1457]*, 73=[1473], > 77=[1477], 62=[1462]}} > shadowKey 1457 was created after the sender is restarted > > [vm6] [debug 2018/04/11 16:47:35.438 PDT GatewaySender_ln_0> tid=118] Sending (ParallelQueueRemovalMessage@2344969b > processorId=0 sender=10.118.19.25(27489):32781) to 3 peers > ([10.118.19.25(27492):32783@4(GEODE 1.6.0), > 10.118.19.25(27485):32779@1(GEODE 1.6.0), > 10.118.19.25(27482):32778@2(GEODE 1.6.0)]) via tcp/ip > [vm7] [debug 2018/04/11 16:47:35.439 PDT 10.118.19.25(27489):32781 shared unordered uid=4 port=59119> tid=52] > Received message 'ParallelQueueRemovalMessage@11583f5b processorId=0 > sender=10.118.19.25(27489):32781' from <10.118.19.25(27489):32781> > > i.e. the dropped key was in the map, but before sending a QueueRemovalMessage > the sender is closed and cleared the map. -- This message was sent by Atlassian JIRA (v7.6.3#76005)
[jira] [Commented] (GEODE-5056) ParallelGatewaySenderOperationsDUnitTest.testParallelPropagationSenderStartAfterStop_Scenario2 intermittently fail
[ https://issues.apache.org/jira/browse/GEODE-5056?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16436147#comment-16436147 ] ASF subversion and git services commented on GEODE-5056: Commit cd74f9f3a684c41d5cd2b04edf49fc72615087b3 in geode's branch refs/heads/develop from zhouxh [ https://gitbox.apache.org/repos/asf?p=geode.git;h=cd74f9f ] GEODE-5056: set testParallelPropagationSenderStartAfterStop_Scenario2 to be flaky > ParallelGatewaySenderOperationsDUnitTest.testParallelPropagationSenderStartAfterStop_Scenario2 > intermittently fail > --- > > Key: GEODE-5056 > URL: https://issues.apache.org/jira/browse/GEODE-5056 > Project: Geode > Issue Type: Bug > Components: wan >Reporter: xiaojian zhou >Assignee: xiaojian zhou >Priority: Major > > After fixe GEODE-4942, I found there's at least one race condition is not > covered. > > [vm6] [debug 2018/04/11 16:47:35.189 PDT Processor2> tid=110] WAN: On primary bucket 57, setting the seq number as 1357 > > [vm7] [info 2018/04/11 16:47:35.150 PDT > tid=19] Started ParallelGatewaySender\{id=ln,remoteDsId=2,isRunning =true} > > [vm7] [debug 2018/04/11 16:47:35.189 PDT 10.118.19.25(27489):32781 shared ordered uid=7 port=59148> tid=95] WAN: > On secondary bucket 57, setting the seq number as 1357 > [vm7] [debug 2018/04/11 16:47:35.190 PDT 10.118.19.25(27489):32781 shared ordered uid=7 port=59148> tid=95] Key : > > 1357 > [vm6] [debug 2018/04/11 16:47:35.190 PDT Processor2> tid=110] register dropped event for primary queue. BucketId is > 57, shadowKey is 1357, prQ is /ln_PARALLEL_GATEWAY_SENDER_QUEUE > > - Note: vm6's sender is restarted and cleanup the map, before the > QueueRemvalMessage is sent out for the map. > [vm6] [info 2018/04/11 16:47:35.249 PDT > tid=19] Started ParallelGatewaySender\{id=ln,remoteDsId=2,isRunning =true} > [vm6] [debug 2018/04/11 16:47:35.437 PDT GatewaySender_ln_0> tid=118] BatchRemovalThread about to query the batch > removal map \{/ln_PARALLEL_GATEWAY_SENDER_QUEUE={96=[1396], 2=[1402], > 83=[1383], 6=[1406], 71=[1371], 87=[1387], 73=[1373], 90=[1390], 77=[1377], > 94=[1394]}} > [vm6] [debug 2018/04/11 16:47:35.753 PDT GatewaySender_ln_0> tid=118] BatchRemovalThread about to query the batch > removal map {/ln_PARALLEL_GATEWAY_SENDER_QUEUE={49=[1449], 65=[1465], > 83=[1483], 53=[1453], 71=[1471], 87=[1487], *57=[1457]*, 73=[1473], > 77=[1477], 62=[1462]}} > shadowKey 1457 was created after the sender is restarted > > [vm6] [debug 2018/04/11 16:47:35.438 PDT GatewaySender_ln_0> tid=118] Sending (ParallelQueueRemovalMessage@2344969b > processorId=0 sender=10.118.19.25(27489):32781) to 3 peers > ([10.118.19.25(27492):32783@4(GEODE 1.6.0), > 10.118.19.25(27485):32779@1(GEODE 1.6.0), > 10.118.19.25(27482):32778@2(GEODE 1.6.0)]) via tcp/ip > [vm7] [debug 2018/04/11 16:47:35.439 PDT 10.118.19.25(27489):32781 shared unordered uid=4 port=59119> tid=52] > Received message 'ParallelQueueRemovalMessage@11583f5b processorId=0 > sender=10.118.19.25(27489):32781' from <10.118.19.25(27489):32781> > > i.e. the dropped key was in the map, but before sending a QueueRemovalMessage > the sender is closed and cleared the map. -- This message was sent by Atlassian JIRA (v7.6.3#76005)