[jira] [Commented] (GEODE-10148) [CI Failure] : JMXMBeanFederationDUnitTest > MBeanFederationAddRemoveServer FAILED

2022-04-15 Thread Barrett Oglesby (Jira)


[ 
https://issues.apache.org/jira/browse/GEODE-10148?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17522937#comment-17522937
 ] 

Barrett Oglesby commented on GEODE-10148:
-

I think this is where the problem is:

{{LocalManager.startLocalManagement}} runs the {{ManagementTask}} once, right 
when it starts.

With logging added, the call to {{managementTask.get().run()}} returns right 
away. Even though the comment says it's a synchronous call, it isn't.
{noformat}
[vm3] [warn 2022/03/23 16:16:02.173 PDT server-3  tid=0x12] XXX LocalManager.startLocalManagement about to run managementTask

[vm3] [warn 2022/03/23 16:16:02.173 PDT server-3  tid=0x12] XXX LocalManager.startLocalManagement done managementTask
{noformat}
Then, {{LocalManager.markForFederation}} adds the mbeans to the 
{{federatedComponentMap}}:
{noformat}
[vm3] [warn 2022/03/23 16:16:02.209 PDT server-3  tid=0x12] XXX LocalManager.markForFederation about to add to federatedComponentMap objName=GemFire:type=Member,member=server-3

[vm3] [warn 2022/03/23 16:16:02.364 PDT server-3  tid=0x12] XXX LocalManager.markForFederation about to add to federatedComponentMap objName=GemFire:service=Region,name="/test-region-1",type=Member,member=server-3

[vm3] [warn 2022/03/23 16:16:02.437 PDT server-3  tid=0x12] XXX LocalManager.markForFederation about to add to federatedComponentMap objName=GemFire:service=CacheServer,port=20017,type=Member,member=server-3
{noformat}
The CacheServer mbean above is the one that is missing in the failed run.

Then, about two seconds later, the {{Management Task}} thread runs the 
{{ManagementTask}} started above to put the mbeans into the region:
{noformat}
[vm3] [warn 2022/03/23 16:16:04.177 PDT server-3  tid=0x46] XXX LocalManager.doManagementTask about to putAll replicaMap={GemFire:service=CacheServer,port=20017,type=Member,member=server-3=ObjectName = GemFire:service=CacheServer,port=20017,type=Member,member=server-3, GemFire:service=Region,name="/test-region-1",type=Member,member=server-3=ObjectName = GemFire:service=Region,name="/test-region-1",type=Member,member=server-3, GemFire:type=Member,member=server-3=ObjectName = GemFire:type=Member,member=server-3}

[vm3] [warn 2022/03/23 16:16:04.211 PDT server-3  tid=0x46] XXX LocalManager.doManagementTask done putAll replicaMap={GemFire:service=CacheServer,port=20017,type=Member,member=server-3=ObjectName = GemFire:service=CacheServer,port=20017,type=Member,member=server-3, GemFire:service=Region,name="/test-region-1",type=Member,member=server-3=ObjectName = GemFire:service=Region,name="/test-region-1",type=Member,member=server-3, GemFire:type=Member,member=server-3=ObjectName = GemFire:type=Member,member=server-3}
{noformat}
If the {{Management Task}} thread runs after the Region mbean is added but 
before the CacheServer mbean is added, the {{putAll}} would not include the 
CacheServer mbean, and this issue would reproduce.
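
To make the suspected interleaving concrete, here is a minimal single-threaded sketch. It is not Geode's {{LocalManager}}: the class {{FederationRaceSketch}}, the {{monitoringRegion}} map and both {{run...}} methods are hypothetical names used only for illustration, and the two-thread timing is replaced by an explicit call order so the result is deterministic.

```java
import java.util.ArrayList;
import java.util.LinkedHashMap;
import java.util.List;
import java.util.Map;

// Hypothetical, simplified model of the suspected race; not Geode's actual code.
class FederationRaceSketch {
  // Stands in for federatedComponentMap: mbeans marked for federation.
  private final Map<String, String> federatedComponentMap = new LinkedHashMap<>();
  // Stands in for what has been published to the region so far (assumption).
  private final Map<String, String> monitoringRegion = new LinkedHashMap<>();

  void markForFederation(String objectName) {
    federatedComponentMap.put(objectName, "ObjectName = " + objectName);
  }

  // Stands in for one run of the management task: publish whatever is marked right now.
  void doManagementTask() {
    monitoringRegion.putAll(federatedComponentMap);
  }

  List<String> published() {
    return new ArrayList<>(monitoringRegion.keySet());
  }

  // The task fires after the Region mbean is marked but before the CacheServer mbean.
  static List<String> runWithTaskBetweenRegionAndCacheServer() {
    FederationRaceSketch manager = new FederationRaceSketch();
    manager.markForFederation("GemFire:type=Member,member=server-3");
    manager.markForFederation(
        "GemFire:service=Region,name=\"/test-region-1\",type=Member,member=server-3");
    manager.doManagementTask();
    manager.markForFederation(
        "GemFire:service=CacheServer,port=20017,type=Member,member=server-3");
    return manager.published();
  }

  // The task fires after all three mbeans are marked, as in the passing run logged above.
  static List<String> runWithTaskAfterAllAdds() {
    FederationRaceSketch manager = new FederationRaceSketch();
    manager.markForFederation("GemFire:type=Member,member=server-3");
    manager.markForFederation(
        "GemFire:service=Region,name=\"/test-region-1\",type=Member,member=server-3");
    manager.markForFederation(
        "GemFire:service=CacheServer,port=20017,type=Member,member=server-3");
    manager.doManagementTask();
    return manager.published();
  }

  public static void main(String[] args) {
    System.out.println(runWithTaskBetweenRegionAndCacheServer());
    System.out.println(runWithTaskAfterAllAdds());
  }
}
```

In the first ordering the published list holds only the Member and Region mbeans, which matches the shape of the failed assertion. Presumably a later run of the task would publish the CacheServer mbean as well, so the test would be observing a transient state.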


> [CI Failure] : JMXMBeanFederationDUnitTest > MBeanFederationAddRemoveServer 
> FAILED
> --
>
>                 Key: GEODE-10148
>                 URL: https://issues.apache.org/jira/browse/GEODE-10148
>             Project: Geode
>          Issue Type: Bug
>          Components: jmx
>    Affects Versions: 1.15.0
>            Reporter: Nabarun Nag
>            Priority: Major
>              Labels: test-stability
>
> JMXMBeanFederationDUnitTest > MBeanFederationAddRemoveServer FAILED
> java.lang.AssertionError: 
> Expecting actual:
>   ["GemFire:service=AccessControl,type=Distributed",
> "GemFire:service=CacheServer,port=20842,type=Member,member=server-1",
> "GemFire:service=CacheServer,port=20846,type=Member,member=server-2",
> "GemFire:service=DiskStore,name=cluster_config,type=Member,member=locator-one",
> "GemFire:service=FileUploader,type=Distributed",
> "GemFire:service=Locator,type=Member,member=locator-one",
> "GemFire:service=LockService,name=__CLUSTER_CONFIG_LS,type=Distributed",
> "GemFire:service=LockService,name=__CLUSTER_CONFIG_LS,type=Member,member=locator-one",
> "GemFire:service=Manager,type=Member,member=locator-one",
> "GemFire:service=Region,name="/test-region-1",type=Distributed",
> "GemFire:service=Region,name="/test-region-1",type=Member,member=server-1",
> "GemFire:service=Region,name="/test-region-1",type=Member,member=server-2",
> "GemFire:service=Region,name="/test-region-1",type=Member,member=server-3",
> "GemFire:service=System,type=Distributed",
> "GemFire:type=Member,member=locator-one",
> "GemFire:type=Member,member=server-1",
> "GemFire:type=Member,member=server-2",
> "GemFire:type=Member,member=server-3"]
> to contain exactly (and in same order):
>   ["GemFire:service=AccessControl,type=Distributed",
> "GemFire:service=CacheServer,port=20842,type=Member,member=server-1",

[jira] [Commented] (GEODE-10148) [CI Failure] : JMXMBeanFederationDUnitTest > MBeanFederationAddRemoveServer FAILED

2022-03-23 Thread Barrett Oglesby (Jira)


[ 
https://issues.apache.org/jira/browse/GEODE-10148?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17511520#comment-17511520
 ] 

Barrett Oglesby commented on GEODE-10148:
-

The test failure says that the result of this call to the locator is missing 
the CacheServer MBean that is in the expectedMBeans list:

List intermediateMBeans = getFederatedGemfireBeansFrom(locator1);

The MBean list in the locator is updated asynchronously by the ManagementTask 
in each member.

See ManagementResourceRepo.putAllInLocalMonitoringRegion. The 
localMonitoringRegion is DISTRIBUTED_NO_ACK, so the put does not wait for the 
locator to acknowledge the update.
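
Because the update reaches the locator asynchronously, one common way to make an assertion like this robust is to poll until the expected MBeans appear or a timeout expires. The sketch below uses only the JDK. {{AwaitFederationSketch}}, {{awaitContainsAll}} and the fake locator supplier are hypothetical names for illustration, not part of the test or of Geode, and this is not necessarily how the test should be fixed. Test suites often use a library such as Awaitility for the same purpose.

```java
import java.util.Arrays;
import java.util.List;
import java.util.concurrent.TimeUnit;
import java.util.concurrent.atomic.AtomicInteger;
import java.util.function.Supplier;

// Hypothetical polling helper; a sketch of "retry until the async update lands".
class AwaitFederationSketch {

  // Re-reads the actual list until it contains every expected element or the timeout expires.
  static boolean awaitContainsAll(Supplier<List<String>> actual, List<String> expected,
      long timeoutMillis, long pollMillis) {
    long deadline = System.nanoTime() + TimeUnit.MILLISECONDS.toNanos(timeoutMillis);
    while (true) {
      if (actual.get().containsAll(expected)) {
        return true;
      }
      if (System.nanoTime() - deadline >= 0) {
        return false;
      }
      try {
        Thread.sleep(pollMillis);
      } catch (InterruptedException e) {
        // Preserve the interrupt for the caller and give up waiting.
        Thread.currentThread().interrupt();
        return false;
      }
    }
  }

  // Simulates a locator whose view is incomplete for the first two reads.
  static boolean demo() {
    List<String> expected = Arrays.asList(
        "GemFire:type=Member,member=server-3",
        "GemFire:service=CacheServer,port=20850,type=Member,member=server-3");
    AtomicInteger calls = new AtomicInteger();
    Supplier<List<String>> fakeLocator = () -> calls.incrementAndGet() < 3
        ? Arrays.asList("GemFire:type=Member,member=server-3")
        : expected;
    return awaitContainsAll(fakeLocator, expected, 5_000, 10);
  }

  public static void main(String[] args) {
    System.out.println("all expected mbeans seen: " + demo());
  }
}
```

In a real test the supplier would wrap the call that reads the MBean names from the locator, and the assertion would run only after the wait succeeds.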



> [CI Failure] : JMXMBeanFederationDUnitTest > MBeanFederationAddRemoveServer 
> FAILED
> --
>
>                 Key: GEODE-10148
>                 URL: https://issues.apache.org/jira/browse/GEODE-10148
>             Project: Geode
>          Issue Type: Bug
>          Components: jmx
>    Affects Versions: 1.15.0
>            Reporter: Nabarun Nag
>            Assignee: Owen Nichols
>            Priority: Major
>              Labels: needsTriage
>
> JMXMBeanFederationDUnitTest > MBeanFederationAddRemoveServer FAILED
> java.lang.AssertionError: 
> Expecting actual:
>   ["GemFire:service=AccessControl,type=Distributed",
> "GemFire:service=CacheServer,port=20842,type=Member,member=server-1",
> "GemFire:service=CacheServer,port=20846,type=Member,member=server-2",
> "GemFire:service=DiskStore,name=cluster_config,type=Member,member=locator-one",
> "GemFire:service=FileUploader,type=Distributed",
> "GemFire:service=Locator,type=Member,member=locator-one",
> "GemFire:service=LockService,name=__CLUSTER_CONFIG_LS,type=Distributed",
> "GemFire:service=LockService,name=__CLUSTER_CONFIG_LS,type=Member,member=locator-one",
> "GemFire:service=Manager,type=Member,member=locator-one",
> "GemFire:service=Region,name="/test-region-1",type=Distributed",
> "GemFire:service=Region,name="/test-region-1",type=Member,member=server-1",
> "GemFire:service=Region,name="/test-region-1",type=Member,member=server-2",
> "GemFire:service=Region,name="/test-region-1",type=Member,member=server-3",
> "GemFire:service=System,type=Distributed",
> "GemFire:type=Member,member=locator-one",
> "GemFire:type=Member,member=server-1",
> "GemFire:type=Member,member=server-2",
> "GemFire:type=Member,member=server-3"]
> to contain exactly (and in same order):
>   ["GemFire:service=AccessControl,type=Distributed",
> "GemFire:service=CacheServer,port=20842,type=Member,member=server-1",
> "GemFire:service=CacheServer,port=20846,type=Member,member=server-2",
> "GemFire:service=CacheServer,port=20850,type=Member,member=server-3",
> "GemFire:service=DiskStore,name=cluster_config,type=Member,member=locator-one",
> "GemFire:service=FileUploader,type=Distributed",
> "GemFire:service=Locator,type=Member,member=locator-one",
> "GemFire:service=LockService,name=__CLUSTER_CONFIG_LS,type=Distributed",
> "GemFire:service=LockService,name=__CLUSTER_CONFIG_LS,type=Member,member=locator-one",
> "GemFire:service=Manager,type=Member,member=locator-one",
> "GemFire:service=Region,name="/test-region-1",type=Distributed",
> "GemFire:service=Region,name="/test-region-1",type=Member,member=server-1",
> "GemFire:service=Region,name="/test-region-1",type=Member,member=server-2",
> "GemFire:service=Region,name="/test-region-1",type=Member,member=server-3",
> "GemFire:service=System,type=Distributed",
> "GemFire:type=Member,member=locator-one",
> "GemFire:type=Member,member=server-1",
> "GemFire:type=Member,member=server-2",
> "GemFire:type=Member,member=server-3"]
> but could not find the following elements:
>   ["GemFire:service=CacheServer,port=20850,type=Member,member=server-3"]
> at org.apache.geode.management.internal.JMXMBeanFederationDUnitTest.MBeanFederationAddRemoveServer(JMXMBeanFederationDUnitTest.java:130)
> 8352 tests completed, 1 failed, 414 skipped



--
This message was sent by Atlassian Jira
(v8.20.1#820001)


[jira] [Commented] (GEODE-10148) [CI Failure] : JMXMBeanFederationDUnitTest > MBeanFederationAddRemoveServer FAILED

2022-03-23 Thread Anilkumar Gingade (Jira)


[ 
https://issues.apache.org/jira/browse/GEODE-10148?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17511454#comment-17511454
 ] 

Anilkumar Gingade commented on GEODE-10148:
---

From Barry:
>> The communication between servers and the JMX manager (locator) is async (a 
>> no-ack region). This test is most likely failing because of that.




[jira] [Commented] (GEODE-10148) [CI Failure] : JMXMBeanFederationDUnitTest > MBeanFederationAddRemoveServer FAILED

2022-03-21 Thread Geode Integration (Jira)


[ 
https://issues.apache.org/jira/browse/GEODE-10148?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17510158#comment-17510158
 ] 

Geode Integration commented on GEODE-10148:
---

Seen in [distributed-test-openjdk8 
#1511|https://concourse.apachegeode-ci.info/teams/main/pipelines/apache-develop-mass-test-run/jobs/distributed-test-openjdk8/builds/1511]
 ... see [test 
results|http://files.apachegeode-ci.info/builds/apache-develop-mass-test-run/1.15.0-build.1014/test-results/distributedTest/1647663595/]
 or download 
[artifacts|http://files.apachegeode-ci.info/builds/apache-develop-mass-test-run/1.15.0-build.1014/test-artifacts/1647663595/distributedtestfiles-openjdk8-1.15.0-build.1014.tgz].
