[ https://issues.apache.org/jira/browse/GEODE-7072?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16912718#comment-16912718 ]
Bruce Schuchardt commented on GEODE-7072:
-----------------------------------------
The failure that Aaron mentioned occurs in the same test and is also a
deserialization issue, but it does not concern locator recovery. Instead, a
member running v1.3.0 received a FinalCheckPassedMessage, a class that did not
exist until v1.4.0:
{noformat}
[error 2019/08/20 21:29:02.924 GMT <unicast receiver,c25ef1397f7e-51259> tid=0x2f] Exception deserializing message payload: [dst: 172.17.0.13<v1>:32770, src: 172.17.0.13<v12>:41002 (2 headers), size=107 bytes, flags=OOB|DONT_BUNDLE|NO_FC|SKIP_BARRIER]
org.apache.geode.internal.DSFIDNotFoundException: Unknown DataSerializableFixedID: -158
at org.apache.geode.internal.DSFIDFactory.create(DSFIDFactory.java:1003)
at org.apache.geode.internal.InternalDataSerializer.basicReadObject(InternalDataSerializer.java:2693)
at org.apache.geode.DataSerializer.readObject(DataSerializer.java:2961)
at org.apache.geode.distributed.internal.membership.gms.messenger.JGroupsMessenger.deserializeMessage(JGroupsMessenger.java:1121)
at org.apache.geode.distributed.internal.membership.gms.messenger.JGroupsMessenger.readJGMessage(JGroupsMessenger.java:1013)
at org.apache.geode.distributed.internal.membership.gms.messenger.JGroupsMessenger$JGroupsReceiver.receive(JGroupsMessenger.java:1279)
at org.jgroups.JChannel.invokeCallback(JChannel.java:816)
at org.jgroups.JChannel.up(JChannel.java:741)
at org.jgroups.stack.ProtocolStack.up(ProtocolStack.java:1030)
at org.jgroups.protocols.FRAG2.up(FRAG2.java:165)
at org.jgroups.protocols.FlowControl.up(FlowControl.java:390)
at org.jgroups.protocols.UNICAST3.deliverMessage(UNICAST3.java:1070)
at org.jgroups.protocols.UNICAST3.handleDataReceived(UNICAST3.java:785)
{noformat}
A coding error in GMSHealthMonitor causes this message to be sent to the
suspect-initiator whenever an availability check passes for the suspect and the
suspect-initiator is running v1.3.0 or later. The version check needs to be
changed to v1.4.0 or later, since older members cannot deserialize the message.
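To make the intended gate concrete, here is a minimal sketch of the two checks
side by side. It is not the GMSHealthMonitor code: ProductVersion,
buggyShouldSend and fixedShouldSend are hypothetical names used only for
illustration, and the only facts taken from this issue are the two version
thresholds (v1.3.0 in the faulty check, v1.4.0 as the release that introduced
FinalCheckPassedMessage).

```java
public class VersionGateSketch {

    /** Hypothetical stand-in for a product version; not Geode's own version class. */
    static final class ProductVersion implements Comparable<ProductVersion> {
        final int major;
        final int minor;
        final int patch;

        ProductVersion(int major, int minor, int patch) {
            this.major = major;
            this.minor = minor;
            this.patch = patch;
        }

        /** Parses a dotted version such as "1.4.0" into its numeric parts. */
        static ProductVersion parse(String text) {
            String[] parts = text.split("\\.");
            return new ProductVersion(
                Integer.parseInt(parts[0]),
                Integer.parseInt(parts[1]),
                Integer.parseInt(parts[2]));
        }

        /** Compares numerically, so 1.11.0 sorts after 1.4.0. */
        @Override
        public int compareTo(ProductVersion other) {
            if (major != other.major) {
                return Integer.compare(major, other.major);
            }
            if (minor != other.minor) {
                return Integer.compare(minor, other.minor);
            }
            return Integer.compare(patch, other.patch);
        }
    }

    // Threshold used by the faulty check described in this comment.
    static final ProductVersion V1_3_0 = ProductVersion.parse("1.3.0");
    // Release in which FinalCheckPassedMessage first existed.
    static final ProductVersion V1_4_0 = ProductVersion.parse("1.4.0");

    /** Faulty gate: also selects v1.3.0 members, which cannot deserialize the message. */
    static boolean buggyShouldSend(ProductVersion initiatorVersion) {
        return initiatorVersion.compareTo(V1_3_0) >= 0;
    }

    /** Intended gate: only members at or above v1.4.0 receive the message. */
    static boolean fixedShouldSend(ProductVersion initiatorVersion) {
        return initiatorVersion.compareTo(V1_4_0) >= 0;
    }

    public static void main(String[] args) {
        for (String v : new String[] {"1.2.0", "1.3.0", "1.4.0", "1.11.0"}) {
            ProductVersion version = ProductVersion.parse(v);
            System.out.println(v + " buggy=" + buggyShouldSend(version)
                + " fixed=" + fixedShouldSend(version));
        }
    }
}
```

The two gates differ only for initiators at or above v1.3.0 and below v1.4.0,
which matches the from_v130 test configuration that failed here.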
> CI Failure: WANRollingUpgradeEventProcessingMixedSiteOneCurrentSiteTwo > EventProcessingMixedSiteOneCurrentSiteTwo[from_v130] FAILED
> ------------------------------------------------------------------------------------------------------------------------------------
>
> Key: GEODE-7072
> URL: https://issues.apache.org/jira/browse/GEODE-7072
> Project: Geode
> Issue Type: Test
> Components: wan
> Reporter: Owen Nichols
> Assignee: Bruce Schuchardt
> Priority: Major
> Fix For: 1.11.0
>
> Time Spent: 20m
> Remaining Estimate: 0h
>
> {noformat}
> org.apache.geode.cache.wan.WANRollingUpgradeEventProcessingMixedSiteOneCurrentSiteTwo > EventProcessingMixedSiteOneCurrentSiteTwo[from_v130] FAILED
> org.apache.geode.test.dunit.RMIException: While invoking org.apache.geode.cache.wan.WANRollingUpgradeEventProcessingMixedSiteOneCurrentSiteTwo$$Lambda$47/1509632157.run in VM 0 running on Host aac3b458d9ea with 7 VMs with version 130
> at org.apache.geode.test.dunit.VM.executeMethodOnObject(VM.java:579)
> at org.apache.geode.test.dunit.VM.invoke(VM.java:406)
> at org.apache.geode.cache.wan.WANRollingUpgradeEventProcessingMixedSiteOneCurrentSiteTwo.EventProcessingMixedSiteOneCurrentSiteTwo(WANRollingUpgradeEventProcessingMixedSiteOneCurrentSiteTwo.java:63)
> Caused by: org.apache.geode.InternalGemFireException: Unable to recover previous membership view from locator26547view.dat
> at org.apache.geode.distributed.internal.membership.gms.locator.GMSLocator.recoverFromFile(GMSLocator.java:462)
> at org.apache.geode.distributed.internal.membership.gms.locator.GMSLocator.recover(GMSLocator.java:387)
> at org.apache.geode.distributed.internal.membership.gms.locator.GMSLocator.init(GMSLocator.java:146)
> at org.apache.geode.distributed.internal.InternalLocator$PrimaryHandler.init(InternalLocator.java:1225)
> at org.apache.geode.distributed.internal.tcpserver.TcpServer.start(TcpServer.java:232)
> at org.apache.geode.distributed.internal.InternalLocator.startTcpServer(InternalLocator.java:517)
> at org.apache.geode.distributed.internal.InternalLocator.startPeerLocation(InternalLocator.java:575)
> at org.apache.geode.distributed.internal.InternalLocator.startLocator(InternalLocator.java:321)
> at org.apache.geode.distributed.Locator.startLocator(Locator.java:253)
> at org.apache.geode.distributed.Locator.startLocatorAndDS(Locator.java:140)
> at org.apache.geode.cache.wan.WANRollingUpgradeDUnitTest.startLocator(WANRollingUpgradeDUnitTest.java:105)
> at org.apache.geode.cache.wan.WANRollingUpgradeDUnitTest.startLocator(WANRollingUpgradeDUnitTest.java:97)
> at org.apache.geode.cache.wan.WANRollingUpgradeEventProcessingMixedSiteOneCurrentSiteTwo.lambda$EventProcessingMixedSiteOneCurrentSiteTwo$6f8ee815$1(WANRollingUpgradeEventProcessingMixedSiteOneCurrentSiteTwo.java:63)
> Caused by: org.apache.geode.SerializationException: Could not create an instance of org.apache.geode.distributed.internal.membership.NetView .
> at org.apache.geode.internal.InternalDataSerializer.invokeFromData(InternalDataSerializer.java:2381)
> at org.apache.geode.internal.DSFIDFactory.create(DSFIDFactory.java:986)
> at org.apache.geode.internal.InternalDataSerializer.basicReadObject(InternalDataSerializer.java:2693)
> at org.apache.geode.DataSerializer.readObject(DataSerializer.java:2961)
> at org.apache.geode.distributed.internal.membership.gms.locator.GMSLocator.recoverFromFile(GMSLocator.java:440)
> ... 12 more
> Caused by: org.apache.geode.SerializationException: Could not create an instance of org.apache.geode.distributed.internal.membership.gms.GMSMember .
> at org.apache.geode.internal.InternalDataSerializer.invokeFromData(InternalDataSerializer.java:2381)
> at org.apache.geode.internal.DSFIDFactory.create(DSFIDFactory.java:986)
> at org.apache.geode.internal.InternalDataSerializer.basicReadObject(InternalDataSerializer.java:2693)
> at org.apache.geode.DataSerializer.readObject(DataSerializer.java:2961)
> at org.apache.geode.distributed.internal.membership.NetView.fromData(NetView.java:603)
> at org.apache.geode.internal.InternalDataSerializer.invokeFromData(InternalDataSerializer.java:2370)
> ... 16 more
> Caused by: java.io.IOException: While reading an InetAddress
> at org.apache.geode.DataSerializer.readInetAddress(DataSerializer.java:474)
> at org.apache.geode.distributed.internal.membership.gms.GMSMember.readEssentialData(GMSMember.java:512)
> at org.apache.geode.distributed.internal.membership.gms.GMSMember.fromData(GMSMember.java:495)
> at org.apache.geode.internal.InternalDataSerializer.invokeFromData(InternalDataSerializer.java:2370)
> ... 21 more
> Caused by: java.net.UnknownHostException: addr is of illegal length
> at java.net.InetAddress.getByAddress(InetAddress.java:1043)
> at java.net.InetAddress.getByAddress(InetAddress.java:1440)
> at org.apache.geode.DataSerializer.readInetAddress(DataSerializer.java:467)
> ... 24 more
> {noformat}
> seen in
> https://concourse.apachegeode-ci.info/teams/main/pipelines/apache-develop-main/jobs/UpgradeTestOpenJDK8/builds/969
> =-=-=-=-=-=-=-=-=-=-=-=-=-=-= Test Results URI =-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=
> http://files.apachegeode-ci.info/builds/apache-develop-main/1.11.0-SNAPSHOT.0023/test-results/upgradeTest/1565373787/
> =-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=
> Test report artifacts from this job are available at:
> http://files.apachegeode-ci.info/builds/apache-develop-main/1.11.0-SNAPSHOT.0023/test-artifacts/1565373787/upgradetestfiles-OpenJDK8-1.11.0-SNAPSHOT.0023.tgz