While running PersistentColocatedPartitionedRegionDistributedTest a
thousand times to verify that I've fixed a flaky issue in the test, it hit
an interesting failure trying to send a RequestImageMessage. This generated
a stack trace which caused the test to fail grep for suspect strings. I can
easily suppress this failure, BUT it looks like a bug in message
distribution which may have been introduced by the recent membership
changes (modularization).

Here's the stack trace for anyone who wants it (I'm not working on this):

[fatal 2020/02/19 02:50:04.862 GMT <Pooled Waiting Message Processor 1>
tid=8410] While pushing message
<InitialImageOperation$RequestImageMessage(region
path='/__PR/_B__region2_1'; sender=172.17.0.4(185)<v758>:41003;
keysOnly=false; processorId=40462; waitForInit=false;
checkTombstoneVersions=true;
versionVector=RegionVersionVector[2ab5849689d446bd-a7da0400b0e718f7={rv0
gc0 localVersion=0 local exceptions=[]} others={}, gc={}]; unfinished
keys=[])> to recipients: <172.17.0.4(179)<v757>:41002>
    java.lang.IllegalArgumentException: newPosition > limit: (32768 > 90)
        at
java.base/java.nio.Buffer.createPositionException(Buffer.java:318)
        at java.base/java.nio.Buffer.position(Buffer.java:293)
        at java.base/java.nio.ByteBuffer.position(ByteBuffer.java:1086)
        at
java.base/java.nio.MappedByteBuffer.position(MappedByteBuffer.java:226)
        at
java.base/java.nio.MappedByteBuffer.position(MappedByteBuffer.java:67)
        at
java.base/sun.nio.ch.IOUtil.writeFromNativeBuffer(IOUtil.java:116)
        at java.base/sun.nio.ch.IOUtil.write(IOUtil.java:58)
        at java.base/sun.nio.ch.IOUtil.write(IOUtil.java:50)
        at
java.base/sun.nio.ch.SocketChannelImpl.write(SocketChannelImpl.java:463)
        at
org.apache.geode.internal.tcp.Connection.writeFully(Connection.java:2587)
        at
org.apache.geode.internal.tcp.Connection.sendPreserialized(Connection.java:1867)
        at
org.apache.geode.internal.tcp.MsgStreamer.realFlush(MsgStreamer.java:324)
        at
org.apache.geode.internal.tcp.MsgStreamer.writeMessage(MsgStreamer.java:249)
        at
org.apache.geode.distributed.internal.direct.DirectChannel.sendToMany(DirectChannel.java:393)
        at
org.apache.geode.distributed.internal.direct.DirectChannel.sendToOne(DirectChannel.java:248)
        at
org.apache.geode.distributed.internal.direct.DirectChannel.send(DirectChannel.java:604)
        at
org.apache.geode.distributed.internal.DistributionImpl.directChannelSend(DistributionImpl.java:348)
        at
org.apache.geode.distributed.internal.DistributionImpl.send(DistributionImpl.java:293)
        at
org.apache.geode.distributed.internal.ClusterDistributionManager.sendViaMembershipManager(ClusterDistributionManager.java:2060)
        at
org.apache.geode.distributed.internal.ClusterDistributionManager.sendOutgoing(ClusterDistributionManager.java:1987)
        at
org.apache.geode.distributed.internal.ClusterDistributionManager.sendMessage(ClusterDistributionManager.java:2024)
        at
org.apache.geode.distributed.internal.ClusterDistributionManager.putOutgoing(ClusterDistributionManager.java:1084)
        at
org.apache.geode.internal.cache.InitialImageOperation.getFromOne(InitialImageOperation.java:514)
        at
org.apache.geode.internal.cache.DistributedRegion.getInitialImageAndRecovery(DistributedRegion.java:1222)
        at
org.apache.geode.internal.cache.DistributedRegion.initialize(DistributedRegion.java:1082)
        at
org.apache.geode.internal.cache.BucketRegion.initialize(BucketRegion.java:259)
        at
org.apache.geode.internal.cache.LocalRegion.createSubregion(LocalRegion.java:983)
        at
org.apache.geode.internal.cache.PartitionedRegionDataStore.createBucketRegion(PartitionedRegionDataStore.java:785)
        at
org.apache.geode.internal.cache.PartitionedRegionDataStore.grabFreeBucket(PartitionedRegionDataStore.java:460)
        at
org.apache.geode.internal.cache.PartitionedRegionDataStore.grabFreeBucketRecursively(PartitionedRegionDataStore.java:319)
        at
org.apache.geode.internal.cache.PartitionedRegionDataStore.grabBucket(PartitionedRegionDataStore.java:2896)
        at
org.apache.geode.internal.cache.partitioned.ManageBackupBucketMessage.operateOnPartitionedRegion(ManageBackupBucketMessage.java:159)
        at
org.apache.geode.internal.cache.partitioned.PartitionMessage.process(PartitionMessage.java:333)
        at
org.apache.geode.distributed.internal.DistributionMessage.scheduleAction(DistributionMessage.java:394)
        at
org.apache.geode.distributed.internal.DistributionMessage$1.run(DistributionMessage.java:458)
        at
java.base/java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1128)
        at
java.base/java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:628)
        at
org.apache.geode.distributed.internal.ClusterOperationExecutors.runUntilShutdown(ClusterOperationExecutors.java:449)
        at
org.apache.geode.distributed.internal.ClusterOperationExecutors.doWaitingThread(ClusterOperationExecutors.java:416)
        at
org.apache.geode.logging.internal.executors.LoggingThreadFactory.lambda$newThread$0(LoggingThreadFactory.java:119)
        at java.base/java.lang.Thread.run(Thread.java:834)

Reply via email to