Juan Ramos created GEODE-8004:
---------------------------------

             Summary: Regression Introduced Through GEODE-7565
                 Key: GEODE-8004
                 URL: https://issues.apache.org/jira/browse/GEODE-8004
             Project: Geode
          Issue Type: Bug
          Components: client/server
            Reporter: Juan Ramos


Intermittent errors were observed while executing some internal tests and 
commit 
[dd23ee8|https://github.com/apache/geode/commit/dd23ee8200cba67cea82e57e2e4ccedcdf9e8266]
 was determined to be responsible. As of yet, no local reproduction of the 
issue is available, but work is ongoing to provide a test that can be used to 
debug the issue (a [PR|https://github.com/apache/geode/pull/4974] to revert of 
the original commit has been opened and will be merged shortly, though, this 
ticket is to investigate the root cause so the original commit can be merged 
again into {{develop}}).
---
It seems that a server is trying to read an {{ack}} response and, instead, it 
receives a {{PING}} message:
{noformat}
[error 2020/04/18 23:44:22.758 PDT <poolTimer-edgeDescript-31> tid=0x165] 
Unexpected error in pool task 
<org.apache.geode.cache.client.internal.LiveServerPinger$PingTask@3483b110>
org.apache.geode.InternalGemFireError: Unexpected message type PING
        at 
org.apache.geode.cache.client.internal.AbstractOp.processAck(AbstractOp.java:264)
        at 
org.apache.geode.cache.client.internal.PingOp$PingOpImpl.processResponse(PingOp.java:82)
        at 
org.apache.geode.cache.client.internal.AbstractOp.processResponse(AbstractOp.java:222)
        at 
org.apache.geode.cache.client.internal.AbstractOp.attemptReadResponse(AbstractOp.java:207)
        at 
org.apache.geode.cache.client.internal.AbstractOp.attempt(AbstractOp.java:382)
        at 
org.apache.geode.cache.client.internal.ConnectionImpl.execute(ConnectionImpl.java:268)
        at 
org.apache.geode.cache.client.internal.pooling.PooledConnection.execute(PooledConnection.java:352)
        at 
org.apache.geode.cache.client.internal.OpExecutorImpl.executeWithPossibleReAuthentication(OpExecutorImpl.java:753)
        at 
org.apache.geode.cache.client.internal.OpExecutorImpl.executeOnServer(OpExecutorImpl.java:332)
        at 
org.apache.geode.cache.client.internal.OpExecutorImpl.executeOn(OpExecutorImpl.java:303)
        at 
org.apache.geode.cache.client.internal.PoolImpl.executeOn(PoolImpl.java:839)
        at org.apache.geode.cache.client.internal.PingOp.execute(PingOp.java:38)
        at 
org.apache.geode.cache.client.internal.LiveServerPinger$PingTask.run2(LiveServerPinger.java:90)
        at 
org.apache.geode.cache.client.internal.PoolImpl$PoolTask.run(PoolImpl.java:1329)
        at 
java.util.concurrent.Executors$RunnableAdapter.call(Executors.java:511)
        at java.util.concurrent.FutureTask.runAndReset(FutureTask.java:308)
        at 
org.apache.geode.internal.ScheduledThreadPoolExecutorWithKeepAlive$DelegatingScheduledFuture.run(ScheduledThreadPoolExecutorWithKeepAlive.java:276)
        at 
java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1149)
        at 
java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:624)
        at java.lang.Thread.run(Thread.java:748)
{noformat}

Around the same time, another member of the distributed system logs the 
following warning, which seems to be related to the original changes as well:
{noformat}
[warn 2020/04/18 23:44:22.757 PDT <ServerConnection on port 29019 Thread 1> 
tid=0x298] Unable to ping non-member 
rs-FullRegression19040559a2i32xlarge-hydra-client-63(bridgegemfire1_host1_4749:4749)<ec><v39>:41003
 for client 
identity(rs-FullRegression19040559a2i32xlarge-hydra-client-63(edgegemfire3_host1_1071:1071:loner):50046:5a182991:edgegemfire3_host1_1071,connection=2
{noformat}




--
This message was sent by Atlassian Jira
(v8.3.4#803005)

Reply via email to