[jira] [Updated] (GEODE-10383) lucene may log many warnings about BucketNotFound during normal operations

2022-06-16 Thread Anilkumar Gingade (Jira)


 [ 
https://issues.apache.org/jira/browse/GEODE-10383?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Anilkumar Gingade updated GEODE-10383:
--
Labels:   (was: needsTriage)

> lucene may log many warnings about BucketNotFound during normal operations
> --
>
> Key: GEODE-10383
> URL: https://issues.apache.org/jira/browse/GEODE-10383
> Project: Geode
>  Issue Type: Bug
>  Components: lucene
>Affects Versions: 1.16.0
>Reporter: Darrel Schneider
>Priority: Major
>
> This seems much like GEODE-1557 but it may be this exception is coming from a 
> different place. Once again it involves a test that has primaries moving 
> around due to new servers starting up.
> I don't see any value in logging the stack trace in this case.
> If we are going to log about this should the message also include an 
> explanation as to why it can happen during normal operations?
> Also should it really be a warning since it can occur during normal 
> operations? It looks like the fix for GEODE-1557 made it "debug" level.
> I see a bunch of the following warnings logged:
> {noformat}
> .cache.lucene.internal.distributed.LuceneQueryFunction@14894ec8
> org.apache.geode.cache.execute.FunctionException: 
> org.apache.geode.internal.cache.execute.InternalFunctionInvocationTargetException:
>  org.apache.geode.internal.cache.BucketNotFo
> undException: Unable to find lucene index because no longer primary for 
> bucket 326
> at 
> org.apache.geode.internal.cache.partitioned.PRFunctionStreamingResultCollector.getResultInternal(PRFunctionStreamingResultCollector.java:126)
> at 
> org.apache.geode.internal.cache.execute.ResultCollectorHolder.getResult(ResultCollectorHolder.java:53)
> at 
> org.apache.geode.internal.cache.partitioned.PRFunctionStreamingResultCollector.getResult(PRFunctionStreamingResultCollector.java:87)
> at 
> org.apache.geode.internal.cache.tier.sockets.command.ExecuteRegionFunctionGeode18.executeFunctionWithResult(ExecuteRegionFunctionGeode18.java:50)
> at 
> org.apache.geode.internal.cache.tier.sockets.command.ExecuteRegionFunction66.cmdExecute(ExecuteRegionFunction66.java:203)
> at 
> org.apache.geode.internal.cache.tier.sockets.BaseCommand.execute(BaseCommand.java:187)
> at 
> org.apache.geode.internal.cache.tier.sockets.ServerConnection.doNormalMessage(ServerConnection.java:880)
> at 
> org.apache.geode.internal.cache.tier.sockets.ServerConnection.doOneMessage(ServerConnection.java:1074)
> at 
> org.apache.geode.internal.cache.tier.sockets.ServerConnection.run(ServerConnection.java:1356)
> at 
> java.base/java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1136)
> at 
> java.base/java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:635)
> at 
> org.apache.geode.internal.cache.tier.sockets.AcceptorImpl.lambda$initializeServerConnectionThreadPool$3(AcceptorImpl.java:690)
> at 
> org.apache.geode.logging.internal.executors.LoggingThreadFactory.lambda$newThread$0(LoggingThreadFactory.java:120)
> at java.base/java.lang.Thread.run(Thread.java:833)
> Caused by: 
> org.apache.geode.internal.cache.execute.InternalFunctionInvocationTargetException:
>  org.apache.geode.internal.cache.BucketNotFoundException: Unable to find 
> lucene ind
> ex because no longer primary for bucket 326
> at 
> org.apache.geode.internal.cache.partitioned.PRFunctionStreamingResultCollector.getResultInternal(PRFunctionStreamingResultCollector.java:125)
> ... 13 more
> {noformat}



--
This message was sent by Atlassian Jira
(v8.20.7#820007)


[jira] [Updated] (GEODE-10383) lucene may log many warnings about BucketNotFound during normal operations

2022-06-16 Thread Anilkumar Gingade (Jira)


 [ 
https://issues.apache.org/jira/browse/GEODE-10383?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Anilkumar Gingade updated GEODE-10383:
--
Affects Version/s: 1.16.0

> lucene may log many warnings about BucketNotFound during normal operations
> --
>
> Key: GEODE-10383
> URL: https://issues.apache.org/jira/browse/GEODE-10383
> Project: Geode
>  Issue Type: Bug
>  Components: lucene
>Affects Versions: 1.16.0
>Reporter: Darrel Schneider
>Priority: Major
>  Labels: needsTriage
>
> This seems much like GEODE-1557 but it may be this exception is coming from a 
> different place. Once again it involves a test that has primaries moving 
> around due to new servers starting up.
> I don't see any value in logging the stack trace in this case.
> If we are going to log about this should the message also include an 
> explanation as to why it can happen during normal operations?
> Also should it really be a warning since it can occur during normal 
> operations? It looks like the fix for GEODE-1557 made it "debug" level.
> I see a bunch of the following warnings logged:
> {noformat}
> .cache.lucene.internal.distributed.LuceneQueryFunction@14894ec8
> org.apache.geode.cache.execute.FunctionException: 
> org.apache.geode.internal.cache.execute.InternalFunctionInvocationTargetException:
>  org.apache.geode.internal.cache.BucketNotFo
> undException: Unable to find lucene index because no longer primary for 
> bucket 326
> at 
> org.apache.geode.internal.cache.partitioned.PRFunctionStreamingResultCollector.getResultInternal(PRFunctionStreamingResultCollector.java:126)
> at 
> org.apache.geode.internal.cache.execute.ResultCollectorHolder.getResult(ResultCollectorHolder.java:53)
> at 
> org.apache.geode.internal.cache.partitioned.PRFunctionStreamingResultCollector.getResult(PRFunctionStreamingResultCollector.java:87)
> at 
> org.apache.geode.internal.cache.tier.sockets.command.ExecuteRegionFunctionGeode18.executeFunctionWithResult(ExecuteRegionFunctionGeode18.java:50)
> at 
> org.apache.geode.internal.cache.tier.sockets.command.ExecuteRegionFunction66.cmdExecute(ExecuteRegionFunction66.java:203)
> at 
> org.apache.geode.internal.cache.tier.sockets.BaseCommand.execute(BaseCommand.java:187)
> at 
> org.apache.geode.internal.cache.tier.sockets.ServerConnection.doNormalMessage(ServerConnection.java:880)
> at 
> org.apache.geode.internal.cache.tier.sockets.ServerConnection.doOneMessage(ServerConnection.java:1074)
> at 
> org.apache.geode.internal.cache.tier.sockets.ServerConnection.run(ServerConnection.java:1356)
> at 
> java.base/java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1136)
> at 
> java.base/java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:635)
> at 
> org.apache.geode.internal.cache.tier.sockets.AcceptorImpl.lambda$initializeServerConnectionThreadPool$3(AcceptorImpl.java:690)
> at 
> org.apache.geode.logging.internal.executors.LoggingThreadFactory.lambda$newThread$0(LoggingThreadFactory.java:120)
> at java.base/java.lang.Thread.run(Thread.java:833)
> Caused by: 
> org.apache.geode.internal.cache.execute.InternalFunctionInvocationTargetException:
>  org.apache.geode.internal.cache.BucketNotFoundException: Unable to find 
> lucene ind
> ex because no longer primary for bucket 326
> at 
> org.apache.geode.internal.cache.partitioned.PRFunctionStreamingResultCollector.getResultInternal(PRFunctionStreamingResultCollector.java:125)
> ... 13 more
> {noformat}



--
This message was sent by Atlassian Jira
(v8.20.7#820007)


[jira] [Updated] (GEODE-10376) Client event processing waits for event registrations to finish.

2022-06-10 Thread Anilkumar Gingade (Jira)


 [ 
https://issues.apache.org/jira/browse/GEODE-10376?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Anilkumar Gingade updated GEODE-10376:
--
Labels:   (was: needsTriage)

> Client event processing waits for event registrations to finish.  
> --
>
> Key: GEODE-10376
> URL: https://issues.apache.org/jira/browse/GEODE-10376
> Project: Geode
>  Issue Type: Bug
>  Components: client/server
>Affects Versions: 1.16.0
>Reporter: Anilkumar Gingade
>Priority: Major
>
> Currently while processing the events from server the client event processing 
> (Cache Client Updater) thread will wait for event registrations to finish; 
> this will slowdown the event processing and may cause event build of on 
> server.
> The code in question:
> {code:java}
> In CacheClientUpdater.processMessage():
>   // Wait for the previously failed cache client updater
>   // to finish. This will avoid out of order messages.
>   waitForFailedUpdater();
>   cache.waitForRegisterInterestsInProgress();
> {code}
> We need to see if "waitForInterestRegistration" needs to be done if there is 
> no failure in updater thread.
>  



--
This message was sent by Atlassian Jira
(v8.20.7#820007)


[jira] [Updated] (GEODE-10376) Client event processing waits for event registrations to finish

2022-06-10 Thread Anilkumar Gingade (Jira)


 [ 
https://issues.apache.org/jira/browse/GEODE-10376?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Anilkumar Gingade updated GEODE-10376:
--
Summary: Client event processing waits for event registrations to finish  
(was: Client event processing waits for event registrations to finish.  )

> Client event processing waits for event registrations to finish
> ---
>
> Key: GEODE-10376
> URL: https://issues.apache.org/jira/browse/GEODE-10376
> Project: Geode
>  Issue Type: Bug
>  Components: client/server
>Affects Versions: 1.16.0
>Reporter: Anilkumar Gingade
>Priority: Major
>
> Currently while processing the events from server the client event processing 
> (Cache Client Updater) thread will wait for event registrations to finish; 
> this will slowdown the event processing and may cause event build of on 
> server.
> The code in question:
> {code:java}
> In CacheClientUpdater.processMessage():
>   // Wait for the previously failed cache client updater
>   // to finish. This will avoid out of order messages.
>   waitForFailedUpdater();
>   cache.waitForRegisterInterestsInProgress();
> {code}
> We need to see if "waitForInterestRegistration" needs to be done if there is 
> no failure in updater thread.
>  



--
This message was sent by Atlassian Jira
(v8.20.7#820007)


[jira] [Created] (GEODE-10376) Client event processing waits for event registrations to finish.

2022-06-10 Thread Anilkumar Gingade (Jira)
Anilkumar Gingade created GEODE-10376:
-

 Summary: Client event processing waits for event registrations to 
finish.  
 Key: GEODE-10376
 URL: https://issues.apache.org/jira/browse/GEODE-10376
 Project: Geode
  Issue Type: Bug
  Components: client/server
Affects Versions: 1.16.0
Reporter: Anilkumar Gingade


Currently while processing the events from server the client event processing 
(Cache Client Updater) thread will wait for event registrations to finish; this 
will slowdown the event processing and may cause event build of on server.
The code in question:

{code:java}
In CacheClientUpdater.processMessage():
  // Wait for the previously failed cache client updater
  // to finish. This will avoid out of order messages.
  waitForFailedUpdater();
  cache.waitForRegisterInterestsInProgress();
{code}

We need to see if "waitForInterestRegistration" needs to be done if there is no 
failure in updater thread.
 



--
This message was sent by Atlassian Jira
(v8.20.7#820007)


[jira] [Updated] (GEODE-10373) CI Failures: Windows geode-lucene:integrationTest hitting OOM cause CI test run to timeout

2022-06-10 Thread Anilkumar Gingade (Jira)


 [ 
https://issues.apache.org/jira/browse/GEODE-10373?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Anilkumar Gingade updated GEODE-10373:
--
Labels:   (was: needsTriage)

> CI Failures: Windows geode-lucene:integrationTest hitting OOM cause CI test 
> run to timeout
> --
>
> Key: GEODE-10373
> URL: https://issues.apache.org/jira/browse/GEODE-10373
> Project: Geode
>  Issue Type: Bug
>  Components: lucene
>Reporter: Eric Shu
>Assignee: Robert Houghton
>Priority: Major
>
> LuceneIndexCommandsWithReindexAllowedIntegrationTest > 
> whenLuceneReindexingInProgressThenListIndexCommandMustExecuteSuccessfully 
> FAILED
> java.lang.OutOfMemoryError: Java heap space



--
This message was sent by Atlassian Jira
(v8.20.7#820007)


[jira] [Resolved] (GEODE-10350) volunteerForPrimary should proceed when being elector or knowing elector

2022-06-08 Thread Anilkumar Gingade (Jira)


 [ 
https://issues.apache.org/jira/browse/GEODE-10350?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Anilkumar Gingade resolved GEODE-10350.
---
Resolution: Abandoned

The issue was created based on the assumption that the elector was not properly 
set/used. The planned change (PR in this ticket) is causing issue with Fixed 
partitioning. We need to understand more about the primary logic. Closing this 
issue for now.

> volunteerForPrimary should proceed when being elector or knowing elector
> 
>
> Key: GEODE-10350
> URL: https://issues.apache.org/jira/browse/GEODE-10350
> Project: Geode
>  Issue Type: Bug
>Reporter: Xiaojian Zhou
>Assignee: Joris Melchior
>Priority: Major
>  Labels: needsTriage, pull-request-available
>
> Inside BucketAdvisor.volunteerForPrimary(), when current member is elector or 
> has profile for elector, then current member should proceed to volunteer for 
> primary bucket. 
> Current logic happened to be opposite. It might be due to careless code. 



--
This message was sent by Atlassian Jira
(v8.20.7#820007)


[jira] [Updated] (GEODE-8977) Thread monitoring service should also show locked monitors and synchronizers

2022-06-03 Thread Anilkumar Gingade (Jira)


 [ 
https://issues.apache.org/jira/browse/GEODE-8977?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Anilkumar Gingade updated GEODE-8977:
-
Labels: GeodeOperationAPI blocks-1.15.0 pull-request-available  (was: 
GeodeOperationAPI pull-request-available)

> Thread monitoring service should also show locked monitors and synchronizers
> 
>
> Key: GEODE-8977
> URL: https://issues.apache.org/jira/browse/GEODE-8977
> Project: Geode
>  Issue Type: Improvement
>  Components: core
>Affects Versions: 1.15.0
>Reporter: Darrel Schneider
>Assignee: Darrel Schneider
>Priority: Major
>  Labels: GeodeOperationAPI, blocks-1.15.0, pull-request-available
> Fix For: 1.15.0
>
>
> The thread monitoring service shows the call stack of a hung thread but it 
> does not show the synchronizations obtained by the frames in the call stack 
> like a normal stack dump does.
> It looks like this is available from the ThreadInfo class that the service is 
> already using by calling getLockedMonitors and getLockedSynchronizers. The 
> getLockedMonitors returns a MonitorInfo which has information in it about 
> which frame of the stack obtained it. MonitorInfo subclasses LockInfo which 
> is what getLockedSynchronizers returns so it is possible that 
> getLockedSynchronizers does not provide any additional information to be 
> logged.



--
This message was sent by Atlassian Jira
(v8.20.7#820007)


[jira] [Updated] (GEODE-8977) Thread monitoring service should also show locked monitors and synchronizers

2022-06-03 Thread Anilkumar Gingade (Jira)


 [ 
https://issues.apache.org/jira/browse/GEODE-8977?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Anilkumar Gingade updated GEODE-8977:
-
Affects Version/s: 1.15.0

> Thread monitoring service should also show locked monitors and synchronizers
> 
>
> Key: GEODE-8977
> URL: https://issues.apache.org/jira/browse/GEODE-8977
> Project: Geode
>  Issue Type: Improvement
>  Components: core
>Affects Versions: 1.15.0
>Reporter: Darrel Schneider
>Assignee: Darrel Schneider
>Priority: Major
>  Labels: GeodeOperationAPI, pull-request-available
> Fix For: 1.15.0
>
>
> The thread monitoring service shows the call stack of a hung thread but it 
> does not show the synchronizations obtained by the frames in the call stack 
> like a normal stack dump does.
> It looks like this is available from the ThreadInfo class that the service is 
> already using by calling getLockedMonitors and getLockedSynchronizers. The 
> getLockedMonitors returns a MonitorInfo which has information in it about 
> which frame of the stack obtained it. MonitorInfo subclasses LockInfo which 
> is what getLockedSynchronizers returns so it is possible that 
> getLockedSynchronizers does not provide any additional information to be 
> logged.



--
This message was sent by Atlassian Jira
(v8.20.7#820007)


[jira] [Updated] (GEODE-10353) Change IndexThresholdSize default value

2022-06-02 Thread Anilkumar Gingade (Jira)


 [ 
https://issues.apache.org/jira/browse/GEODE-10353?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Anilkumar Gingade updated GEODE-10353:
--
Labels: pull-request-available  (was: needsTriage pull-request-available)

> Change IndexThresholdSize default value
> ---
>
> Key: GEODE-10353
> URL: https://issues.apache.org/jira/browse/GEODE-10353
> Project: Geode
>  Issue Type: Bug
>  Components: querying
>Reporter: Mario Kevo
>Assignee: Mario Kevo
>Priority: Major
>  Labels: pull-request-available
>
> When doing range queries with the wildcard character(%), there are no result.
> The problem is that when we are doing a range query like this:
> {code:java}
> query --query="SELECT  e.key, e.value from /example-region.entrySet e 
> where e.value.positions['SUN'] LIKE 'abc%'"
> {code}
> The printed results will be null.
> The comparison will be divided into two comparison >='abc' and <'abd'.
> First it checks all entries that are lower than 'abd' and store them in the 
> intermediate results. There are all entries which attribute 'SUN' is lower 
> than 'abd' which can be a very high number. It iterates through all entries 
> and store first 100 entries in the intermediate results. The limit is 100, it 
> can be changed if an user sets the indexThresholdSize to higher value when 
> starting servers, but in many cases the user couldn't know how many entries 
> will be in the region.
> So the best way is to set this indexThresholdSize to Integer.MAX_VALUE by 
> default so the query will always give the correct results.
> The limit which is set in query is not the same as this limit, so with this 
> change it will still put limit on printing results.



--
This message was sent by Atlassian Jira
(v8.20.7#820007)


[jira] [Updated] (GEODE-10352) Update Dockerfile to use Ruby >= 2.6 in the tool to preview Geode documentation

2022-06-02 Thread Anilkumar Gingade (Jira)


 [ 
https://issues.apache.org/jira/browse/GEODE-10352?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Anilkumar Gingade updated GEODE-10352:
--
Labels: pull-request-available  (was: needsTriage pull-request-available)

> Update Dockerfile to use Ruby >= 2.6 in the tool to preview Geode 
> documentation
> ---
>
> Key: GEODE-10352
> URL: https://issues.apache.org/jira/browse/GEODE-10352
> Project: Geode
>  Issue Type: Bug
>  Components: docs
>Reporter: Alberto Gomez
>Assignee: Alberto Gomez
>Priority: Major
>  Labels: pull-request-available
>
> The script to preview the documentation of Geode (./preview-user-guide.sh) is 
> not working anymore.
> The following error appears while running it (unless you have a previously 
> downloaded geodedocs/temp docker image in your local docker repo):
> ERROR:  Error installing elasticsearch:
>   The last version of faraday (>= 0) to support your Ruby & RubyGems was 
> 1.10.0. Try installing it with `gem install faraday -v 1.10.0` and then 
> running the current command again
>   faraday requires Ruby version >= 2.6. The current ruby version is 
> 2.5.9.229.
> That error prevents the preview script to work.
> It is needed to update the docker image used to preview the documentation to 
> use a Ruby version >= 2.6



--
This message was sent by Atlassian Jira
(v8.20.7#820007)


[jira] [Assigned] (GEODE-10351) [CI failure] ModifyColocationIntegrationTest > testModifyColocation

2022-06-02 Thread Anilkumar Gingade (Jira)


 [ 
https://issues.apache.org/jira/browse/GEODE-10351?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Anilkumar Gingade reassigned GEODE-10351:
-

Assignee: Jinmei Liao

> [CI failure] ModifyColocationIntegrationTest > testModifyColocation
> ---
>
> Key: GEODE-10351
> URL: https://issues.apache.org/jira/browse/GEODE-10351
> Project: Geode
>  Issue Type: Bug
>  Components: persistence
>Affects Versions: 1.15.0
>Reporter: Owen Nichols
>Assignee: Jinmei Liao
>Priority: Major
>  Labels: needsTriage
>
> {noformat}
> ModifyColocationIntegrationTest > testModifyColocation FAILED
> java.lang.AssertionError: 
> Expecting actual throwable to be an instance of:
>   java.lang.IllegalStateException
> but was:
>   org.apache.geode.cache.CacheClosedException: For DiskStore: disk: A 
> DiskAccessException has occurred while writing to the disk for region 
> /region3. The region will be closed., caused by 
> org.apache.geode.cache.DiskAccessException: For DiskStore: disk: A 
> DiskAccessException has occurred while writing to the disk for region 
> /region3. The region will be closed.
>   at 
> org.apache.geode.internal.cache.GemFireCacheImpl$Stopper.generateCancelledException(GemFireCacheImpl.java:5221)
>   at 
> org.apache.geode.CancelCriterion.checkCancelInProgress(CancelCriterion.java:83)
>   at 
> org.apache.geode.internal.cache.GemFireCacheImpl.createVMRegion(GemFireCacheImpl.java:3123)
> {noformat}



--
This message was sent by Atlassian Jira
(v8.20.7#820007)


[jira] [Assigned] (GEODE-10350) volunteerForPrimary should proceed when being elector or knowing elector

2022-06-02 Thread Anilkumar Gingade (Jira)


 [ 
https://issues.apache.org/jira/browse/GEODE-10350?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Anilkumar Gingade reassigned GEODE-10350:
-

Assignee: Joris Melchior

> volunteerForPrimary should proceed when being elector or knowing elector
> 
>
> Key: GEODE-10350
> URL: https://issues.apache.org/jira/browse/GEODE-10350
> Project: Geode
>  Issue Type: Bug
>Reporter: Xiaojian Zhou
>Assignee: Joris Melchior
>Priority: Major
>  Labels: needsTriage
>
> Inside BucketAdvisor.volunteerForPrimary(), when current member is elector or 
> has profile for elector, then current member should proceed to volunteer for 
> primary bucket. 
> Current logic happened to be opposite. It might be due to careless code. 



--
This message was sent by Atlassian Jira
(v8.20.7#820007)


[jira] [Updated] (GEODE-10349) Client connects to a new server socket should check its availability

2022-06-02 Thread Anilkumar Gingade (Jira)


 [ 
https://issues.apache.org/jira/browse/GEODE-10349?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Anilkumar Gingade updated GEODE-10349:
--
Labels: pull-request-available  (was: needsTriage pull-request-available)

> Client connects to a new server socket should check its availability 
> -
>
> Key: GEODE-10349
> URL: https://issues.apache.org/jira/browse/GEODE-10349
> Project: Geode
>  Issue Type: Bug
>  Components: client/server
>Reporter: Xiaojian Zhou
>Assignee: Xiaojian Zhou
>Priority: Major
>  Labels: pull-request-available
>
> When client connects to a new server (or restarted server), it might 
> encounter SocketTimeoutException which could take 59 seconds. 
> To speed up, the idea is to check if the server address's is reachable with 
> smaller timeout. 
> If the server's address is not reachable, no need to block waiting for 59 
> seconds (DEFAULT_SOCKET_CONNECT_TIMEOUT). 



--
This message was sent by Atlassian Jira
(v8.20.7#820007)


[jira] [Assigned] (GEODE-10349) Client connects to a new server socket should check its availability

2022-06-02 Thread Anilkumar Gingade (Jira)


 [ 
https://issues.apache.org/jira/browse/GEODE-10349?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Anilkumar Gingade reassigned GEODE-10349:
-

Assignee: Xiaojian Zhou

> Client connects to a new server socket should check its availability 
> -
>
> Key: GEODE-10349
> URL: https://issues.apache.org/jira/browse/GEODE-10349
> Project: Geode
>  Issue Type: Bug
>  Components: client/server
>Reporter: Xiaojian Zhou
>Assignee: Xiaojian Zhou
>Priority: Major
>  Labels: needsTriage, pull-request-available
>
> When client connects to a new server (or restarted server), it might 
> encounter SocketTimeoutException which could take 59 seconds. 
> To speed up, the idea is to check if the server address's is reachable with 
> smaller timeout. 
> If the server's address is not reachable, no need to block waiting for 59 
> seconds (DEFAULT_SOCKET_CONNECT_TIMEOUT). 



--
This message was sent by Atlassian Jira
(v8.20.7#820007)


[jira] [Assigned] (GEODE-10106) CI Failure: CacheClientNotifierDUnitTest > testNormalClient2MultipleCacheServer

2022-05-23 Thread Anilkumar Gingade (Jira)


 [ 
https://issues.apache.org/jira/browse/GEODE-10106?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Anilkumar Gingade reassigned GEODE-10106:
-

Assignee: (was: Hale Bales)

> CI Failure: CacheClientNotifierDUnitTest > 
> testNormalClient2MultipleCacheServer
> ---
>
> Key: GEODE-10106
> URL: https://issues.apache.org/jira/browse/GEODE-10106
> Project: Geode
>  Issue Type: Bug
>  Components: wan
>Affects Versions: 1.15.0
>Reporter: Jens Deppe
>Priority: Major
>  Labels: blocks-1.15.0, pull-request-available
> Fix For: 1.15.0
>
>
> [https://concourse.apachegeode-ci.info/teams/main/pipelines/apache-develop-mass-test-run/jobs/distributed-test-openjdk8/builds/1382]
> {noformat}
> CacheClientNotifierDUnitTest > testNormalClient2MultipleCacheServer FAILED
> 11:49:39java.lang.AssertionError: Suspicious strings were written to the 
> log during this run.
> 11:49:39Fix the strings or use IgnoredException.addIgnoredException to 
> ignore.
> 11:49:39
> ---
> 11:49:39Found suspect string in 'dunit_suspect-vm4.log' at line 431
> 11:49:39
> 11:49:39[error 2022/03/05 19:49:36.075 UTC 
>  tid=55] Error in 
> redundancy satisfier
> 11:49:39java.lang.NullPointerException
> 11:49:39  at 
> org.apache.geode.cache.client.internal.QueueManagerImpl.recoverPrimary(QueueManagerImpl.java:856)
> 11:49:39  at 
> org.apache.geode.cache.client.internal.QueueManagerImpl$RedundancySatisfierTask.run2(QueueManagerImpl.java:1454)
> 11:49:39  at 
> org.apache.geode.cache.client.internal.PoolImpl$PoolTask.run(PoolImpl.java:1340)
> 11:49:39  at 
> java.util.concurrent.Executors$RunnableAdapter.call(Executors.java:511)
> 11:49:39  at java.util.concurrent.FutureTask.run(FutureTask.java:266)
> 11:49:39  at 
> java.util.concurrent.ScheduledThreadPoolExecutor$ScheduledFutureTask.access$201(ScheduledThreadPoolExecutor.java:180)
> 11:49:39  at 
> java.util.concurrent.ScheduledThreadPoolExecutor$ScheduledFutureTask.run(ScheduledThreadPoolExecutor.java:293)
> 11:49:39  at 
> java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1149)
> 11:49:39  at 
> java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:624)
> 11:49:39  at java.lang.Thread.run(Thread.java:750)
> 11:49:39at org.junit.Assert.fail(Assert.java:89)
> 11:49:39at 
> org.apache.geode.test.dunit.internal.DUnitLauncher.closeAndCheckForSuspects(DUnitLauncher.java:422)
> 11:49:39at 
> org.apache.geode.test.dunit.internal.DUnitLauncher.closeAndCheckForSuspects(DUnitLauncher.java:438)
> 11:49:39at 
> org.apache.geode.test.dunit.internal.JUnit4DistributedTestCase.cleanupAllVms(JUnit4DistributedTestCase.java:551)
> 11:49:39at 
> org.apache.geode.test.dunit.internal.JUnit4DistributedTestCase.doTearDownDistributedTestCase(JUnit4DistributedTestCase.java:498)
> 11:49:39at 
> org.apache.geode.test.dunit.internal.JUnit4DistributedTestCase.tearDownDistributedTestCase(JUnit4DistributedTestCase.java:481)
> 11:49:39at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
> 11:49:39at 
> sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:62)
> 11:49:39at 
> sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:43)
> 11:49:39at java.lang.reflect.Method.invoke(Method.java:498)
> 11:49:39at 
> org.junit.runners.model.FrameworkMethod$1.runReflectiveCall(FrameworkMethod.java:59)
> 11:49:39at 
> org.junit.internal.runners.model.ReflectiveCallable.run(ReflectiveCallable.java:12)
> 11:49:39at 
> org.junit.runners.model.FrameworkMethod.invokeExplosively(FrameworkMethod.java:56)
> 11:49:39at 
> org.junit.internal.runners.statements.RunAfters.invokeMethod(RunAfters.java:46)
> 11:49:39at 
> org.junit.internal.runners.statements.RunAfters.evaluate(RunAfters.java:33)
> 11:49:39at org.junit.rules.TestWatcher$1.evaluate(TestWatcher.java:61)
> 11:49:39at 
> org.junit.runners.ParentRunner$3.evaluate(ParentRunner.java:306)
> 11:49:39at 
> org.junit.runners.BlockJUnit4ClassRunner$1.evaluate(BlockJUnit4ClassRunner.java:100)
> 11:49:39at 
> org.junit.runners.ParentRunner.runLeaf(ParentRunner.java:366)
> 11:49:39at 
> org.junit.runners.BlockJUnit4ClassRunner.runChild(BlockJUnit4ClassRunner.java:103)
> 11:49:39at 
> org.junit.runners.BlockJUnit4ClassRunner.runChild(BlockJUnit4ClassRunner.java:63)
> 11:49:39at org.junit.runners.ParentRunner$4.run(ParentRunner.java:331)
> 11:49:39at 
> org.junit.runners.ParentRunner$1.schedule(ParentRunner.java:79)
> 11:49:39at 

[jira] [Updated] (GEODE-10279) Need to lock RVV and flush before backup

2022-05-09 Thread Anilkumar Gingade (Jira)


 [ 
https://issues.apache.org/jira/browse/GEODE-10279?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Anilkumar Gingade updated GEODE-10279:
--
Labels: pull-request-available  (was: needsTriage pull-request-available)

> Need to lock RVV and flush before backup
> 
>
> Key: GEODE-10279
> URL: https://issues.apache.org/jira/browse/GEODE-10279
> Project: Geode
>  Issue Type: Bug
>Reporter: Xiaojian Zhou
>Assignee: Xiaojian Zhou
>Priority: Major
>  Labels: pull-request-available
>
> When using async disk writer, in memory RVV has contained all the operations 
> in async queue. The items in the async queue might not have completely 
> flushed to disk. So RVV mismatch with the entries' status.
> When restored and GII, since RVVs are the same, no GII will be triggered. 
> Thus the data mismatched in different members.
> To fix it, introduce a step to lock rvvs for all the regions of all the 
> diskstores that will be backup.



--
This message was sent by Atlassian Jira
(v8.20.7#820007)


[jira] [Updated] (GEODE-10250) The LockGrantor can grant a lock to a member that has left the distributed system

2022-05-05 Thread Anilkumar Gingade (Jira)


 [ 
https://issues.apache.org/jira/browse/GEODE-10250?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Anilkumar Gingade updated GEODE-10250:
--
Labels: GeodeOperationAPI pull-request-available  (was: 
pull-request-available)

> The LockGrantor can grant a lock to a member that has left the distributed 
> system
> -
>
> Key: GEODE-10250
> URL: https://issues.apache.org/jira/browse/GEODE-10250
> Project: Geode
>  Issue Type: Bug
>  Components: distributed lock service
>Reporter: Barrett Oglesby
>Assignee: Barrett Oglesby
>Priority: Major
>  Labels: GeodeOperationAPI, pull-request-available
>
> If a member requests a distributed lock and then leaves the distributed 
> system, the grantor may grant that request and leave itself in a state where 
> the lock has been granted but the member has left.
> Here are the steps:
>  # The lock requesting server requests a lock
>  # The grantor server is delayed in granting that lock
>  # The lock requesting server shutsdown in the meantime
>  # The grantor server finally grants the lock after it has released all locks 
> and pending requests for the lock requesting server
>  # The lock requesting server receives the lock response but drops it since 
> the thread pool has been already shutdown



--
This message was sent by Atlassian Jira
(v8.20.7#820007)


[jira] [Updated] (GEODE-10250) The LockGrantor can grant a lock to a member that has left the distributed system

2022-05-05 Thread Anilkumar Gingade (Jira)


 [ 
https://issues.apache.org/jira/browse/GEODE-10250?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Anilkumar Gingade updated GEODE-10250:
--
Labels: pull-request-available  (was: needsTriage pull-request-available)

> The LockGrantor can grant a lock to a member that has left the distributed 
> system
> -
>
> Key: GEODE-10250
> URL: https://issues.apache.org/jira/browse/GEODE-10250
> Project: Geode
>  Issue Type: Bug
>  Components: distributed lock service
>Reporter: Barrett Oglesby
>Assignee: Barrett Oglesby
>Priority: Major
>  Labels: pull-request-available
>
> If a member requests a distributed lock and then leaves the distributed 
> system, the grantor may grant that request and leave itself in a state where 
> the lock has been granted but the member has left.
> Here are the steps:
>  # The lock requesting server requests a lock
>  # The grantor server is delayed in granting that lock
>  # The lock requesting server shutsdown in the meantime
>  # The grantor server finally grants the lock after it has released all locks 
> and pending requests for the lock requesting server
>  # The lock requesting server receives the lock response but drops it since 
> the thread pool has been already shutdown



--
This message was sent by Atlassian Jira
(v8.20.7#820007)


[jira] [Comment Edited] (GEODE-9484) Data inconsistency in replicated region with 3 or more servers, and one server is down

2022-05-04 Thread Anilkumar Gingade (Jira)


[ 
https://issues.apache.org/jira/browse/GEODE-9484?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17531896#comment-17531896
 ] 

Anilkumar Gingade edited comment on GEODE-9484 at 5/4/22 6:58 PM:
--

[~mivanac]
Can you please revert these changes. Re-open the ticket, address the NPE with 
new additional tests. 
It will be very helpful; if you could revert this sooner; this could be masking 
other issues, that may not get surfaced. 
Or we may go ahead and revert these changes.


was (Author: agingade):
[~mivanac]
Can you please revert these changes. Re-open the ticket, address the NPE with 
new additional tests. 


> Data inconsistency in replicated region with 3 or more servers, and one 
> server is down 
> ---
>
> Key: GEODE-9484
> URL: https://issues.apache.org/jira/browse/GEODE-9484
> Project: Geode
>  Issue Type: Improvement
>  Components: client/server, regions
>Affects Versions: 1.13.0
>Reporter: Mario Ivanac
>Assignee: Mario Ivanac
>Priority: Major
>  Labels: pull-request-available
> Fix For: 1.15.0
>
>
> We have configured replicated region with 3 or more servers, and client is 
> configured with read timeout set to value same or smaller than member timeout.
> In case while client is putting data in region,  one of replicated servers is 
> shutdown, it is observed that we have data inconsistency.
>  
> We see that data part of data is written in server connected with client, but 
> in remaining replicated servers it is missing.



--
This message was sent by Atlassian Jira
(v8.20.7#820007)


[jira] [Comment Edited] (GEODE-9484) Data inconsistency in replicated region with 3 or more servers, and one server is down

2022-05-04 Thread Anilkumar Gingade (Jira)


[ 
https://issues.apache.org/jira/browse/GEODE-9484?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17531896#comment-17531896
 ] 

Anilkumar Gingade edited comment on GEODE-9484 at 5/4/22 6:56 PM:
--

[~mivanac]
Can you please revert these changes. Re-open the ticket, address the NPE with 
new additional tests. 



was (Author: agingade):
[~mivanac]
Can you please revert these changes. Re-open the ticket, address the NPE with 
new additional tests. 


> Data inconsistency in replicated region with 3 or more servers, and one 
> server is down 
> ---
>
> Key: GEODE-9484
> URL: https://issues.apache.org/jira/browse/GEODE-9484
> Project: Geode
>  Issue Type: Improvement
>  Components: client/server, regions
>Affects Versions: 1.13.0
>Reporter: Mario Ivanac
>Assignee: Mario Ivanac
>Priority: Major
>  Labels: pull-request-available
> Fix For: 1.15.0
>
>
> We have configured replicated region with 3 or more servers, and client is 
> configured with read timeout set to value same or smaller than member timeout.
> In case while client is putting data in region,  one of replicated servers is 
> shutdown, it is observed that we have data inconsistency.
>  
> We see that data part of data is written in server connected with client, but 
> in remaining replicated servers it is missing.



--
This message was sent by Atlassian Jira
(v8.20.7#820007)


[jira] [Commented] (GEODE-9484) Data inconsistency in replicated region with 3 or more servers, and one server is down

2022-05-04 Thread Anilkumar Gingade (Jira)


[ 
https://issues.apache.org/jira/browse/GEODE-9484?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17531896#comment-17531896
 ] 

Anilkumar Gingade commented on GEODE-9484:
--

[~mivanac]
Can you please revert these changes. Re-open the ticket, address the NPE with 
new additional tests. 


> Data inconsistency in replicated region with 3 or more servers, and one 
> server is down 
> ---
>
> Key: GEODE-9484
> URL: https://issues.apache.org/jira/browse/GEODE-9484
> Project: Geode
>  Issue Type: Improvement
>  Components: client/server, regions
>Affects Versions: 1.13.0
>Reporter: Mario Ivanac
>Assignee: Mario Ivanac
>Priority: Major
>  Labels: pull-request-available
> Fix For: 1.15.0
>
>
> We have configured replicated region with 3 or more servers, and client is 
> configured with read timeout set to value same or smaller than member timeout.
> In case while client is putting data in region,  one of replicated servers is 
> shutdown, it is observed that we have data inconsistency.
>  
> We see that data part of data is written in server connected with client, but 
> in remaining replicated servers it is missing.



--
This message was sent by Atlassian Jira
(v8.20.7#820007)


[jira] [Commented] (GEODE-9484) Data inconsistency in replicated region with 3 or more servers, and one server is down

2022-05-04 Thread Anilkumar Gingade (Jira)


[ 
https://issues.apache.org/jira/browse/GEODE-9484?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17531895#comment-17531895
 ] 

Anilkumar Gingade commented on GEODE-9484:
--

[~mivanac]
The PR/Fix for this issue is showing up NPE in internal tests. Here is the 
stack trace:
 java.lang.NullPointerException

        at 
org.apache.geode.internal.tcp.TCPConduit.getFirstScanForConnection(TCPConduit.java:958)

        at 
org.apache.geode.distributed.internal.direct.DirectChannel.getConnections(DirectChannel.java:477)

        at 
org.apache.geode.distributed.internal.direct.DirectChannel.sendToMany(DirectChannel.java:277)

        at 
org.apache.geode.distributed.internal.direct.DirectChannel.send(DirectChannel.java:543)

        at 
org.apache.geode.distributed.internal.DistributionImpl.directChannelSend(DistributionImpl.java:348)

        at 
org.apache.geode.distributed.internal.DistributionImpl.send(DistributionImpl.java:293)

        at 
org.apache.geode.distributed.internal.ClusterDistributionManager.sendViaMembershipManager(ClusterDistributionManager.java:2067)

        at 
org.apache.geode.distributed.internal.ClusterDistributionManager.sendOutgoing(ClusterDistributionManager.java:1994)

        at 
org.apache.geode.distributed.internal.ClusterDistributionManager.sendMessage(ClusterDistributionManager.java:2031)

        at 
org.apache.geode.distributed.internal.ClusterDistributionManager.putOutgoing(ClusterDistributionManager.java:1088)

        at 
org.apache.geode.internal.cache.CreateRegionProcessor.initializeRegion(CreateRegionProcessor.java:115)

        at 
org.apache.geode.internal.cache.DistributedRegion.getInitialImageAndRecovery(DistributedRegion.java:1176)

        at 
org.apache.geode.internal.cache.DistributedRegion.initialize(DistributedRegion.java:1107)

        at 
org.apache.geode.internal.cache.HARegion.initialize(HARegion.java:323)

        at 
org.apache.geode.internal.cache.GemFireCacheImpl.createVMRegion(GemFireCacheImpl.java:3103)

        at 
org.apache.geode.internal.cache.HARegion.getInstance(HARegion.java:246)

        at 
org.apache.geode.internal.cache.ha.HARegionQueue.(HARegionQueue.java:365)

        at 
org.apache.geode.internal.cache.ha.HARegionQueue$BlockingHARegionQueue.(HARegionQueue.java:2233)

        at 
org.apache.geode.internal.cache.ha.HARegionQueue$DurableHARegionQueue.(HARegionQueue.java:2478)

        at 
org.apache.geode.internal.cache.ha.HARegionQueue.getHARegionQueueInstance(HARegionQueue.java:2015)

        at 
org.apache.geode.internal.cache.tier.sockets.MessageDispatcher.getMessageQueue(MessageDispatcher.java:166)

        at 
org.apache.geode.internal.cache.tier.sockets.MessageDispatcher.(MessageDispatcher.java:146)

        at 
org.apache.geode.internal.cache.tier.sockets.CacheClientProxy.createMessageDispatcher(CacheClientProxy.java:1685)

        at 
org.apache.geode.internal.cache.tier.sockets.CacheClientProxy.initializeMessageDispatcher(CacheClientProxy.java:1677)

        at 
org.apache.geode.internal.cache.tier.sockets.CacheClientNotifier.initializeProxy(CacheClientNotifier.java:502)

        at 
org.apache.geode.internal.cache.tier.sockets.CacheClientNotifier.registerClientInternal(CacheClientNotifier.java:406)

        at 
org.apache.geode.internal.cache.tier.sockets.CacheClientNotifier.registerClient(CacheClientNotifier.java:221)

        at 
org.apache.geode.internal.cache.tier.sockets.AcceptorImpl$ClientQueueInitializerTask.run(AcceptorImpl.java:1896)

        at 
java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1149)

        at 
java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:624)

        at 
org.apache.geode.internal.cache.tier.sockets.AcceptorImpl.lambda$initializeClientQueueInitializerThreadPool$1(AcceptorImpl.java:678)

        at 
org.apache.geode.logging.internal.executors.LoggingThreadFactory.lambda$newThread$0(LoggingThreadFactory.java:120)

        at java.lang.Thread.run(Thread.java:750)


> Data inconsistency in replicated region with 3 or more servers, and one 
> server is down 
> ---
>
> Key: GEODE-9484
> URL: https://issues.apache.org/jira/browse/GEODE-9484
> Project: Geode
>  Issue Type: Improvement
>  Components: client/server, regions
>Affects Versions: 1.13.0
>Reporter: Mario Ivanac
>Assignee: Mario Ivanac
>Priority: Major
>  Labels: pull-request-available
> Fix For: 1.15.0
>
>
> We have configured replicated region with 3 or more servers, and client is 
> configured with read timeout set to value same or smaller than member timeout.
> In case while client is putting data in region,  one of replicated servers is 
> shutdown, it is observed that we have data inconsistency.
>  
> We see that 

[jira] [Updated] (GEODE-10247) CI: spring-compatibility-test compile failure

2022-04-19 Thread Anilkumar Gingade (Jira)


 [ 
https://issues.apache.org/jira/browse/GEODE-10247?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Anilkumar Gingade updated GEODE-10247:
--
Labels: needsTriage redis-triage  (was: needsTriage)

> CI: spring-compatibility-test compile failure
> -
>
> Key: GEODE-10247
> URL: https://issues.apache.org/jira/browse/GEODE-10247
> Project: Geode
>  Issue Type: Bug
>  Components: redis
>Reporter: Xiaojian Zhou
>Priority: Major
>  Labels: needsTriage, redis-triage
>
> It's found in 
> https://hydradb.hdb.gemfire-ci.info/hdb/testresult/14671910
> https://hydradb.hdb.gemfire-ci.info/hdb/testresult/14643293
> https://hydradb.hdb.gemfire-ci.info/hdb/testresult/14643323
> ERROR] Failed to execute goal 
> org.apache.maven.plugins:maven-compiler-plugin:3.8.1:compile (java-compile) 
> on project spring-data-geode: Compilation failure: Compilation failure: 
> [ERROR] 
> /home/geode/spring-data-geode/spring-data-geode/src/main/java/org/springframework/data/gemfire/GemFireProperties.java:[103,43]
>  error: cannot find symbol
> [ERROR]   symbol:   variable GEODE_FOR_REDIS_BIND_ADDRESS
> [ERROR]   location: interface ConfigurationProperties
> [ERROR] 
> /home/geode/spring-data-geode/spring-data-geode/src/main/java/org/springframework/data/gemfire/GemFireProperties.java:[104,38]
>  error: cannot find symbol
> [ERROR]   symbol:   variable GEODE_FOR_REDIS_ENABLED
> [ERROR]   location: interface ConfigurationProperties
> [ERROR] 
> /home/geode/spring-data-geode/spring-data-geode/src/main/java/org/springframework/data/gemfire/GemFireProperties.java:[105,35]
>  error: cannot find symbol
> [ERROR]   symbol:   variable GEODE_FOR_REDIS_PORT
> [ERROR]   location: interface ConfigurationProperties
> [ERROR] 
> /home/geode/spring-data-geode/spring-data-geode/src/main/java/org/springframework/data/gemfire/GemFireProperties.java:[106,47]
>  error: cannot find symbol
> [ERROR]   symbol:   variable GEODE_FOR_REDIS_REDUNDANT_COPIES
> [ERROR]   location: interface ConfigurationProperties
> [ERROR] 
> /home/geode/spring-data-geode/spring-data-geode/src/main/java/org/springframework/data/gemfire/GemFireProperties.java:[107,39]
>  error: cannot find symbol
> [ERROR]   symbol:   variable GEODE_FOR_REDIS_USERNAME
> [ERROR]   location: interface ConfigurationProperties



--
This message was sent by Atlassian Jira
(v8.20.7#820007)


[jira] [Resolved] (GEODE-10059) CI: WANRollingUpgradeNewSenderProcessOldEvent > oldEventShouldBeProcessedAtNewSender[from_v1.13.7] FAILED

2022-04-12 Thread Anilkumar Gingade (Jira)


 [ 
https://issues.apache.org/jira/browse/GEODE-10059?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Anilkumar Gingade resolved GEODE-10059.
---
Resolution: Cannot Reproduce

The issue did not reproduce in successive runs.

> CI: WANRollingUpgradeNewSenderProcessOldEvent > 
> oldEventShouldBeProcessedAtNewSender[from_v1.13.7] FAILED
> -
>
> Key: GEODE-10059
> URL: https://issues.apache.org/jira/browse/GEODE-10059
> Project: Geode
>  Issue Type: Bug
>Affects Versions: 1.13.8
>Reporter: Kristen
>Priority: Major
>  Labels: needsTriage
>
>  
> {code:java}
> > Task :geode-wan:upgradeTest
> org.apache.geode.cache.wan.WANRollingUpgradeNewSenderProcessOldEvent > 
> bothOldAndNewEventsShouldBeProcessedByOldSender[from_v1.4.0] FAILED
> org.apache.geode.test.dunit.RMIException: While invoking 
> org.apache.geode.test.dunit.IgnoredException$1.run in VM 5 running on Host 
> 64b741e8194f with 7 VMs with version 1.3.0
> Caused by:
> java.lang.IllegalStateException: VM not available: VM 5 running on 
> Host 64b741e8194f with 7 VMs with version 1.3.0
> org.apache.geode.cache.wan.WANRollingUpgradeNewSenderProcessOldEvent > 
> oldEventShouldBeProcessedAtTwoNewSender[from_v1.4.0] FAILED
> org.apache.geode.test.dunit.RMIException: While invoking 
> org.apache.geode.test.dunit.IgnoredException$1.run in VM 5 running on Host 
> 64b741e8194f with 7 VMs with version 1.3.0
> Caused by:
> java.lang.IllegalStateException: VM not available: VM 5 running on 
> Host 64b741e8194f with 7 VMs with version 1.3.0
> org.apache.geode.cache.wan.WANRollingUpgradeNewSenderProcessOldEvent > 
> oldEventShouldBeProcessedAtNewSender[from_v1.4.0] FAILED
> org.apache.geode.test.dunit.RMIException: While invoking 
> org.apache.geode.test.dunit.IgnoredException$1.run in VM 5 running on Host 
> 64b741e8194f with 7 VMs with version 1.3.0
> Caused by:
> java.lang.IllegalStateException: VM not available: VM 5 running on 
> Host 64b741e8194f with 7 VMs with version 1.3.0
> org.apache.geode.cache.wan.WANRollingUpgradeNewSenderProcessOldEvent > 
> bothOldAndNewEventsShouldBeProcessedByOldSender[from_v1.5.0] FAILED
> org.apache.geode.test.dunit.RMIException: While invoking 
> org.apache.geode.test.dunit.IgnoredException$1.run in VM 5 running on Host 
> 64b741e8194f with 7 VMs with version 1.3.0
> Caused by:
> java.lang.IllegalStateException: VM not available: VM 5 running on 
> Host 64b741e8194f with 7 VMs with version 1.3.0
> org.apache.geode.cache.wan.WANRollingUpgradeNewSenderProcessOldEvent > 
> oldEventShouldBeProcessedAtTwoNewSender[from_v1.5.0] FAILED
> org.apache.geode.test.dunit.RMIException: While invoking 
> org.apache.geode.test.dunit.IgnoredException$1.run in VM 5 running on Host 
> 64b741e8194f with 7 VMs with version 1.3.0
> Caused by:
> java.lang.IllegalStateException: VM not available: VM 5 running on 
> Host 64b741e8194f with 7 VMs with version 1.3.0
> org.apache.geode.cache.wan.WANRollingUpgradeNewSenderProcessOldEvent > 
> oldEventShouldBeProcessedAtNewSender[from_v1.5.0] FAILED
> org.apache.geode.test.dunit.RMIException: While invoking 
> org.apache.geode.test.dunit.IgnoredException$1.run in VM 5 running on Host 
> 64b741e8194f with 7 VMs with version 1.3.0
> Caused by:
> java.lang.IllegalStateException: VM not available: VM 5 running on 
> Host 64b741e8194f with 7 VMs with version 1.3.0
> org.apache.geode.cache.wan.WANRollingUpgradeNewSenderProcessOldEvent > 
> bothOldAndNewEventsShouldBeProcessedByOldSender[from_v1.6.0] FAILED
> org.apache.geode.test.dunit.RMIException: While invoking 
> org.apache.geode.test.dunit.IgnoredException$1.run in VM 5 running on Host 
> 64b741e8194f with 7 VMs with version 1.3.0
> Caused by:
> java.lang.IllegalStateException: VM not available: VM 5 running on 
> Host 64b741e8194f with 7 VMs with version 1.3.0
> org.apache.geode.cache.wan.WANRollingUpgradeNewSenderProcessOldEvent > 
> oldEventShouldBeProcessedAtTwoNewSender[from_v1.6.0] FAILED
> org.apache.geode.test.dunit.RMIException: While invoking 
> org.apache.geode.test.dunit.IgnoredException$1.run in VM 5 running on Host 
> 64b741e8194f with 7 VMs with version 1.3.0
> Caused by:
> java.lang.IllegalStateException: VM not available: VM 5 running on 
> Host 64b741e8194f with 7 VMs with version 1.3.0
> org.apache.geode.cache.wan.WANRollingUpgradeNewSenderProcessOldEvent > 
> oldEventShouldBeProcessedAtNewSender[from_v1.6.0] FAILED
> org.apache.geode.test.dunit.RMIException: While invoking 
> org.apache.geode.test.dunit.IgnoredException$1.run in VM 5 running on Host 
> 64b741e8194f with 7 VMs with version 1.3.0
> Caused by:
>    

[jira] [Commented] (GEODE-10059) CI: WANRollingUpgradeNewSenderProcessOldEvent > oldEventShouldBeProcessedAtNewSender[from_v1.13.7] FAILED

2022-04-12 Thread Anilkumar Gingade (Jira)


[ 
https://issues.apache.org/jira/browse/GEODE-10059?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17521297#comment-17521297
 ] 

Anilkumar Gingade commented on GEODE-10059:
---

Sounds related to environment issue. The test has passed in successive runs. 
Closing this for now; we will re-open once this surfaces again.

> CI: WANRollingUpgradeNewSenderProcessOldEvent > 
> oldEventShouldBeProcessedAtNewSender[from_v1.13.7] FAILED
> -
>
> Key: GEODE-10059
> URL: https://issues.apache.org/jira/browse/GEODE-10059
> Project: Geode
>  Issue Type: Bug
>Affects Versions: 1.13.8
>Reporter: Kristen
>Priority: Major
>  Labels: needsTriage
>
>  
> {code:java}
> > Task :geode-wan:upgradeTest
> org.apache.geode.cache.wan.WANRollingUpgradeNewSenderProcessOldEvent > 
> bothOldAndNewEventsShouldBeProcessedByOldSender[from_v1.4.0] FAILED
> org.apache.geode.test.dunit.RMIException: While invoking 
> org.apache.geode.test.dunit.IgnoredException$1.run in VM 5 running on Host 
> 64b741e8194f with 7 VMs with version 1.3.0
> Caused by:
> java.lang.IllegalStateException: VM not available: VM 5 running on 
> Host 64b741e8194f with 7 VMs with version 1.3.0
> org.apache.geode.cache.wan.WANRollingUpgradeNewSenderProcessOldEvent > 
> oldEventShouldBeProcessedAtTwoNewSender[from_v1.4.0] FAILED
> org.apache.geode.test.dunit.RMIException: While invoking 
> org.apache.geode.test.dunit.IgnoredException$1.run in VM 5 running on Host 
> 64b741e8194f with 7 VMs with version 1.3.0
> Caused by:
> java.lang.IllegalStateException: VM not available: VM 5 running on 
> Host 64b741e8194f with 7 VMs with version 1.3.0
> org.apache.geode.cache.wan.WANRollingUpgradeNewSenderProcessOldEvent > 
> oldEventShouldBeProcessedAtNewSender[from_v1.4.0] FAILED
> org.apache.geode.test.dunit.RMIException: While invoking 
> org.apache.geode.test.dunit.IgnoredException$1.run in VM 5 running on Host 
> 64b741e8194f with 7 VMs with version 1.3.0
> Caused by:
> java.lang.IllegalStateException: VM not available: VM 5 running on 
> Host 64b741e8194f with 7 VMs with version 1.3.0
> org.apache.geode.cache.wan.WANRollingUpgradeNewSenderProcessOldEvent > 
> bothOldAndNewEventsShouldBeProcessedByOldSender[from_v1.5.0] FAILED
> org.apache.geode.test.dunit.RMIException: While invoking 
> org.apache.geode.test.dunit.IgnoredException$1.run in VM 5 running on Host 
> 64b741e8194f with 7 VMs with version 1.3.0
> Caused by:
> java.lang.IllegalStateException: VM not available: VM 5 running on 
> Host 64b741e8194f with 7 VMs with version 1.3.0
> org.apache.geode.cache.wan.WANRollingUpgradeNewSenderProcessOldEvent > 
> oldEventShouldBeProcessedAtTwoNewSender[from_v1.5.0] FAILED
> org.apache.geode.test.dunit.RMIException: While invoking 
> org.apache.geode.test.dunit.IgnoredException$1.run in VM 5 running on Host 
> 64b741e8194f with 7 VMs with version 1.3.0
> Caused by:
> java.lang.IllegalStateException: VM not available: VM 5 running on 
> Host 64b741e8194f with 7 VMs with version 1.3.0
> org.apache.geode.cache.wan.WANRollingUpgradeNewSenderProcessOldEvent > 
> oldEventShouldBeProcessedAtNewSender[from_v1.5.0] FAILED
> org.apache.geode.test.dunit.RMIException: While invoking 
> org.apache.geode.test.dunit.IgnoredException$1.run in VM 5 running on Host 
> 64b741e8194f with 7 VMs with version 1.3.0
> Caused by:
> java.lang.IllegalStateException: VM not available: VM 5 running on 
> Host 64b741e8194f with 7 VMs with version 1.3.0
> org.apache.geode.cache.wan.WANRollingUpgradeNewSenderProcessOldEvent > 
> bothOldAndNewEventsShouldBeProcessedByOldSender[from_v1.6.0] FAILED
> org.apache.geode.test.dunit.RMIException: While invoking 
> org.apache.geode.test.dunit.IgnoredException$1.run in VM 5 running on Host 
> 64b741e8194f with 7 VMs with version 1.3.0
> Caused by:
> java.lang.IllegalStateException: VM not available: VM 5 running on 
> Host 64b741e8194f with 7 VMs with version 1.3.0
> org.apache.geode.cache.wan.WANRollingUpgradeNewSenderProcessOldEvent > 
> oldEventShouldBeProcessedAtTwoNewSender[from_v1.6.0] FAILED
> org.apache.geode.test.dunit.RMIException: While invoking 
> org.apache.geode.test.dunit.IgnoredException$1.run in VM 5 running on Host 
> 64b741e8194f with 7 VMs with version 1.3.0
> Caused by:
> java.lang.IllegalStateException: VM not available: VM 5 running on 
> Host 64b741e8194f with 7 VMs with version 1.3.0
> org.apache.geode.cache.wan.WANRollingUpgradeNewSenderProcessOldEvent > 
> oldEventShouldBeProcessedAtNewSender[from_v1.6.0] FAILED
> org.apache.geode.test.dunit.RMIException: While invoking 
> org.apache.geode.test.dunit.IgnoredException$1.run 

[jira] [Updated] (GEODE-9889) LettucePubSubIntegrationTest > subscribePsubscribeSameClient FAILED

2022-04-05 Thread Anilkumar Gingade (Jira)


 [ 
https://issues.apache.org/jira/browse/GEODE-9889?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Anilkumar Gingade updated GEODE-9889:
-
Labels: redis-triage  (was: needsTriage)

> LettucePubSubIntegrationTest > subscribePsubscribeSameClient FAILED
> ---
>
> Key: GEODE-9889
> URL: https://issues.apache.org/jira/browse/GEODE-9889
> Project: Geode
>  Issue Type: Bug
>  Components: redis
>Affects Versions: 1.14.0
>Reporter: Ray Ingles
>Assignee: Jens Deppe
>Priority: Major
>  Labels: redis-triage
>
> Seen in a CI build:
>  
> {{> Task :geode-apis-compatible-with-redis:integrationTest}}
> {{org.apache.geode.redis.internal.executor.pubsub.LettucePubSubIntegrationTest
>  > subscribePsubscribeSameClient FAILED}}
> {{org.junit.ComparisonFailure: expected:<[2]L> but was:<[0]L>}}



--
This message was sent by Atlassian Jira
(v8.20.1#820001)


[jira] [Updated] (GEODE-3492) ServerLauncherRemoteFileIntegrationTest fails on Windows

2022-04-05 Thread Anilkumar Gingade (Jira)


 [ 
https://issues.apache.org/jira/browse/GEODE-3492?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Anilkumar Gingade updated GEODE-3492:
-
Labels: IntegrationTest Windows  (was: IntegrationTest Windows needsTriage)

> ServerLauncherRemoteFileIntegrationTest fails on Windows
> 
>
> Key: GEODE-3492
> URL: https://issues.apache.org/jira/browse/GEODE-3492
> Project: Geode
>  Issue Type: Sub-task
>  Components: gfsh, tests
> Environment: Windows
>Reporter: Kirk Lund
>Priority: Major
>  Labels: IntegrationTest, Windows
>
> {noformat}
> org.apache.geode.distributed.ServerLauncherRemoteFileIntegrationTest > 
> stopWithPidReturnsStopped FAILED
> org.awaitility.core.ConditionTimeoutException: Condition defined as a 
> lambda expression in 
> org.apache.geode.distributed.ServerLauncherRemoteIntegrationTestCase that 
> uses org.apache.geode.distributed.ServerLauncher expected:<[online]> but 
> was:<[not responding]> within 2 minutes.
> at 
> org.awaitility.core.ConditionAwaiter.await(ConditionAwaiter.java:104)
> at 
> org.awaitility.core.AssertionCondition.await(AssertionCondition.java:117)
> at 
> org.awaitility.core.AssertionCondition.await(AssertionCondition.java:32)
> at 
> org.awaitility.core.ConditionFactory.until(ConditionFactory.java:809)
> at 
> org.awaitility.core.ConditionFactory.until(ConditionFactory.java:648)
> at 
> org.apache.geode.distributed.ServerLauncherRemoteIntegrationTestCase.awaitStart(ServerLauncherRemoteIntegrationTestCase.java:198)
> at 
> org.apache.geode.distributed.ServerLauncherRemoteIntegrationTestCase.awaitStart(ServerLauncherRemoteIntegrationTestCase.java:177)
> at 
> org.apache.geode.distributed.ServerLauncherRemoteIntegrationTestCase.awaitStart(ServerLauncherRemoteIntegrationTestCase.java:188)
> at 
> org.apache.geode.distributed.ServerLauncherRemoteIntegrationTestCase.givenRunningServer(ServerLauncherRemoteIntegrationTestCase.java:114)
> at 
> org.apache.geode.distributed.ServerLauncherRemoteIntegrationTestCase.givenRunningServer(ServerLauncherRemoteIntegrationTestCase.java:110)
> at 
> org.apache.geode.distributed.ServerLauncherRemoteFileIntegrationTest.stopWithPidReturnsStopped(ServerLauncherRemoteFileIntegrationTest.java:75)
> org.apache.geode.distributed.ServerLauncherRemoteFileIntegrationTest > 
> stopWithPidDeletesPidFile FAILED
> org.awaitility.core.ConditionTimeoutException: Condition defined as a 
> lambda expression in 
> org.apache.geode.distributed.ServerLauncherRemoteIntegrationTestCase that 
> uses org.apache.geode.distributed.ServerLauncher expected:<[online]> but 
> was:<[not responding]> within 2 minutes.
> at 
> org.awaitility.core.ConditionAwaiter.await(ConditionAwaiter.java:104)
> at 
> org.awaitility.core.AssertionCondition.await(AssertionCondition.java:117)
> at 
> org.awaitility.core.AssertionCondition.await(AssertionCondition.java:32)
> at 
> org.awaitility.core.ConditionFactory.until(ConditionFactory.java:809)
> at 
> org.awaitility.core.ConditionFactory.until(ConditionFactory.java:648)
> at 
> org.apache.geode.distributed.ServerLauncherRemoteIntegrationTestCase.awaitStart(ServerLauncherRemoteIntegrationTestCase.java:198)
> at 
> org.apache.geode.distributed.ServerLauncherRemoteIntegrationTestCase.awaitStart(ServerLauncherRemoteIntegrationTestCase.java:177)
> at 
> org.apache.geode.distributed.ServerLauncherRemoteIntegrationTestCase.awaitStart(ServerLauncherRemoteIntegrationTestCase.java:188)
> at 
> org.apache.geode.distributed.ServerLauncherRemoteIntegrationTestCase.givenRunningServer(ServerLauncherRemoteIntegrationTestCase.java:114)
> at 
> org.apache.geode.distributed.ServerLauncherRemoteIntegrationTestCase.givenRunningServer(ServerLauncherRemoteIntegrationTestCase.java:110)
> at 
> org.apache.geode.distributed.ServerLauncherRemoteFileIntegrationTest.stopWithPidDeletesPidFile(ServerLauncherRemoteFileIntegrationTest.java:63)
> org.apache.geode.distributed.ServerLauncherRemoteFileIntegrationTest > 
> stopWithPidStopsServerProcess FAILED
> org.awaitility.core.ConditionTimeoutException: Condition defined as a 
> lambda expression in 
> org.apache.geode.distributed.ServerLauncherRemoteIntegrationTestCase that 
> uses org.apache.geode.distributed.ServerLauncher expected:<[online]> but 
> was:<[not responding]> within 2 minutes.
> at 
> org.awaitility.core.ConditionAwaiter.await(ConditionAwaiter.java:104)
> at 
> org.awaitility.core.AssertionCondition.await(AssertionCondition.java:117)
> at 
> org.awaitility.core.AssertionCondition.await(AssertionCondition.java:32)
> at 
> 

[jira] [Updated] (GEODE-8641) CI Failure: PartitionedIndexedQueryBenchmark.run

2022-04-05 Thread Anilkumar Gingade (Jira)


 [ 
https://issues.apache.org/jira/browse/GEODE-8641?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Anilkumar Gingade updated GEODE-8641:
-
Labels:   (was: needsTriage)

> CI Failure: PartitionedIndexedQueryBenchmark.run
> 
>
> Key: GEODE-8641
> URL: https://issues.apache.org/jira/browse/GEODE-8641
> Project: Geode
>  Issue Type: Test
>  Components: benchmarks, tests
>Reporter: Jens Deppe
>Priority: Major
>
> Benchmarking looks like some internal error:
> https://concourse.apachegeode-ci.info/teams/main/pipelines/apache-develop-main/jobs/Benchmark_base/builds/406
> {noformat}
> org.apache.geode.benchmark.tests.PartitionedIndexedQueryBenchmark > run() 
> FAILED
> java.util.concurrent.CompletionException: java.lang.RuntimeException: 
> java.rmi.UnmarshalException: Error unmarshaling return header; nested 
> exception is: 
>   java.io.EOFException
> Caused by:
> java.lang.RuntimeException: java.rmi.UnmarshalException: Error 
> unmarshaling return header; nested exception is: 
>   java.io.EOFException
> Caused by:
> java.rmi.UnmarshalException: Error unmarshaling return header; 
> nested exception is: 
>   java.io.EOFException
> Caused by:
> java.io.EOFException
> {noformat}



--
This message was sent by Atlassian Jira
(v8.20.1#820001)


[jira] [Updated] (GEODE-9987) Mass-Test_Run: HostedLocatorsDUnitTest > testGetAllHostedLocators FAILED

2022-04-05 Thread Anilkumar Gingade (Jira)


 [ 
https://issues.apache.org/jira/browse/GEODE-9987?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Anilkumar Gingade updated GEODE-9987:
-
Labels: pull-request-available  (was: needsTriage pull-request-available)

> Mass-Test_Run: HostedLocatorsDUnitTest > testGetAllHostedLocators FAILED
> 
>
> Key: GEODE-9987
> URL: https://issues.apache.org/jira/browse/GEODE-9987
> Project: Geode
>  Issue Type: Bug
>Affects Versions: 1.15.0
>Reporter: Kristen
>Assignee: Nabarun Nag
>Priority: Major
>  Labels: pull-request-available
>
> {code:java}
> HostedLocatorsDUnitTest > testGetAllHostedLocators FAILED
> org.gradle.internal.exceptions.DefaultMultiCauseException: Multiple 
> Failures (2 failures)
>   org.apache.geode.test.dunit.RMIException: While invoking 
> org.apache.geode.distributed.HostedLocatorsDUnitTest$1.call in VM 3 running 
> on Host 
> heavy-lifter-a104d1cf-827a-5a63-b549-400f89cd9204.c.apachegeode-ci.internal 
> with 4 VMs
>   java.lang.AssertionError: Suspicious strings were written to the log 
> during this run.
> Fix the strings or use IgnoredException.addIgnoredException to ignore.
> ---
> Found suspect string in 'dunit_suspect-vm3.log' at line 458
> [fatal 2022/01/22 17:02:39.638 UTC  receiver,heavy-lifter-a104d1cf-827a-5a63-b549-400f89cd9204-65003> tid=125] 
> Membership service failure: Member isn't responding to heartbeat requests
> 
> org.apache.geode.distributed.internal.membership.api.MemberDisconnectedException:
>  Member isn't responding to heartbeat requests
>   at 
> org.apache.geode.distributed.internal.membership.gms.GMSMembership$ManagerImpl.forceDisconnect(GMSMembership.java:1806)
>   at 
> org.apache.geode.distributed.internal.membership.gms.membership.GMSJoinLeave.forceDisconnect(GMSJoinLeave.java:1120)
>   at 
> org.apache.geode.distributed.internal.membership.gms.membership.GMSJoinLeave.processRemoveMemberMessage(GMSJoinLeave.java:723)
>  ...
> Caused by:
> org.apache.geode.test.dunit.RMIException: While invoking 
> org.apache.geode.distributed.HostedLocatorsDUnitTest$1.call in VM 3 running 
> on Host 
> heavy-lifter-a104d1cf-827a-5a63-b549-400f89cd9204.c.apachegeode-ci.internal 
> with 4 VMs
> at 
> org.apache.geode.test.dunit.VM.executeMethodOnObject(VM.java:631)
> at org.apache.geode.test.dunit.VM.invoke(VM.java:473)
> at 
> org.apache.geode.distributed.HostedLocatorsDUnitTest.testGetAllHostedLocators(HostedLocatorsDUnitTest.java:92)
> Caused by:
> 
> org.apache.geode.distributed.DistributedSystemDisconnectedException: 
> Distribution manager on 
> heavy-lifter-a104d1cf-827a-5a63-b549-400f89cd9204(heavy-lifter-a104d1cf-827a-5a63-b549-400f89cd9204.c.apachegeode-ci.internal_HostedLocatorsDUnitTest_testGetAllHostedLocators-3:115919:locator):59339
>  started at Sat Jan 22 17:02:21 UTC 2022: Member isn't responding to 
> heartbeat requests, caused by org.apache.geode.ForcedDisconnectException: 
> Member isn't responding to heartbeat requests
> at 
> org.apache.geode.distributed.internal.ClusterDistributionManager$Stopper.generateCancelledException(ClusterDistributionManager.java:2893)
> at 
> org.apache.geode.distributed.internal.InternalDistributedSystem$Stopper.generateCancelledException(InternalDistributedSystem.java:1177)
> at 
> org.apache.geode.CancelCriterion.checkCancelInProgress(CancelCriterion.java:83)
> at 
> org.apache.geode.distributed.internal.locks.DLockService.create(DLockService.java:2686)
> at 
> org.apache.geode.distributed.internal.locks.DLockService.create(DLockService.java:2660)
> at 
> org.apache.geode.distributed.internal.locks.DLockService.getOrCreateService(DLockService.java:2641)
> at 
> org.apache.geode.distributed.internal.InternalConfigurationPersistenceService.(InternalConfigurationPersistenceService.java:131)
> at 
> org.apache.geode.distributed.internal.InternalLocator.startConfigurationPersistenceService(InternalLocator.java:1390)
> at 
> org.apache.geode.distributed.internal.InternalLocator.startClusterManagementService(InternalLocator.java:792)
> at 
> org.apache.geode.distributed.internal.InternalLocator.startCache(InternalLocator.java:787)
> at 
> org.apache.geode.distributed.internal.InternalLocator.startDistributedSystem(InternalLocator.java:768)
> at 
> org.apache.geode.distributed.internal.InternalLocator.startLocator(InternalLocator.java:399)
> at 
> 

[jira] [Updated] (GEODE-10056) Gateway-reciver connection load mantained only on one locator

2022-04-05 Thread Anilkumar Gingade (Jira)


 [ 
https://issues.apache.org/jira/browse/GEODE-10056?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Anilkumar Gingade updated GEODE-10056:
--
Labels: pull-request-available  (was: needsTriage pull-request-available)

> Gateway-reciver connection load mantained only on one locator
> -
>
> Key: GEODE-10056
> URL: https://issues.apache.org/jira/browse/GEODE-10056
> Project: Geode
>  Issue Type: Bug
>Reporter: Jakov Varenina
>Assignee: Jakov Varenina
>Priority: Major
>  Labels: pull-request-available
>
> The first problem is that servers send incorrect gateway-receiver connection 
> load to locators with CacheServerLoadMessage. The second problem is that the 
> locator doesn't refresh gateway-receivers load per server in the local map 
> with the load received in CacheServerLoadMessage. This seems to be a bug, as 
> there is already a mechanism to track and store gateway-receiver connection 
> load per server in the locator, but that load is never refreshed by a fault 
> at the reception of CacheServerLoadMessage. Currently, receiver load is only 
> refreshed/increased on the locator that is handling 
> ClientConnectionRequest\{group=__recv_group...} and ClientConnectionResponse 
> messages from a remote server that is trying to establish gateway sender 
> connection. All other locators in a cluster will never refresh the 
> gateway-receiver connection load in this case. When the locator that was 
> serving remote gateway-senders goes down then a new locator will take that 
> job. Problem is that the new locator will not have a correct load (it was 
> never refreshed) and that would in most situations result in new 
> gateway-sender connections being established in an unbalanced way.
> Way to reproduce the issue:
> Start 2 clusters, Let's call site1 the sending and site2 the receiving site, 
> The receiving site should have at least 2 locators. Both have 2 servers. No 
> regions are needed.
> Cluster-1 gfsh>list members
> Member Count : 3Name | Id
> - | -
> locator10 | 10.0.2.15(locator10:7332:locator):41000 [Coordinator]
> server11 | 10.0.2.15(server11:8358):41003
> server12 | 10.0.2.15(server12:8717):41005
>  
> Cluster-2 gfsh>list members
> Member Count : 4Name | Id
> - | -
> locator10 | 10.0.2.15(locator10:7562:locator):41001 [Coordinator]
> locator11 | 10.0.2.15(locator11:8103:locator):41002
> server11 | 10.0.2.15(server11:8547):41004
> server12 | 10.0.2.15(server12:8908):41006
>  
> Create GW receiver in Site2 on both servers.
>  
> Cluster-2 gfsh>list gateways
> GatewayReceiver Section  Member   | Port | Sender 
> Count | Senders Connected
> -- |  |  | -
> 10.0.2.15(server11:8547):41004 | 5175 | 0    |
> 10.0.2.15(server12:8908):41006 | 5457 | 0    |
> Create GW sender in Site1 on both servers. Use 10 dispatcher threads for 
> easier obervation. 
> Cluster-1 gfsh>list gateways
> GatewaySender SectionGatewaySender Id |   Member   | 
> Remote Cluster Id |   Type   |    Status | Queued Events | 
> Receiver Location
>  | -- | - | 
>  | - | - | -
> senderTo2    | 10.0.2.15(server11:8358):41003 | 2 | 
> Parallel | Running and Connected | 0 | 10.0.2.15:5457
> senderTo2    | 10.0.2.15(server12:8717):41005 | 2 | 
> Parallel | Running and Connected | 0 | 10.0.2.15:5457
>  
> Observe balance in GW receiver connections in Site2. It will be perfect.
>  
> Cluster-2 gfsh>list gateways
> GatewayReceiver Section  Member   | Port | Sender 
> Count | Senders Connected
> -- |  |  | 
> -
> 10.0.2.15(server11:8547):41004 | 5175 | 12   | 
> 10.0.2.15(server12:8717):41005, 10.0.2.15(server12:8717):41005, 
> 10.0.2.15(server11:8358):41003, 10.0.2.15(server11:..
> 10.0.2.15(server12:8908):41006 | 5457 | 12   | 
> 10.0.2.15(server12:8717):41005, 10.0.2.15(server12:8717):41005, 
> 10.0.2.15(server12:8717):41005, 10.0.2.15(server12:..
>  
> 12 connections each - 10 payload + 2 ping connections.
> Now stop GW receiver in one server of site2. In Site1 do a stop/start 
> gateway-sender command - all connections will go to the only receiver in 
> site2 (as expected). Check it:
>  
> Cluster-2 gfsh>list gateways
> GatewayReceiver Section  

[jira] [Updated] (GEODE-10095) TcrConnection::readHandshakeData reads too many bytes

2022-04-05 Thread Anilkumar Gingade (Jira)


 [ 
https://issues.apache.org/jira/browse/GEODE-10095?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Anilkumar Gingade updated GEODE-10095:
--
Labels:   (was: needsTriage)

> TcrConnection::readHandshakeData reads too many bytes
> -
>
> Key: GEODE-10095
> URL: https://issues.apache.org/jira/browse/GEODE-10095
> Project: Geode
>  Issue Type: Bug
>  Components: native client
>Reporter: Blake Bender
>Assignee: Matthew Reddington
>Priority: Major
>
> This method is called to read bytes from a socket, and return said data in a 
> `std::vector`.  For some inexplicable (inexcusable?) reason, the 
> method always adds a 0 byte to the end of the vector, as if it were 
> null-terminating a string.  So, `readHandshakeData(1)` returns 2 bytes, 
> `readHandshakeData(5)` returns 6 bytes, etc.
> This is extremely misleading, given the name of the method and the fact that 
> the requested number of bytes is a parameter passed in.  Also, in no 
> circumstance is this method used to actually read a string, i.e. something 
> that may require null-termination.  Please remove the extra byte from the 
> returned vector, and if possible add a unit test to the suite that reads _n_ 
> bytes and asserts that the method always returns _n_ bytes.



--
This message was sent by Atlassian Jira
(v8.20.1#820001)


[jira] [Updated] (GEODE-10012) Avoid retries for non-HA function execution

2022-04-05 Thread Anilkumar Gingade (Jira)


 [ 
https://issues.apache.org/jira/browse/GEODE-10012?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Anilkumar Gingade updated GEODE-10012:
--
Labels: pull-request-available  (was: needsTriage pull-request-available)

> Avoid retries for non-HA function execution
> ---
>
> Key: GEODE-10012
> URL: https://issues.apache.org/jira/browse/GEODE-10012
> Project: Geode
>  Issue Type: Bug
>  Components: native client
>Reporter: Mario Salazar de Torres
>Assignee: Mario Salazar de Torres
>Priority: Major
>  Labels: pull-request-available
>
> *GIVEN* a cluster with 3 servers and 1 locator
> *AND* a PartitionedRegion with redundant-copies="1"
> *AND* a user-defined Function with isHA=false
> *AND* a geode-native client with a pool with single-hop enabled and retry 
> attempts set to 2
> *WHEN* execution fails due to a connectivity issue/connection 
> timeout/internal cluster error
> *THEN* function is retried twice
> 
> *Additional information.* The function is retried at the network code layer, 
> and given whenever there is a timeout/connectivity error, the BSL is removed 
> from the ClientMetadataService, whenever retries happen, you might get a 
> *InternalFunctionInvocationTargetException* exception thrown indicating: 
> {code:java}
> Multiple target nodes found for single hop operation {code}
> Also, have into account that Java client API behaviour never retries a 
> Function Execution if isHA=false for that function, and whenever isHA=true, 
> the function is retried but on the function execution logic and not at the 
> networking layer.
>  



--
This message was sent by Atlassian Jira
(v8.20.1#820001)


[jira] [Updated] (GEODE-10017) Fix new ITs unstability for TCs that involve members restart

2022-04-05 Thread Anilkumar Gingade (Jira)


 [ 
https://issues.apache.org/jira/browse/GEODE-10017?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Anilkumar Gingade updated GEODE-10017:
--
Labels: pull-request-available  (was: needsTriage pull-request-available)

> Fix new ITs unstability for TCs that involve members restart
> 
>
> Key: GEODE-10017
> URL: https://issues.apache.org/jira/browse/GEODE-10017
> Project: Geode
>  Issue Type: Bug
>  Components: native client
>Reporter: Mario Salazar de Torres
>Assignee: Mario Salazar de Torres
>Priority: Major
>  Labels: pull-request-available
>
> *GIVEN* an integration TC on which a server restart needs to be restarted
> *WHEN* the server is starting up again
> *THEN* +{color:#172b4d}it might{color}+{color:#172b4d} happen that the TC 
> gets stuck{color}
> 
> *Additional information.* This issue does not always happens, and I've seen 
> it happening more frequently with the latest version of Geode server (1.15.0)
> Some examples of this TC are:
>  * RegisterKeysTest.RegisterKeySetAndClusterRestart
>  * PartitionRegionWithRedundancyTest.putgetWithSingleHop
>  * ···
> Also, this is normally the exec flow for the TCs that get stuck:
>  # Setup cluster
>  # Do TC specific ops
>  # Stop server(s)/Cluster shutdown
>  # Start server(s)
> In all cases, the server gets stuck at step 4



--
This message was sent by Atlassian Jira
(v8.20.1#820001)


[jira] [Updated] (GEODE-9753) Coredump during PdxSerializable object serialization

2022-04-05 Thread Anilkumar Gingade (Jira)


 [ 
https://issues.apache.org/jira/browse/GEODE-9753?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Anilkumar Gingade updated GEODE-9753:
-
Labels: pull-request-available  (was: needsTriage pull-request-available)

> Coredump during PdxSerializable object serialization
> 
>
> Key: GEODE-9753
> URL: https://issues.apache.org/jira/browse/GEODE-9753
> Project: Geode
>  Issue Type: Bug
>  Components: native client
>Reporter: Mario Salazar de Torres
>Assignee: Mario Salazar de Torres
>Priority: Major
>  Labels: pull-request-available
>
> *GIVEN*  **a  single server and locator cluster with 1 PdxSerializable class 
> registered, named Order
>    *AND* a geode-native client with 1 PdxSerializable 1 PdxSerializable class 
> registered, named Order
>  *WHEN* a Order object is tried to be serialized
>    *WHILE* the cluster is being restarted
> *THEN* a coredump happens due to PdxType=nullptr
> —
> *Additional information*. As seen by early troubleshooting, the coredump 
> happens because the pdx type is tried to be fetched from the PdxTypeRegist by 
> its class name, but the PdxTypeRegistry is cleaned up during serialization 
> given that subscription redundancy was lost.



--
This message was sent by Atlassian Jira
(v8.20.1#820001)


[jira] [Updated] (GEODE-10217) Mockito Unable to Mock org.apache.geode.internal.cache.DiskRegion

2022-04-05 Thread Anilkumar Gingade (Jira)


 [ 
https://issues.apache.org/jira/browse/GEODE-10217?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Anilkumar Gingade updated GEODE-10217:
--
Labels:   (was: needsTriage)

> Mockito Unable to Mock org.apache.geode.internal.cache.DiskRegion
> -
>
> Key: GEODE-10217
> URL: https://issues.apache.org/jira/browse/GEODE-10217
> Project: Geode
>  Issue Type: Bug
>  Components: core, tests
>Reporter: Patrick Johnsn
>Priority: Major
>
> Mokito cannot mock DiskRegion, DiskRegionView, and AbstractDiskRegion because 
> Byte Buddy could not instrument all classes within the mock's type hierarchy.
> {noformat}
> DiskEntryHelperTest > doSynchronousWriteReturnsTrueWhenDiskRegionIsSync FAILED
> 10:19:40org.mockito.exceptions.base.MockitoException: 
> 10:19:40Mockito cannot mock this class: class 
> org.apache.geode.internal.cache.DiskRegion.
> 10:19:40
> 10:19:40If you're not sure why you're getting this error, please report 
> to the mailing list.
> 10:19:40
> 10:19:40
> 10:19:40Java   : 1.8
> 10:19:40JVM vendor name: BellSoft
> 10:19:40JVM vendor version : 25.322-b06
> 10:19:40JVM name   : OpenJDK 64-Bit Server VM
> 10:19:40JVM version: 1.8.0_322-b06
> 10:19:40JVM info   : mixed mode
> 10:19:40OS name: Linux
> 10:19:40OS version : 5.4.0-1069-gcp
> 10:19:40
> 10:19:40
> 10:19:40You are seeing this disclaimer because Mockito is configured to 
> create inlined mocks.
> 10:19:40You can learn about inline mocks and their limitations under item 
> #39 of the Mockito class javadoc.
> 10:19:40
> 10:19:40Underlying exception : 
> org.mockito.exceptions.base.MockitoException: Could not modify all classes 
> [class org.apache.geode.internal.cache.DiskRegion, interface 
> org.apache.geode.internal.cache.persistence.DiskRegionView, class 
> org.apache.geode.internal.cache.AbstractDiskRegion]
> 10:19:40at 
> org.apache.geode.internal.cache.entries.DiskEntryHelperTest.(DiskEntryHelperTest.java:44)
> 10:19:40
> 10:19:40Caused by:
> 10:19:40org.mockito.exceptions.base.MockitoException: Could not 
> modify all classes [class org.apache.geode.internal.cache.DiskRegion, 
> interface org.apache.geode.internal.cache.persistence.DiskRegionView, class 
> org.apache.geode.internal.cache.AbstractDiskRegion]
> 10:19:40at 
> net.bytebuddy.TypeCache.findOrInsert(TypeCache.java:157)
> 10:19:40at 
> net.bytebuddy.TypeCache$WithInlineExpunction.findOrInsert(TypeCache.java:371)
> 10:19:40at 
> net.bytebuddy.TypeCache.findOrInsert(TypeCache.java:179)
> 10:19:40at 
> net.bytebuddy.TypeCache$WithInlineExpunction.findOrInsert(TypeCache.java:382)
> 10:19:40... 1 more
> 10:19:40
> 10:19:40Caused by:
> 10:19:40java.lang.IllegalStateException: 
> 10:19:40Byte Buddy could not instrument all classes within the 
> mock's type hierarchy
> 10:19:40
> 10:19:40This problem should never occur for javac-compiled 
> classes. This problem has been observed for classes that are:
> 10:19:40 - Compiled by older versions of scalac
> 10:19:40 - Classes that are part of the Android distribution
> 10:19:40at 
> org.mockito.internal.creation.bytebuddy.InlineBytecodeGenerator.triggerRetransformation(InlineBytecodeGenerator.java:280)
> 10:19:40at 
> org.mockito.internal.creation.bytebuddy.InlineBytecodeGenerator.mockClass(InlineBytecodeGenerator.java:213)
> 10:19:40at 
> org.mockito.internal.creation.bytebuddy.TypeCachingBytecodeGenerator.lambda$mockClass$0(TypeCachingBytecodeGenerator.java:47)
> 10:19:40at 
> net.bytebuddy.TypeCache.findOrInsert(TypeCache.java:157)
> 10:19:40at 
> net.bytebuddy.TypeCache$WithInlineExpunction.findOrInsert(TypeCache.java:371)
> 10:19:40at 
> net.bytebuddy.TypeCache.findOrInsert(TypeCache.java:179)
> 10:19:40at 
> net.bytebuddy.TypeCache$WithInlineExpunction.findOrInsert(TypeCache.java:382)
> 10:19:40at 
> org.mockito.internal.creation.bytebuddy.TypeCachingBytecodeGenerator.mockClass(TypeCachingBytecodeGenerator.java:40)
> 10:19:40at 
> org.mockito.internal.creation.bytebuddy.InlineDelegateByteBuddyMockMaker.createMockType(InlineDelegateByteBuddyMockMaker.java:389)
> 10:19:40at 
> org.mockito.internal.creation.bytebuddy.InlineDelegateByteBuddyMockMaker.doCreateMock(InlineDelegateByteBuddyMockMaker.java:349)
> 10:19:40at 
> org.mockito.internal.creation.bytebuddy.InlineDelegateByteBuddyMockMaker.createMock(InlineDelegateByteBuddyMockMaker.java:328)
> 10:19:40at 
> 

[jira] [Updated] (GEODE-10208) Support branches fail to run geode-connectors acceptanceTests due to stale dependency

2022-04-05 Thread Anilkumar Gingade (Jira)


 [ 
https://issues.apache.org/jira/browse/GEODE-10208?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Anilkumar Gingade updated GEODE-10208:
--
Labels: pull-request-available  (was: needsTriage pull-request-available)

> Support branches fail to run geode-connectors acceptanceTests due to stale 
> dependency
> -
>
> Key: GEODE-10208
> URL: https://issues.apache.org/jira/browse/GEODE-10208
> Project: Geode
>  Issue Type: Bug
>Reporter: Kirk Lund
>Priority: Major
>  Labels: pull-request-available
>
> Attempting to execute geode-connectors:acceptanceTests on support/1.12 or 
> 1.13 results in failures with message:
> {noformat}
> com.github.dockerjava.api.exception.NotFoundException: {"message":"No such 
> image: quay.io/testcontainers/ryuk:0.2.3"}
> {noformat}
> These support branches are using testcontainers version 1.13. Versions prior 
> to 1.14.3 are no longer available anywhere:
> https://stackoverflow.com/questions/61887363/testcontainers-cant-pull-ryuk-image-quay-io-is-not-reachable
> All branches need to be updated to use 1.14.3 or later.
> {noformat}
> > Task :geode-connectors:acceptanceTest
> org.apache.geode.connectors.jdbc.MySqlJdbcWriterIntegrationTest > classMethod 
> FAILED
> com.github.dockerjava.api.exception.NotFoundException: {"message":"No 
> such image: quay.io/testcontainers/ryuk:0.2.3"}
> org.apache.geode.connectors.jdbc.MySqlJdbcDistributedTest > classMethod FAILED
> com.github.dockerjava.api.exception.NotFoundException: {"message":"No 
> such image: quay.io/testcontainers/ryuk:0.2.3"}
> org.apache.geode.connectors.jdbc.PostgresJdbcLoaderIntegrationTest > 
> classMethod FAILED
> com.github.dockerjava.api.exception.NotFoundException: {"message":"No 
> such image: quay.io/testcontainers/ryuk:0.2.3"}
> org.apache.geode.connectors.jdbc.internal.PostgresTableMetaDataManagerIntegrationTest
>  > classMethod FAILED
> com.github.dockerjava.api.exception.NotFoundException: {"message":"No 
> such image: quay.io/testcontainers/ryuk:0.2.3"}
> org.apache.geode.connectors.jdbc.internal.MySqlTableMetaDataManagerIntegrationTest
>  > classMethod FAILED
> com.github.dockerjava.api.exception.NotFoundException: {"message":"No 
> such image: quay.io/testcontainers/ryuk:0.2.3"}
> > Task :geode-redis:acceptanceTest
> org.apache.geode.redis.ExpireNativeRedisAcceptanceTest > classMethod FAILED
> org.testcontainers.containers.ContainerLaunchException: Container startup 
> failed
> at 
> org.testcontainers.containers.GenericContainer.doStart(GenericContainer.java:322)
> at 
> org.testcontainers.containers.GenericContainer.start(GenericContainer.java:302)
> at 
> org.apache.geode.redis.ExpireNativeRedisAcceptanceTest.setUp(ExpireNativeRedisAcceptanceTest.java:43)
> Caused by:
> org.testcontainers.containers.ContainerFetchException: Can't get 
> Docker image: 
> RemoteDockerImage(imageNameFuture=java.util.concurrent.CompletableFuture@28de948a[Completed
>  normally], imagePullPolicy=DefaultPullPolicy(), 
> dockerClient=LazyDockerClient.INSTANCE)
> at 
> org.testcontainers.containers.GenericContainer.getDockerImageName(GenericContainer.java:1265)
> at 
> org.testcontainers.containers.GenericContainer.logger(GenericContainer.java:600)
> at 
> org.testcontainers.containers.GenericContainer.doStart(GenericContainer.java:311)
> ... 2 more
> Caused by:
> com.github.dockerjava.api.exception.NotFoundException: 
> {"message":"No such image: quay.io/testcontainers/ryuk:0.2.3"}
> at 
> org.testcontainers.dockerclient.transport.okhttp.OkHttpInvocationBuilder.execute(OkHttpInvocationBuilder.java:281)
> at 
> org.testcontainers.dockerclient.transport.okhttp.OkHttpInvocationBuilder.execute(OkHttpInvocationBuilder.java:265)
> at 
> org.testcontainers.dockerclient.transport.okhttp.OkHttpInvocationBuilder.post(OkHttpInvocationBuilder.java:136)
> at 
> com.github.dockerjava.core.exec.CreateContainerCmdExec.execute(CreateContainerCmdExec.java:33)
> at 
> com.github.dockerjava.core.exec.CreateContainerCmdExec.execute(CreateContainerCmdExec.java:13)
> at 
> com.github.dockerjava.core.exec.AbstrSyncDockerCmdExec.exec(AbstrSyncDockerCmdExec.java:21)
> at 
> com.github.dockerjava.core.command.AbstrDockerCmd.exec(AbstrDockerCmd.java:35)
> at 
> com.github.dockerjava.core.command.CreateContainerCmdImpl.exec(CreateContainerCmdImpl.java:1139)
> at 
> org.testcontainers.utility.ResourceReaper.start(ResourceReaper.java:91)
> at 
> org.testcontainers.DockerClientFactory.client(DockerClientFactory.java:155)
> 

[jira] [Commented] (GEODE-10215) WAN replication not working after re-creating the partitioned region

2022-04-05 Thread Anilkumar Gingade (Jira)


[ 
https://issues.apache.org/jira/browse/GEODE-10215?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17517736#comment-17517736
 ] 

Anilkumar Gingade commented on GEODE-10215:
---

[~jvarenina]Are you working/fixing on this issue.

> WAN replication not working after re-creating the partitioned region
> 
>
> Key: GEODE-10215
> URL: https://issues.apache.org/jira/browse/GEODE-10215
> Project: Geode
>  Issue Type: Bug
>Reporter: Jakov Varenina
>Assignee: Jakov Varenina
>Priority: Major
>  Labels: needsTriage
>
> Steps to reproduce the issue:
> Start multi-site with at least 3 servers on each site. If there are less than 
> three servers then issue will not reproduce.
> Configuration site 1:
> {code:java}
> create disk-store --name=queue_disk_store --dir=ds2
> create gateway-sender -id="remote_site_2" --parallel="true" 
> --remote-distributed-system-id="1"  -enable-persistence=true 
> --disk-store-name=queue_disk_store
> create disk-store --name=data_disk_store --dir=ds1
> create region --name=example-region --type=PARTITION_PERSISTENT 
> --gateway-sender-id="remote_site_2" --disk-store=data_disk_store 
> --total-num-buckets=1103 --redundant-copies=1 --enable-synchronous-disk=false
> #Configure the remote site 2 with the region and the gateway-receiver  
> #Run some traffic so that all buckets are created and data is replicated to 
> the other site
> alter region --name=/example-region --gateway-sender-id=""
> destroy region --name=/example-region
> create region --name=example-region --type=PARTITION_PERSISTENT 
> --gateway-sender-id="remote_site_2" --disk-store=data_disk_store 
> --total-num-buckets=1103 --redundant-copies=1 --enable-synchronous-disk=false
> #run traffic to see that some data is not replicated to the remote site 2 
> {code}



--
This message was sent by Atlassian Jira
(v8.20.1#820001)


[jira] [Updated] (GEODE-10215) WAN replication not working after re-creating the partitioned region

2022-04-05 Thread Anilkumar Gingade (Jira)


 [ 
https://issues.apache.org/jira/browse/GEODE-10215?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Anilkumar Gingade updated GEODE-10215:
--
Labels: needsTriage  (was: needs)

> WAN replication not working after re-creating the partitioned region
> 
>
> Key: GEODE-10215
> URL: https://issues.apache.org/jira/browse/GEODE-10215
> Project: Geode
>  Issue Type: Bug
>Reporter: Jakov Varenina
>Assignee: Jakov Varenina
>Priority: Major
>  Labels: needsTriage
>
> Steps to reproduce the issue:
> Start multi-site with at least 3 servers on each site. If there are less than 
> three servers then issue will not reproduce.
> Configuration site 1:
> {code:java}
> create disk-store --name=queue_disk_store --dir=ds2
> create gateway-sender -id="remote_site_2" --parallel="true" 
> --remote-distributed-system-id="1"  -enable-persistence=true 
> --disk-store-name=queue_disk_store
> create disk-store --name=data_disk_store --dir=ds1
> create region --name=example-region --type=PARTITION_PERSISTENT 
> --gateway-sender-id="remote_site_2" --disk-store=data_disk_store 
> --total-num-buckets=1103 --redundant-copies=1 --enable-synchronous-disk=false
> #Configure the remote site 2 with the region and the gateway-receiver  
> #Run some traffic so that all buckets are created and data is replicated to 
> the other site
> alter region --name=/example-region --gateway-sender-id=""
> destroy region --name=/example-region
> create region --name=example-region --type=PARTITION_PERSISTENT 
> --gateway-sender-id="remote_site_2" --disk-store=data_disk_store 
> --total-num-buckets=1103 --redundant-copies=1 --enable-synchronous-disk=false
> #run traffic to see that some data is not replicated to the remote site 2 
> {code}



--
This message was sent by Atlassian Jira
(v8.20.1#820001)


[jira] [Updated] (GEODE-10215) WAN replication not working after re-creating the partitioned region

2022-04-05 Thread Anilkumar Gingade (Jira)


 [ 
https://issues.apache.org/jira/browse/GEODE-10215?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Anilkumar Gingade updated GEODE-10215:
--
Labels: needs  (was: needsTriage)

> WAN replication not working after re-creating the partitioned region
> 
>
> Key: GEODE-10215
> URL: https://issues.apache.org/jira/browse/GEODE-10215
> Project: Geode
>  Issue Type: Bug
>Reporter: Jakov Varenina
>Assignee: Jakov Varenina
>Priority: Major
>  Labels: needs
>
> Steps to reproduce the issue:
> Start multi-site with at least 3 servers on each site. If there are less than 
> three servers then issue will not reproduce.
> Configuration site 1:
> {code:java}
> create disk-store --name=queue_disk_store --dir=ds2
> create gateway-sender -id="remote_site_2" --parallel="true" 
> --remote-distributed-system-id="1"  -enable-persistence=true 
> --disk-store-name=queue_disk_store
> create disk-store --name=data_disk_store --dir=ds1
> create region --name=example-region --type=PARTITION_PERSISTENT 
> --gateway-sender-id="remote_site_2" --disk-store=data_disk_store 
> --total-num-buckets=1103 --redundant-copies=1 --enable-synchronous-disk=false
> #Configure the remote site 2 with the region and the gateway-receiver  
> #Run some traffic so that all buckets are created and data is replicated to 
> the other site
> alter region --name=/example-region --gateway-sender-id=""
> destroy region --name=/example-region
> create region --name=example-region --type=PARTITION_PERSISTENT 
> --gateway-sender-id="remote_site_2" --disk-store=data_disk_store 
> --total-num-buckets=1103 --redundant-copies=1 --enable-synchronous-disk=false
> #run traffic to see that some data is not replicated to the remote site 2 
> {code}



--
This message was sent by Atlassian Jira
(v8.20.1#820001)


[jira] [Updated] (GEODE-10188) AvailablePortHelperIntegrationTest > initializeUniquePortRange_returnSamePortsForSameRange gets different ports on subsequent tries

2022-04-05 Thread Anilkumar Gingade (Jira)


 [ 
https://issues.apache.org/jira/browse/GEODE-10188?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Anilkumar Gingade updated GEODE-10188:
--
Labels:   (was: needsTriage)

> AvailablePortHelperIntegrationTest > 
> initializeUniquePortRange_returnSamePortsForSameRange gets different ports on 
> subsequent tries
> ---
>
> Key: GEODE-10188
> URL: https://issues.apache.org/jira/browse/GEODE-10188
> Project: Geode
>  Issue Type: Bug
>  Components: tests
>Affects Versions: 1.13.9
>Reporter: Bill Burcham
>Priority: Major
>
> Failed here: [https://hydradb.hdb.gemfire-ci.info/hdb/testresult/14294054]
>  
> {noformat}
> > Task :geode-core:integrationTest
> org.apache.geode.internal.AvailablePortHelperIntegrationTest > 
> initializeUniquePortRange_returnSamePortsForSameRange(useMembershipPortRange=true)
>  FAILED
> org.junit.ComparisonFailure: expected:<[460[10, 46011, 4601]2]> but 
> was:<[460[00, 46001, 4600]2]>
> at 
> jdk.internal.reflect.NativeConstructorAccessorImpl.newInstance0(Native Method)
> at 
> jdk.internal.reflect.NativeConstructorAccessorImpl.newInstance(NativeConstructorAccessorImpl.java:62)
> at 
> jdk.internal.reflect.DelegatingConstructorAccessorImpl.newInstance(DelegatingConstructorAccessorImpl.java:45)
> at 
> org.apache.geode.internal.AvailablePortHelperIntegrationTest.initializeUniquePortRange_returnSamePortsForSameRange(AvailablePortHelperIntegrationTest.java:322)
> 4023 tests completed, 1 failed, 82 skipped
> =-=-=-=-=-=-=-=-=-=-=-=-=-=-=  Test Results URI 
> =-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=
> http://files.apachegeode-ci.info/builds/apache-support-1-13-main/1.13.9-build.0668/test-results/integrationTest/1648509410/
> =-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=
> Test report artifacts from this job are available at:
> http://files.apachegeode-ci.info/builds/apache-support-1-13-main/1.13.9-build.0668/test-artifacts/1648509410/integrationtestfiles-openjdk11-1.13.9-build.0668.tgz
> {noformat}



--
This message was sent by Atlassian Jira
(v8.20.1#820001)


[jira] [Updated] (GEODE-10205) getDiskStore method described in the docs does not exist in the Java API

2022-04-05 Thread Anilkumar Gingade (Jira)


 [ 
https://issues.apache.org/jira/browse/GEODE-10205?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Anilkumar Gingade updated GEODE-10205:
--
Labels: pull-request-available  (was: needsTriage pull-request-available)

> getDiskStore method described in the docs does not exist in the Java API
> 
>
> Key: GEODE-10205
> URL: https://issues.apache.org/jira/browse/GEODE-10205
> Project: Geode
>  Issue Type: Bug
>  Components: docs
>Affects Versions: 1.14.4
>Reporter: Max Hufnagel
>Priority: Major
>  Labels: pull-request-available
>
> h2. Description
>  
> Looking at document
> [https://geode.apache.org/docs/guide/114/managing/disk_storage/compacting_disk_stores.html|https://gemfire.docs.pivotal.io/910/geode/managing/disk_storage/compacting_disk_stores.html]
> It states:
> Compact the logs for a single online disk store through the API, with the 
> forceCompaction method. This method first rolls the oplogs and then compacts 
> them. Example:
> myCache.getDiskStore("myDiskStore").forceCompaction();
> In the Java API there is no getDiskStore method at all.
> Also in the DiskStore Interface it says to use Cache.FindDiskStore(STRING) to 
> find get an existing DiskStore object.



--
This message was sent by Atlassian Jira
(v8.20.1#820001)


[jira] [Updated] (GEODE-10189) Errors encountered during Apache Geode upgrade

2022-04-05 Thread Anilkumar Gingade (Jira)


 [ 
https://issues.apache.org/jira/browse/GEODE-10189?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Anilkumar Gingade updated GEODE-10189:
--
Labels:   (was: needsTriage)

> Errors encountered during Apache Geode upgrade
> --
>
> Key: GEODE-10189
> URL: https://issues.apache.org/jira/browse/GEODE-10189
> Project: Geode
>  Issue Type: Bug
>Affects Versions: 1.14.1
>Reporter: Swetha Sudheesh
>Priority: Critical
>
> We are trying to upgrade Apache Geode from *1.8.0* version to *1.14.1* 
> version to avoid any vulnerabilities. We also added all the dependencies 
> mentioned in the Maven with the latest versions. Our application uses {*}JDK 
> 11{*}. We faced few issues such as the one described below:
>  
> Exception in thread "Locator" *java.lang.ExceptionInInitializerError*
>     at 
> org.apache.geode.distributed.internal.membership.gms.Services.(Services.java:155)
>     at 
> org.apache.geode.distributed.internal.membership.gms.MembershipBuilderImpl.create(MembershipBuilderImpl.java:114)
>     at 
> org.apache.geode.distributed.internal.DistributionImpl.(DistributionImpl.java:152)
>     at 
> org.apache.geode.distributed.internal.DistributionImpl.createDistribution(DistributionImpl.java:219)
>     at 
> org.apache.geode.distributed.internal.ClusterDistributionManager.(ClusterDistributionManager.java:464)
>     at 
> org.apache.geode.distributed.internal.ClusterDistributionManager.(ClusterDistributionManager.java:497)
>     at 
> org.apache.geode.distributed.internal.ClusterDistributionManager.create(ClusterDistributionManager.java:326)
>     at 
> org.apache.geode.distributed.internal.InternalDistributedSystem.initialize(InternalDistributedSystem.java:756)
>     at 
> org.apache.geode.distributed.internal.InternalDistributedSystem.access$200(InternalDistributedSystem.java:132)
>     at 
> org.apache.geode.distributed.internal.InternalDistributedSystem$Builder.build(InternalDistributedSystem.java:3011)
>     at 
> org.apache.geode.distributed.internal.InternalDistributedSystem.connectInternal(InternalDistributedSystem.java:282)
>     at 
> org.apache.geode.distributed.internal.InternalLocator.startDistributedSystem(InternalLocator.java:746)
>     at 
> org.apache.geode.distributed.internal.InternalLocator.startLocator(InternalLocator.java:392)
>     at 
> org.apache.geode.distributed.LocatorLauncher.start(LocatorLauncher.java:716)
>  
> Caused by: *java.lang.IllegalStateException: JGAddress.create() returned the 
> wrong class: UUID*
>     at org.jgroups.conf.ClassConfigurator.add(ClassConfigurator.java:101)
>     at 
> org.apache.geode.distributed.internal.membership.gms.messenger.JGroupsMessenger.(JGroupsMessenger.java:167)
>     ... 17 more
> Please let us know why we are encountering this error and how can it be 
> resolved? Also let us know the right dependencies that need to be added for 
> Apache Geode 1.14.1 upgrade.



--
This message was sent by Atlassian Jira
(v8.20.1#820001)


[jira] [Updated] (GEODE-10195) MicrometerBinderTest > processorMetricsBinderExists FAILED

2022-04-05 Thread Anilkumar Gingade (Jira)


 [ 
https://issues.apache.org/jira/browse/GEODE-10195?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Anilkumar Gingade updated GEODE-10195:
--
Labels:   (was: needsTriage)

> MicrometerBinderTest > processorMetricsBinderExists FAILED
> --
>
> Key: GEODE-10195
> URL: https://issues.apache.org/jira/browse/GEODE-10195
> Project: Geode
>  Issue Type: Bug
>  Components: core
>Reporter: Mark Hanson
>Priority: Major
>
> windows-acceptance-test-openjdk11 failed with the following error.
>  
> {noformat}
> MicrometerBinderTest > processorMetricsBinderExists FAILED
> org.apache.geode.cache.client.ServerOperationException: remote server on 
> heavy-lifter-ceacbfa8-6147-51ca-affd-b497cd16e2ef(4420:loner):54545:7074b0d7: 
> Function named CheckIfMeterExistsFunction is not registered to FunctionService
> at 
> org.apache.geode.cache.client.internal.ExecuteFunctionOp$ExecuteFunctionOpImpl.processResponse(ExecuteFunctionOp.java:394)
> at 
> org.apache.geode.cache.client.internal.AbstractOp.processResponse(AbstractOp.java:234)
> at 
> org.apache.geode.cache.client.internal.AbstractOp.attemptReadResponse(AbstractOp.java:209)
> at 
> org.apache.geode.cache.client.internal.AbstractOp.attempt(AbstractOp.java:394)
> at 
> org.apache.geode.cache.client.internal.AbstractOpWithTimeout.attempt(AbstractOpWithTimeout.java:45)
> at 
> org.apache.geode.cache.client.internal.ConnectionImpl.execute(ConnectionImpl.java:284)
> at 
> org.apache.geode.cache.client.internal.pooling.PooledConnection.execute(PooledConnection.java:358)
> at 
> org.apache.geode.cache.client.internal.OpExecutorImpl.executeWithPossibleReAuthentication(OpExecutorImpl.java:760)
> at 
> org.apache.geode.cache.client.internal.OpExecutorImpl.execute(OpExecutorImpl.java:151)
> at 
> org.apache.geode.cache.client.internal.PoolImpl.execute(PoolImpl.java:820)
> at 
> org.apache.geode.cache.client.internal.ExecuteFunctionOp.execute(ExecuteFunctionOp.java:100)
> at 
> org.apache.geode.internal.cache.execute.ServerFunctionExecutor.executeOnServer(ServerFunctionExecutor.java:217)
> at 
> org.apache.geode.internal.cache.execute.ServerFunctionExecutor.executeFunction(ServerFunctionExecutor.java:104)
> at 
> org.apache.geode.internal.cache.execute.ServerFunctionExecutor.execute(ServerFunctionExecutor.java:368)
> at 
> org.apache.geode.internal.cache.execute.ServerFunctionExecutor.execute(ServerFunctionExecutor.java:377)
> at 
> org.apache.geode.metrics.MicrometerBinderTest.processorMetricsBinderExists(MicrometerBinderTest.java:152)
>  {noformat}



--
This message was sent by Atlassian Jira
(v8.20.1#820001)


[jira] [Updated] (GEODE-10077) wan-copy region command does not prevent callbacks to be executed on the remote site

2022-04-05 Thread Anilkumar Gingade (Jira)


 [ 
https://issues.apache.org/jira/browse/GEODE-10077?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Anilkumar Gingade updated GEODE-10077:
--
Labels: pull-request-available  (was: needsTriage pull-request-available)

> wan-copy region command does not prevent callbacks to be executed on the 
> remote site
> 
>
> Key: GEODE-10077
> URL: https://issues.apache.org/jira/browse/GEODE-10077
> Project: Geode
>  Issue Type: Bug
>  Components: gfsh, wan
>Reporter: Alberto Gomez
>Assignee: Alberto Gomez
>Priority: Major
>  Labels: pull-request-available
>
> The wan-copy region command does not always prevent that callbacks are 
> executed at the remote site for the entries copied.
> For example, if the remote site has more than one server and the gateway 
> receiver that gets the entries copied does not host the primary bucket of the 
> entry received, it will send the put operation to the server than hosts the 
> primary bucket and on this server, callbacks will be executed, provoking, for 
> example, that if a gateway sender is configured for the region, the event 
> will be sent by that gateway sender in the remote site.
> Also, if the remote site has more than one server and the region has at least 
> one replica (either because it is a replicated region or because it is a 
> partitioned region with number of replicas greater than 1) then servers in 
> the remote site that receive an UpdateMessage from the server that received 
> the copied entry via the gateway receiver, would execute the callbacks, 
> provoking, like in the above case that  if a gateway sender is configured for 
> the region, the event will be sent by that gateway sender in the remote site.



--
This message was sent by Atlassian Jira
(v8.20.1#820001)


[jira] [Updated] (GEODE-10187) PutAllGlobalDUnitTest > testputAllGlobalRemoteVM fails to receive expected TimeoutException

2022-04-05 Thread Anilkumar Gingade (Jira)


 [ 
https://issues.apache.org/jira/browse/GEODE-10187?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Anilkumar Gingade updated GEODE-10187:
--
Labels:   (was: needsTriage)

> PutAllGlobalDUnitTest > testputAllGlobalRemoteVM fails to receive expected 
> TimeoutException
> ---
>
> Key: GEODE-10187
> URL: https://issues.apache.org/jira/browse/GEODE-10187
> Project: Geode
>  Issue Type: Bug
>  Components: regions
>Affects Versions: 1.14.5
>Reporter: Bill Burcham
>Priority: Major
>
> Failed here: [https://hydradb.hdb.gemfire-ci.info/hdb/testresult/14277444]
> {noformat}
> > Task :geode-core:distributedTest
> org.apache.geode.internal.cache.PutAllGlobalDUnitTest > 
> testputAllGlobalRemoteVM FAILED
> java.lang.AssertionError: async2 failed
> at org.apache.geode.test.dunit.Assert.fail(Assert.java:66)
> at 
> org.apache.geode.internal.cache.PutAllGlobalDUnitTest.testputAllGlobalRemoteVM(PutAllGlobalDUnitTest.java:215)
> Caused by:
> java.lang.AssertionError: Should have thrown TimeoutException
> at org.junit.Assert.fail(Assert.java:89)
> at 
> org.apache.geode.internal.cache.PutAllGlobalDUnitTest$2.run2(PutAllGlobalDUnitTest.java:193)
> 8805 tests completed, 1 failed, 455 skipped
> =-=-=-=-=-=-=-=-=-=-=-=-=-=-=  Test Results URI 
> =-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=
> http://files.apachegeode-ci.info/builds/apache-support-1-14-main/1.14.5-build.0942/test-results/distributedTest/1648360227/
> =-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=
> Test report artifacts from this job are available at:
> http://files.apachegeode-ci.info/builds/apache-support-1-14-main/1.14.5-build.0942/test-artifacts/1648360227/distributedtestfiles-openjdk11-1.14.5-build.0942.tgz{noformat}



--
This message was sent by Atlassian Jira
(v8.20.1#820001)


[jira] [Updated] (GEODE-10155) ServerConnection thread hangs when client function execution times-out

2022-04-05 Thread Anilkumar Gingade (Jira)


 [ 
https://issues.apache.org/jira/browse/GEODE-10155?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Anilkumar Gingade updated GEODE-10155:
--
Labels: pull-request-available  (was: needsTriage pull-request-available)

> ServerConnection thread hangs when client function execution times-out
> --
>
> Key: GEODE-10155
> URL: https://issues.apache.org/jira/browse/GEODE-10155
> Project: Geode
>  Issue Type: Bug
>  Components: core, functions
>Reporter: Alberto Gomez
>Assignee: Alberto Gomez
>Priority: Major
>  Labels: pull-request-available
>
> If a Geode client executes a server function with a timeout
> and
> the function is handled in the Geode server by a "Function Execution 
> Processor" thread (for example by calling the function with onRegion() on a 
> partitioned region without a filter)
> and
> the function times-out before the server has answered back with all the 
> results
> then
> the ServerConnection thread that originally started to handle the function 
> execution will be stuck forever.
>  
> If this happens several times and the max-threads parameters is set to a 
> value greater than 0, the server will eventually run out of ServerConnection 
> threads and will not be able to process more client requests.
>  



--
This message was sent by Atlassian Jira
(v8.20.1#820001)


[jira] [Updated] (GEODE-10170) CI failure: DistributedSystemBridgeIntegrationTest > testPrepareErrorAbortsBackup

2022-04-05 Thread Anilkumar Gingade (Jira)


 [ 
https://issues.apache.org/jira/browse/GEODE-10170?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Anilkumar Gingade updated GEODE-10170:
--
Labels:   (was: needsTriage)

> CI failure: DistributedSystemBridgeIntegrationTest > 
> testPrepareErrorAbortsBackup
> -
>
> Key: GEODE-10170
> URL: https://issues.apache.org/jira/browse/GEODE-10170
> Project: Geode
>  Issue Type: Bug
>Reporter: Jianxia Chen
>Priority: Major
>
> {code:java}
> DistributedSystemBridgeIntegrationTest > testPrepareErrorAbortsBackup FAILED
>     java.lang.AssertionError: 
>     Expecting actual throwable to be an instance of:
>       java.lang.RuntimeException
>     but was:
>       java.lang.NoClassDefFoundError: Could not initialize class 
> org.apache.geode.internal.util.concurrent.StoppableCountDownLatch
>         at 
> org.apache.geode.distributed.internal.ReplyProcessor21.(ReplyProcessor21.java:333)
>         at 
> org.apache.geode.distributed.internal.ReplyProcessor21.(ReplyProcessor21.java:307)
>         at 
> org.apache.geode.distributed.internal.ReplyProcessor21.(ReplyProcessor21.java:264)
>         ...(83 remaining lines not displayed - this can be changed with 
> Assertions.setMaxStackTraceElementsDisplayed)
>         at 
> org.apache.geode.management.internal.beans.DistributedSystemBridgeIntegrationTest.testPrepareErrorAbortsBackup(DistributedSystemBridgeIntegrationTest.java:134)
>  {code}



--
This message was sent by Atlassian Jira
(v8.20.1#820001)


[jira] [Updated] (GEODE-10169) CI failure: DistributedSystemBridgeIntegrationTest > testSuccessfulBackup

2022-04-05 Thread Anilkumar Gingade (Jira)


 [ 
https://issues.apache.org/jira/browse/GEODE-10169?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Anilkumar Gingade updated GEODE-10169:
--
Labels:   (was: needsTriage)

> CI failure: DistributedSystemBridgeIntegrationTest > testSuccessfulBackup
> -
>
> Key: GEODE-10169
> URL: https://issues.apache.org/jira/browse/GEODE-10169
> Project: Geode
>  Issue Type: Bug
>Reporter: Jianxia Chen
>Priority: Major
>
> {code:java}
> DistributedSystemBridgeIntegrationTest > testSuccessfulBackup FAILED
> 17:57:56java.lang.ExceptionInInitializerError
> 17:57:56at 
> java.util.concurrent.ConcurrentHashMap.fullAddCount(ConcurrentHashMap.java:2526)
> 17:57:56at 
> java.util.concurrent.ConcurrentHashMap.addCount(ConcurrentHashMap.java:2266)
> 17:57:56at 
> java.util.concurrent.ConcurrentHashMap.replaceNode(ConcurrentHashMap.java:1166)
> 17:57:56at 
> java.util.concurrent.ConcurrentHashMap.remove(ConcurrentHashMap.java:1097)
> 17:57:56at 
> org.mockito.internal.util.concurrent.WeakConcurrentMap.expungeStaleEntries(WeakConcurrentMap.java:139)
> 17:57:56at 
> org.mockito.internal.util.concurrent.WeakConcurrentMap$WithInlinedExpunction.containsKey(WeakConcurrentMap.java:272)
> 17:57:56at 
> org.mockito.internal.creation.bytebuddy.MockMethodAdvice.isMock(MockMethodAdvice.java:169)
> 17:57:56at 
> org.mockito.internal.creation.bytebuddy.MockMethodAdvice.isMocked(MockMethodAdvice.java:174)
> 17:57:56at java.util.Properties.getProperty(Properties.java:969)
> 17:57:56at java.lang.System.getProperty(System.java:722)
> 17:57:56at java.lang.Long.getLong(Long.java:1208)
> 17:57:56at java.lang.Long.getLong(Long.java:1157)
> 17:57:56at 
> org.apache.geode.internal.util.concurrent.StoppableCountDownLatch.(StoppableCountDownLatch.java:36)
> 17:57:56at 
> org.apache.geode.distributed.internal.ReplyProcessor21.(ReplyProcessor21.java:333)
> 17:57:56at 
> org.apache.geode.distributed.internal.ReplyProcessor21.(ReplyProcessor21.java:307)
> 17:57:56at 
> org.apache.geode.distributed.internal.ReplyProcessor21.(ReplyProcessor21.java:264)
> 17:57:56at 
> org.apache.geode.distributed.internal.ReplyProcessor21.(ReplyProcessor21.java:249)
> 17:57:56at 
> org.apache.geode.internal.admin.remote.AdminMultipleReplyProcessor.(AdminMultipleReplyProcessor.java:32)
> 17:57:56at 
> org.apache.geode.internal.admin.remote.MissingPersistentIDsRequest$MissingPersistentIDProcessor.(MissingPersistentIDsRequest.java:115)
> 17:57:56at 
> org.apache.geode.internal.admin.remote.MissingPersistentIDsRequest$MissingPersistentIDProcessor.(MissingPersistentIDsRequest.java:108)
> 17:57:56at 
> org.apache.geode.internal.admin.remote.MissingPersistentIDsRequest.send(MissingPersistentIDsRequest.java:55)
> 17:57:56at 
> org.apache.geode.admin.internal.AdminDistributedSystemImpl.getMissingPersistentMembers(AdminDistributedSystemImpl.java:2226)
> 17:57:56at 
> org.apache.geode.internal.cache.backup.BackupOperation$DefaultMissingPersistentMembersProvider.getMissingPersistentMembers(BackupOperation.java:147)
> 17:57:56at 
> org.apache.geode.internal.cache.backup.BackupOperation.performBackupUnderLock(BackupOperation.java:89)
> 17:57:56at 
> org.apache.geode.internal.cache.backup.BackupOperation.performBackup(BackupOperation.java:77)
> 17:57:56at 
> org.apache.geode.internal.cache.backup.BackupOperation.backupAllMembers(BackupOperation.java:71)
> 17:57:56at 
> org.apache.geode.management.internal.beans.DistributedSystemBridge.backupAllMembers(DistributedSystemBridge.java:489)
> 17:57:56at 
> org.apache.geode.management.internal.beans.DistributedSystemBridgeIntegrationTest.testSuccessfulBackup(DistributedSystemBridgeIntegrationTest.java:113)
> 17:57:56
> 17:57:56Caused by:
> 17:57:56java.lang.NullPointerException
> 17:57:56at 
> java.util.concurrent.ThreadLocalRandom.getProbe(ThreadLocalRandom.java:980)
> 17:57:56at 
> java.util.concurrent.ConcurrentHashMap.fullAddCount(ConcurrentHashMap.java:2526)
> 17:57:56at 
> java.util.concurrent.ConcurrentHashMap.addCount(ConcurrentHashMap.java:2266)
> 17:57:56at 
> java.util.concurrent.ConcurrentHashMap.replaceNode(ConcurrentHashMap.java:1166)
> 17:57:56at 
> java.util.concurrent.ConcurrentHashMap.remove(ConcurrentHashMap.java:1097)
> 17:57:56at 
> org.mockito.internal.util.concurrent.WeakConcurrentMap.expungeStaleEntries(WeakConcurrentMap.java:139)
> 17:57:56at 
> org.mockito.internal.util.concurrent.WeakConcurrentMap$WithInlinedExpunction.containsKey(WeakConcurrentMap.java:272)
> 17:57:56at 
> 

[jira] [Updated] (GEODE-10168) CI failure: RestAPICompatibilityTest > restCommandExecutedOnLatestLocatorShouldBeBackwardsCompatible

2022-04-05 Thread Anilkumar Gingade (Jira)


 [ 
https://issues.apache.org/jira/browse/GEODE-10168?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Anilkumar Gingade updated GEODE-10168:
--
Labels:   (was: needsTriage)

> CI failure: RestAPICompatibilityTest > 
> restCommandExecutedOnLatestLocatorShouldBeBackwardsCompatible
> 
>
> Key: GEODE-10168
> URL: https://issues.apache.org/jira/browse/GEODE-10168
> Project: Geode
>  Issue Type: Bug
>Reporter: Jianxia Chen
>Priority: Major
>
> {code:java}
> RestAPICompatibilityTest > 
> restCommandExecutedOnLatestLocatorShouldBeBackwardsCompatible[1.13.0] FAILED
> 13:12:20java.lang.AssertionError: Suspicious strings were written to the 
> log during this run.
> 13:12:20Fix the strings or use IgnoredException.addIgnoredException to 
> ignore.
> 13:12:20
> ---
> 13:12:20Found suspect string in 'dunit_suspect-vm1.log' at line 511
> 13:12:20
> 13:12:20[fatal 2022/03/24 20:11:53.071 UTC  receiver,heavy-lifter-8ca77d83-23cd-5773-ab23-a0729142416f-17781> tid=55] 
> Membership service failure: Member isn't responding to heartbeat requests
> 13:12:20
> org.apache.geode.distributed.internal.membership.api.MemberDisconnectedException:
>  Member isn't responding to heartbeat requests
> 13:12:20  at 
> org.apache.geode.distributed.internal.membership.gms.GMSMembership$ManagerImpl.forceDisconnect(GMSMembership.java:2012)
> 13:12:20  at 
> org.apache.geode.distributed.internal.membership.gms.membership.GMSJoinLeave.forceDisconnect(GMSJoinLeave.java:1085)
> 13:12:20  at 
> org.apache.geode.distributed.internal.membership.gms.membership.GMSJoinLeave.processMessage(GMSJoinLeave.java:688)
> 13:12:20  at 
> org.apache.geode.distributed.internal.membership.gms.messenger.JGroupsMessenger$JGroupsReceiver.receive(JGroupsMessenger.java:1331)
> 13:12:20  at 
> org.apache.geode.distributed.internal.membership.gms.messenger.JGroupsMessenger$JGroupsReceiver.receive(JGroupsMessenger.java:1267)
> 13:12:20  at org.jgroups.JChannel.invokeCallback(JChannel.java:816)
> 13:12:20  at org.jgroups.JChannel.up(JChannel.java:741)
> 13:12:20  at org.jgroups.stack.ProtocolStack.up(ProtocolStack.java:1030)
> 13:12:20  at org.jgroups.protocols.FRAG2.up(FRAG2.java:165)
> 13:12:20  at org.jgroups.protocols.FlowControl.up(FlowControl.java:390)
> 13:12:20  at 
> org.jgroups.protocols.UNICAST3.deliverMessage(UNICAST3.java:1077)
> 13:12:20  at 
> org.jgroups.protocols.UNICAST3.handleDataReceived(UNICAST3.java:792)
> 13:12:20  at org.jgroups.protocols.UNICAST3.up(UNICAST3.java:433)
> 13:12:20  at 
> org.apache.geode.distributed.internal.membership.gms.messenger.StatRecorder.up(StatRecorder.java:72)
> 13:12:20  at 
> org.apache.geode.distributed.internal.membership.gms.messenger.AddressManager.up(AddressManager.java:70)
> 13:12:20  at org.jgroups.protocols.TP.passMessageUp(TP.java:1658)
> 13:12:20  at 
> org.jgroups.protocols.TP$SingleMessageHandler.run(TP.java:1876)
> 13:12:20  at 
> org.jgroups.util.DirectExecutor.execute(DirectExecutor.java:10)
> 13:12:20  at org.jgroups.protocols.TP.handleSingleMessage(TP.java:1789)
> 13:12:20  at org.jgroups.protocols.TP.receive(TP.java:1714)
> 13:12:20  at 
> org.apache.geode.distributed.internal.membership.gms.messenger.Transport.receive(Transport.java:159)
> 13:12:20  at org.jgroups.protocols.UDP$PacketReceiver.run(UDP.java:701)
> 13:12:20  at java.base/java.lang.Thread.run(Thread.java:829)
> 13:12:20at org.junit.Assert.fail(Assert.java:89)
> 13:12:20at 
> org.apache.geode.test.dunit.internal.DUnitLauncher.closeAndCheckForSuspects(DUnitLauncher.java:422)
> 13:12:20at 
> org.apache.geode.test.dunit.internal.DUnitLauncher.closeAndCheckForSuspects(DUnitLauncher.java:438)
> 13:12:20at 
> org.apache.geode.test.dunit.rules.ClusterStartupRule.after(ClusterStartupRule.java:183)
> 13:12:20at 
> org.apache.geode.test.dunit.rules.ClusterStartupRule.access$100(ClusterStartupRule.java:70)
> 13:12:20at 
> org.apache.geode.test.dunit.rules.ClusterStartupRule$1.evaluate(ClusterStartupRule.java:141)
> 13:12:20at 
> org.apache.geode.test.junit.rules.DescribedExternalResource$1.evaluate(DescribedExternalResource.java:40)
> 13:12:20at 
> org.junit.runners.ParentRunner$3.evaluate(ParentRunner.java:306)
> 13:12:20at 
> org.junit.runners.BlockJUnit4ClassRunner$1.evaluate(BlockJUnit4ClassRunner.java:100)
> 13:12:20at 
> org.junit.runners.ParentRunner.runLeaf(ParentRunner.java:366)
> 13:12:20at 
> org.junit.runners.BlockJUnit4ClassRunner.runChild(BlockJUnit4ClassRunner.java:103)
> 13:12:20at 
> 

[jira] [Updated] (GEODE-10167) CI failure: RollingUpgradeQueryReturnsCorrectResultAfterTwoLocatorsWithTwoServersAreRolled > luceneQueryReturnsCorrectResultAfterTwoLocatorsWithTwoServersAreRolled

2022-04-05 Thread Anilkumar Gingade (Jira)


 [ 
https://issues.apache.org/jira/browse/GEODE-10167?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Anilkumar Gingade updated GEODE-10167:
--
Labels:   (was: needsTriage)

> CI failure: 
> RollingUpgradeQueryReturnsCorrectResultAfterTwoLocatorsWithTwoServersAreRolled
>  > luceneQueryReturnsCorrectResultAfterTwoLocatorsWithTwoServersAreRolled
> ---
>
> Key: GEODE-10167
> URL: https://issues.apache.org/jira/browse/GEODE-10167
> Project: Geode
>  Issue Type: Bug
>Reporter: Jianxia Chen
>Priority: Major
>
> {code:java}
> RollingUpgradeQueryReturnsCorrectResultAfterTwoLocatorsWithTwoServersAreRolled
>  > 
> luceneQueryReturnsCorrectResultAfterTwoLocatorsWithTwoServersAreRolled[from_v1.8.0,
>  with reindex=false, singleHopEnabled=true] FAILED
> 00:24:01org.gradle.internal.exceptions.DefaultMultiCauseException: 
> Multiple Failures (2 failures)
> 00:24:01  org.apache.geode.test.dunit.RMIException: While invoking 
> org.apache.geode.cache.lucene.LuceneSearchWithRollingUpgradeTestBase$$Lambda$422/0x000100379c40.run
>  in VM 2 running on Host 
> heavy-lifter-138290eb-3fbe-5801-8565-60d3c24284e9.c.apachegeode-ci.internal 
> with 4 VMs
> 00:24:01  java.lang.AssertionError: Suspicious strings were written to 
> the log during this run.
> 00:24:01Fix the strings or use IgnoredException.addIgnoredException to 
> ignore.
> 00:24:01
> ---
> 00:24:01Found suspect string in 'dunit_suspect-vm2.log' at line 3243
> 00:24:01
> 00:24:01[fatal 2022/03/25 07:23:55.137 UTC  receiver,heavy-lifter-138290eb-3fbe-5801-8565-60d3c24284e9-23002> tid=49] 
> Membership service failure: Member isn't responding to heartbeat requests
> 00:24:01
> org.apache.geode.distributed.internal.membership.api.MemberDisconnectedException:
>  Member isn't responding to heartbeat requests
> 00:24:01  at 
> org.apache.geode.distributed.internal.membership.gms.GMSMembership$ManagerImpl.forceDisconnect(GMSMembership.java:1806)
> 00:24:01  at 
> org.apache.geode.distributed.internal.membership.gms.membership.GMSJoinLeave.forceDisconnect(GMSJoinLeave.java:1120)
> 00:24:01  at 
> org.apache.geode.distributed.internal.membership.gms.membership.GMSJoinLeave.processRemoveMemberMessage(GMSJoinLeave.java:723)
> 00:24:01  at 
> org.apache.geode.distributed.internal.membership.gms.messenger.JGroupsMessenger$JGroupsReceiver.receive(JGroupsMessenger.java:1367)
> 00:24:01  at 
> org.apache.geode.distributed.internal.membership.gms.messenger.JGroupsMessenger$JGroupsReceiver.receive(JGroupsMessenger.java:1303)
> 00:24:01  at org.jgroups.JChannel.invokeCallback(JChannel.java:816)
> 00:24:01  at org.jgroups.JChannel.up(JChannel.java:741)
> 00:24:01  at org.jgroups.stack.ProtocolStack.up(ProtocolStack.java:1030)
> 00:24:01  at org.jgroups.protocols.FRAG2.up(FRAG2.java:165)
> 00:24:01  at org.jgroups.protocols.FlowControl.up(FlowControl.java:390)
> 00:24:01  at 
> org.jgroups.protocols.UNICAST3.deliverMessage(UNICAST3.java:1077)
> 00:24:01  at 
> org.jgroups.protocols.UNICAST3.handleDataReceived(UNICAST3.java:792)
> 00:24:01  at org.jgroups.protocols.UNICAST3.up(UNICAST3.java:433)
> 00:24:01  at 
> org.apache.geode.distributed.internal.membership.gms.messenger.StatRecorder.up(StatRecorder.java:72)
> 00:24:01  at 
> org.apache.geode.distributed.internal.membership.gms.messenger.AddressManager.up(AddressManager.java:70)
> 00:24:01  at org.jgroups.protocols.TP.passMessageUp(TP.java:1658)
> 00:24:01  at 
> org.jgroups.protocols.TP$SingleMessageHandler.run(TP.java:1876)
> 00:24:01  at 
> org.jgroups.util.DirectExecutor.execute(DirectExecutor.java:10)
> 00:24:01  at org.jgroups.protocols.TP.handleSingleMessage(TP.java:1789)
> 00:24:01  at org.jgroups.protocols.TP.receive(TP.java:1714)
> 00:24:01  at 
> org.apache.geode.distributed.internal.membership.gms.messenger.Transport.receive(Transport.java:160)
> 00:24:01  at org.jgroups.protocols.UDP$PacketReceiver.run(UDP.java:701)
> 00:24:01  at java.base/java.lang.Thread.run(Thread.java:829)
> 00:24:01at 
> org.junit.vintage.engine.execution.TestRun.getStoredResultOrSuccessful(TestRun.java:196)
> 00:24:01at 
> org.junit.vintage.engine.execution.RunListenerAdapter.fireExecutionFinished(RunListenerAdapter.java:226)
> 00:24:01at 
> org.junit.vintage.engine.execution.RunListenerAdapter.testFinished(RunListenerAdapter.java:192)
> 00:24:01at 
> org.junit.vintage.engine.execution.RunListenerAdapter.testFinished(RunListenerAdapter.java:79)
> 00:24:01at 
> 

[jira] [Updated] (GEODE-10176) CI Failure: PubSubDUnitTest > shouldContinueToFunction_whenOneServerShutsDownGracefully_givenTwoSubscribersOnePublisher

2022-04-05 Thread Anilkumar Gingade (Jira)


 [ 
https://issues.apache.org/jira/browse/GEODE-10176?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Anilkumar Gingade updated GEODE-10176:
--
Labels: redis-triage  (was: needsTriage)

> CI Failure: PubSubDUnitTest > 
> shouldContinueToFunction_whenOneServerShutsDownGracefully_givenTwoSubscribersOnePublisher
> ---
>
> Key: GEODE-10176
> URL: https://issues.apache.org/jira/browse/GEODE-10176
> Project: Geode
>  Issue Type: Bug
>Reporter: Jianxia Chen
>Priority: Major
>  Labels: redis-triage
>
> {code:java}
> org.apache.geode.redis.internal.executor.pubsub.PubSubDUnitTest > 
> shouldContinueToFunction_whenOneServerShutsDownGracefully_givenTwoSubscribersOnePublisher
>  FAILED
> 15:29:54org.junit.ComparisonFailure: [channel subscription was not 
> received] 
> 15:29:54Expecting value to be true but was false expected:<[tru]e> but 
> was:<[fals]e>
> 15:29:54at 
> sun.reflect.NativeConstructorAccessorImpl.newInstance0(Native Method)
> 15:29:54at 
> sun.reflect.NativeConstructorAccessorImpl.newInstance(NativeConstructorAccessorImpl.java:62)
> 15:29:54at 
> sun.reflect.DelegatingConstructorAccessorImpl.newInstance(DelegatingConstructorAccessorImpl.java:45)
> 15:29:54at 
> org.apache.geode.redis.internal.executor.pubsub.PubSubDUnitTest.shouldContinueToFunction_whenOneServerShutsDownGracefully_givenTwoSubscribersOnePublisher(PubSubDUnitTest.java:225)
>  {code}
> =-=-=-=-=-=-=-=-=-=-=-=-=-=-=  Test Results URI 
> =-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=
> http://files.apachegeode-ci.info/builds/apache-support-1-14-main/1.14.5-build.0938/test-results/distributedTest/1648080229/
> =-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=
> Test report artifacts from this job are available at:
> http://files.apachegeode-ci.info/builds/apache-support-1-14-main/1.14.5-build.0938/test-artifacts/1648080229/distributedtestfiles-openjdk8-1.14.5-build.0938.tgz



--
This message was sent by Atlassian Jira
(v8.20.1#820001)


[jira] [Updated] (GEODE-10177) CI Failure: PubSubDUnitTest > shouldContinueToFunction_whenOneServerShutsDownGracefully_givenTwoSubscribersTwoPublishers

2022-04-05 Thread Anilkumar Gingade (Jira)


 [ 
https://issues.apache.org/jira/browse/GEODE-10177?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Anilkumar Gingade updated GEODE-10177:
--
Labels: redis-triage  (was: needsTriage)

> CI Failure: PubSubDUnitTest > 
> shouldContinueToFunction_whenOneServerShutsDownGracefully_givenTwoSubscribersTwoPublishers
> 
>
> Key: GEODE-10177
> URL: https://issues.apache.org/jira/browse/GEODE-10177
> Project: Geode
>  Issue Type: Bug
>Reporter: Jianxia Chen
>Priority: Major
>  Labels: redis-triage
>
>  
> {code:java}
> org.apache.geode.redis.internal.executor.pubsub.PubSubDUnitTest > 
> shouldContinueToFunction_whenOneServerShutsDownGracefully_givenTwoSubscribersTwoPublishers
>  FAILED
> 15:35:00org.awaitility.core.ConditionTimeoutException: Assertion 
> condition defined as a lambda expression in 
> org.apache.geode.redis.internal.executor.pubsub.PubSubDUnitTest that uses 
> java.util.concurrent.Future null within 5 minutes.
> 15:35:00at 
> org.awaitility.core.ConditionAwaiter.await(ConditionAwaiter.java:165)
> 15:35:00at 
> org.awaitility.core.AssertionCondition.await(AssertionCondition.java:119)
> 15:35:00at 
> org.awaitility.core.AssertionCondition.await(AssertionCondition.java:31)
> 15:35:00at 
> org.awaitility.core.ConditionFactory.until(ConditionFactory.java:895)
> 15:35:00at 
> org.awaitility.core.ConditionFactory.untilAsserted(ConditionFactory.java:679)
> 15:35:00at 
> org.apache.geode.redis.internal.executor.pubsub.PubSubDUnitTest.shouldContinueToFunction_whenOneServerShutsDownGracefully_givenTwoSubscribersTwoPublishers(PubSubDUnitTest.java:319)
> 15:35:00
> 15:35:00Caused by:
> 15:35:00java.util.concurrent.TimeoutException
> 15:35:00at 
> java.util.concurrent.FutureTask.get(FutureTask.java:205)
> 15:35:00at 
> org.awaitility.core.Uninterruptibles.getUninterruptibly(Uninterruptibles.java:101)
> 15:35:00at 
> org.awaitility.core.Uninterruptibles.getUninterruptibly(Uninterruptibles.java:81)
> 15:35:00at 
> org.awaitility.core.ConditionAwaiter.await(ConditionAwaiter.java:101)
> 15:35:00... 5 more {code}
>  
>  



--
This message was sent by Atlassian Jira
(v8.20.1#820001)


[jira] [Updated] (GEODE-10175) CI Failure: PubSubDUnitTest > shouldContinueToFunction_whenOneServerShutsDownAbruptly_givenTwoSubscribersOnePublisher

2022-04-05 Thread Anilkumar Gingade (Jira)


 [ 
https://issues.apache.org/jira/browse/GEODE-10175?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Anilkumar Gingade updated GEODE-10175:
--
Labels: redis-triage  (was: needsTriage)

> CI Failure: PubSubDUnitTest > 
> shouldContinueToFunction_whenOneServerShutsDownAbruptly_givenTwoSubscribersOnePublisher
> -
>
> Key: GEODE-10175
> URL: https://issues.apache.org/jira/browse/GEODE-10175
> Project: Geode
>  Issue Type: Bug
>Reporter: Jianxia Chen
>Priority: Major
>  Labels: redis-triage
>
> {code:java}
> org.apache.geode.redis.internal.executor.pubsub.PubSubDUnitTest > 
> shouldContinueToFunction_whenOneServerShutsDownAbruptly_givenTwoSubscribersOnePublisher
>  FAILED
> 15:29:24org.junit.ComparisonFailure: [channel subscription was not 
> received] 
> 15:29:24Expecting value to be true but was false expected:<[tru]e> but 
> was:<[fals]e>
> 15:29:24at 
> sun.reflect.NativeConstructorAccessorImpl.newInstance0(Native Method)
> 15:29:24at 
> sun.reflect.NativeConstructorAccessorImpl.newInstance(NativeConstructorAccessorImpl.java:62)
> 15:29:24at 
> sun.reflect.DelegatingConstructorAccessorImpl.newInstance(DelegatingConstructorAccessorImpl.java:45)
> 15:29:24at 
> org.apache.geode.redis.internal.executor.pubsub.PubSubDUnitTest.shouldContinueToFunction_whenOneServerShutsDownAbruptly_givenTwoSubscribersOnePublisher(PubSubDUnitTest.java:255)
>  {code}
> =-=-=-=-=-=-=-=-=-=-=-=-=-=-=  Test Results URI 
> =-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=
> http://files.apachegeode-ci.info/builds/apache-support-1-14-main/1.14.5-build.0938/test-results/distributedTest/1648080229/
> =-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=
> Test report artifacts from this job are available at:
> http://files.apachegeode-ci.info/builds/apache-support-1-14-main/1.14.5-build.0938/test-artifacts/1648080229/distributedtestfiles-openjdk8-1.14.5-build.0938.tgz



--
This message was sent by Atlassian Jira
(v8.20.1#820001)


[jira] [Updated] (GEODE-9664) Two different clients with the same durable id will both connect to the servers and receive messages

2022-04-05 Thread Anilkumar Gingade (Jira)


 [ 
https://issues.apache.org/jira/browse/GEODE-9664?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Anilkumar Gingade updated GEODE-9664:
-
Labels:   (was: needsTriage)

> Two different clients with the same durable id will both connect to the 
> servers and receive messages
> 
>
> Key: GEODE-9664
> URL: https://issues.apache.org/jira/browse/GEODE-9664
> Project: Geode
>  Issue Type: Bug
>  Components: client queues
>Reporter: Barrett Oglesby
>Priority: Major
>
> There are two cases:
>  # The number of queues is the same as the number of servers (e.g. client 
> with subscription-redundancy=1 and 2 servers)
>  # The number of queues is less than the number of servers (e.g. client with 
> subscription-redundancy=0 and 2 servers)
> h2. Case 1
>  In this case, the client first attempts to connect to the primary and fails.
> {noformat}
> [warn 2021/10/01 14:37:56.209 PDT server-1  Thread 1> tid=0x4b] XXX CacheClientNotifier.registerClientInternal about to 
> register 
> clientProxyMembershipID=identity(127.0.0.1(client-a-2:89832:loner):61596:fad3ca3d:client-a-2,connection=2,durableAttributes=DurableClientAttributes[id=client-a;
>  timeout=300])
> [warn 2021/10/01 14:37:56.209 PDT server-1  Thread 1> tid=0x4b] XXX CacheClientNotifier.registerClientInternal existing 
> proxy=CacheClientProxy[identity(127.0.0.1(client-a-1:89806:loner):61573:10a9ca3d:client-a-1,connection=2,durableAttributes=DurableClientAttributes[id=client-a;
>  timeout=300]); port=61581; primary=true; version=GEODE 1.15.0]
> [warn 2021/10/01 14:37:56.210 PDT server-1  Thread 1> tid=0x4b] XXX CacheClientNotifier.registerClientInternal existing 
> proxy isPaused=false
> [warn 2021/10/01 14:37:56.210 PDT server-1  Thread 1> tid=0x4b] The requested durable client has the same identifier ( 
> client-a ) as an existing durable client ( 
> CacheClientProxy[identity(127.0.0.1(client-a-1:89806:loner):61573:10a9ca3d:client-a-1,connection=2,durableAttributes=DurableClientAttributes[id=client-a;
>  timeout=300]); port=61581; primary=true; version=GEODE 1.15.0] ). Duplicate 
> durable clients are not allowed.
> [warn 2021/10/01 14:37:56.210 PDT server-1  Thread 1> tid=0x4b] CacheClientNotifier: Unsuccessfully registered client 
> with identifier 
> identity(127.0.0.1(client-a-2:89832:loner):61596:fad3ca3d:client-a-2,connection=2,durableAttributes=DurableClientAttributes[id=client-a;
>  timeout=300]) and response code 64
> {noformat}
> It then attempts to connect to the secondary and succeeds.
> {noformat}
> [warn 2021/10/01 14:37:56.215 PDT server-2  Thread 1> tid=0x47] XXX CacheClientNotifier.registerClientInternal about to 
> register 
> clientProxyMembershipID=identity(127.0.0.1(client-a-2:89832:loner):61596:fad3ca3d:client-a-2,connection=2,durableAttributes=DurableClientAttributes[id=client-a;
>  timeout=300])
> [warn 2021/10/01 14:37:56.215 PDT server-2  Thread 1> tid=0x47] XXX CacheClientNotifier.registerClientInternal existing 
> proxy=CacheClientProxy[identity(127.0.0.1(client-a-1:89806:loner):61573:10a9ca3d:client-a-1,connection=2,durableAttributes=DurableClientAttributes[id=client-a;
>  timeout=300]); port=61578; primary=false; version=GEODE 1.15.0]
> [warn 2021/10/01 14:37:56.216 PDT server-2  Thread 1> tid=0x47] XXX CacheClientNotifier.registerClientInternal existing 
> proxy isPaused=true
> [warn 2021/10/01 14:37:56.217 PDT server-2  Thread 1> tid=0x47] XXX CacheClientNotifier.registerClientInternal 
> reinitialized existing 
> proxy=CacheClientProxy[identity(127.0.0.1(client-a-1:89806:loner):61573:10a9ca3d:client-a-1,connection=2,durableAttributes=DurableClientAttributes[id=client-a;
>  timeout=300]); port=61578; primary=true; version=GEODE 1.15.0]
> {noformat}
> The previous secondary is reinitialized and made into a primary. Both queues 
> will dispatch events.
> The CacheClientNotifier.registerClientInternal method invoked when a client 
> connects does:
> {noformat}
> if (cacheClientProxy.isPaused()) {
>   ...
>   cacheClientProxy.reinitialize(...);
> } else {
>   unsuccessfulMsg = String.format("The requested durable client has the same 
> identifier ( %s ) as an existing durable client...);
>   logger.warn(unsuccessfulMsg);
> }
> {noformat}
> The CacheClientProxy is paused when the durable client it represents has 
> disconnected. Unfortunately, a secondary CacheClientProxy is also paused. So, 
> this check is not good enough to prevent a duplicate durable client from 
> connecting.
> There are a few things that can also be checked. One of them is:
> {noformat}
> cacheClientProxy.getCommBuffer() == null
> {noformat}
> With that check added, when the client attempts to connect to the secondary, 
> it fails just like the it does with the primary.
> The client then exits with this exception:

[jira] [Updated] (GEODE-10174) CI Failure: PubSubDUnitTest > testConcurrentPubSub

2022-04-05 Thread Anilkumar Gingade (Jira)


 [ 
https://issues.apache.org/jira/browse/GEODE-10174?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Anilkumar Gingade updated GEODE-10174:
--
Labels: redis-triage  (was: needsTriage)

> CI Failure: PubSubDUnitTest > testConcurrentPubSub
> --
>
> Key: GEODE-10174
> URL: https://issues.apache.org/jira/browse/GEODE-10174
> Project: Geode
>  Issue Type: Bug
>Reporter: Jianxia Chen
>Priority: Major
>  Labels: redis-triage
>
> {code:java}
> org.apache.geode.redis.internal.executor.pubsub.PubSubDUnitTest > 
> testConcurrentPubSub FAILED
> 15:28:39java.util.concurrent.ExecutionException: 
> redis.clients.jedis.exceptions.JedisConnectionException: 
> java.net.SocketTimeoutException: Read timed out
> 15:28:39
> 15:28:39Caused by:
> 15:28:39redis.clients.jedis.exceptions.JedisConnectionException: 
> java.net.SocketTimeoutException: Read timed out
> 15:28:39at 
> redis.clients.jedis.util.RedisInputStream.ensureFill(RedisInputStream.java:205)
> 15:28:39at 
> redis.clients.jedis.util.RedisInputStream.readByte(RedisInputStream.java:43)
> 15:28:39at redis.clients.jedis.Protocol.process(Protocol.java:155)
> 15:28:39at redis.clients.jedis.Protocol.read(Protocol.java:220)
> 15:28:39at 
> redis.clients.jedis.Connection.readProtocolWithCheckingBroken(Connection.java:283)
> 15:28:39at 
> redis.clients.jedis.Connection.getIntegerReply(Connection.java:225)
> 15:28:39at redis.clients.jedis.Jedis.publish(Jedis.java:2908)
> 15:28:39at 
> org.apache.geode.redis.internal.executor.pubsub.PubSubDUnitTest.lambda$testConcurrentPubSub$16(PubSubDUnitTest.java:373)
> 15:28:39
> 15:28:39Caused by:
> 15:28:39java.net.SocketTimeoutException: Read timed out
> 15:28:39at java.net.SocketInputStream.socketRead0(Native 
> Method)
> 15:28:39at 
> java.net.SocketInputStream.socketRead(SocketInputStream.java:116)
> 15:28:39at 
> java.net.SocketInputStream.read(SocketInputStream.java:171)
> 15:28:39at 
> java.net.SocketInputStream.read(SocketInputStream.java:141)
> 15:28:39at 
> java.net.SocketInputStream.read(SocketInputStream.java:127)
> 15:28:39at 
> redis.clients.jedis.util.RedisInputStream.ensureFill(RedisInputStream.java:199)
> 15:28:39... 7 more
> 15:29:24 {code}



--
This message was sent by Atlassian Jira
(v8.20.1#820001)


[jira] [Updated] (GEODE-9744) bug like CVE-2020-8908

2022-04-05 Thread Anilkumar Gingade (Jira)


 [ 
https://issues.apache.org/jira/browse/GEODE-9744?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Anilkumar Gingade updated GEODE-9744:
-
Labels: pull-request-available  (was: needsTriage pull-request-available)

> bug like CVE-2020-8908
> --
>
> Key: GEODE-9744
> URL: https://issues.apache.org/jira/browse/GEODE-9744
> Project: Geode
>  Issue Type: Bug
>Reporter: lujie
>Priority: Major
>  Labels: pull-request-available
>
> see [https://www.cvedetails.com/cve/CVE-2020-8908/]
> A temp directory creation vulnerability exists in all versions of Guava, 
> allowing an attacker with access to the machine to potentially access data in 
> a temporary directory created by the Guava API 
> com.google.common.io.Files.createTempDir(). By default, on unix-like systems, 
> the created directory is world-readable (readable by an attacker with access 
> to the system). The method in question has been marked 
> [@deprecated|https://github.com/deprecated] in versions 30.0 and later and 
> should not be used. For Android developers, we recommend choosing a temporary 
> directory API provided by Android, such as context.getCacheDir(). For other 
> Java developers, we recommend migrating to the Java 7 API 
> java.nio.file.Files.createTempDirectory() which explicitly configures 
> permissions of 700, or configuring the Java runtime's java.io.tmpdir system 
> property to point to a location whose permissions are appropriately 
> configured.



--
This message was sent by Atlassian Jira
(v8.20.1#820001)


[jira] [Updated] (GEODE-10081) Freeze during launch of locator

2022-04-05 Thread Anilkumar Gingade (Jira)


 [ 
https://issues.apache.org/jira/browse/GEODE-10081?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Anilkumar Gingade updated GEODE-10081:
--
Labels:   (was: needsTriage)

> Freeze during launch of locator
> ---
>
> Key: GEODE-10081
> URL: https://issues.apache.org/jira/browse/GEODE-10081
> Project: Geode
>  Issue Type: Bug
>  Components: gfsh
>Affects Versions: 1.14.2
>Reporter: Julien Gilles
>Priority: Major
> Attachments: jstackout.txt
>
>
> We are using gfsh with a script to automatically launch a locator and a 
> server.
> During the launch of the locator, the process gfsh just freezes randomly. The 
> locator process is forked, but gfsh remains frozen.
> A jstack on the process shows that the process is stuck on an internal 
> NativeLibrary.load call. This one happens when gfsh try to attach to the 
> process : internally the jvm loads the libattach.so library, but the call 
> never return.
> To be complete, Geode is here deployed inside a docker container, using ubi8 
> as base image, java is jdk11.0.11 - we have tested with 11.0.13, same issue.
>  
> The stack is the following (full jstack dump attached)
> "main" #1 prio=5 os_prio=0 cpu=2402.78ms elapsed=1924.45s allocated=236M 
> defined_classes=10346 tid=0x7fc904024000 nid=0x16 runnable  
> [0x7fc90a84a000]
>    java.lang.Thread.State: RUNNABLE
>     at java.lang.ClassLoader$NativeLibrary.load0(java.base@11.0.11/Native 
> Method)
>     at java.lang.ClassLoader$NativeLibrary.load(java.base@11.0.11/Unknown 
> Source)
>     at 
> java.lang.ClassLoader$NativeLibrary.loadLibrary(java.base@11.0.11/Unknown 
> Source)
>     - locked <0x823efe60> (a java.util.HashSet)
>     at java.lang.ClassLoader.loadLibrary0(java.base@11.0.11/Unknown Source)
>     at java.lang.ClassLoader.loadLibrary(java.base@11.0.11/Unknown Source)
>     at java.lang.Runtime.loadLibrary0(java.base@11.0.11/Unknown Source)
>     - locked <0x828224b8> (a java.lang.Runtime)
>     at java.lang.System.loadLibrary(java.base@11.0.11/Unknown Source)
>     at 
> sun.tools.attach.VirtualMachineImpl.(jdk.attach@11.0.11/Unknown 
> Source)
>     at 
> sun.tools.attach.AttachProviderImpl.attachVirtualMachine(jdk.attach@11.0.11/Unknown
>  Source)
>     at com.sun.tools.attach.VirtualMachine.attach(jdk.attach@11.0.11/Unknown 
> Source)
>     at 
> org.apache.geode.internal.process.MBeanProcessController.getJMXServiceURL(MBeanProcessController.java:227)
>     at 
> org.apache.geode.internal.process.MBeanProcessController.connect(MBeanProcessController.java:192)
>     at 
> org.apache.geode.internal.process.MBeanProcessController.invokeOperationOnTargetMBean(MBeanProcessController.java:159)
>     at 
> org.apache.geode.internal.process.MBeanProcessController.status(MBeanProcessController.java:136)
>     at 
> org.apache.geode.internal.process.MBeanProcessController.status(MBeanProcessController.java:81)
>     at 
> org.apache.geode.internal.process.MBeanOrFileProcessController.status(MBeanOrFileProcessController.java:37)
>     at 
> org.apache.geode.distributed.LocatorLauncher.statusWithWorkingDirectory(LocatorLauncher.java:1012)
>     at 
> org.apache.geode.distributed.LocatorLauncher.status(LocatorLauncher.java:940)
>     at 
> org.apache.geode.distributed.LocatorLauncher$LocatorState.fromDirectory(LocatorLauncher.java:2077)
>     at 
> org.apache.geode.management.internal.cli.commands.StartLocatorCommand.doStartLocator(StartLocatorCommand.java:267)
>     at 
> org.apache.geode.management.internal.cli.commands.StartLocatorCommand.startLocator(StartLocatorCommand.java:133)
>     at 
> jdk.internal.reflect.NativeMethodAccessorImpl.invoke0(java.base@11.0.11/Native
>  Method)
>     at 
> jdk.internal.reflect.NativeMethodAccessorImpl.invoke(java.base@11.0.11/Unknown
>  Source)
>     at 
> jdk.internal.reflect.DelegatingMethodAccessorImpl.invoke(java.base@11.0.11/Unknown
>  Source)
>     at java.lang.reflect.Method.invoke(java.base@11.0.11/Unknown Source)
>     at 
> org.springframework.util.ReflectionUtils.invokeMethod(ReflectionUtils.java:282)
>     at 
> org.apache.geode.management.internal.cli.remote.CommandExecutor.callInvokeMethod(CommandExecutor.java:139)
>     at 
> org.apache.geode.management.internal.cli.remote.CommandExecutor.invokeCommand(CommandExecutor.java:149)



--
This message was sent by Atlassian Jira
(v8.20.1#820001)


[jira] [Updated] (GEODE-10003) Support JDK 17 on Geode

2022-04-05 Thread Anilkumar Gingade (Jira)


 [ 
https://issues.apache.org/jira/browse/GEODE-10003?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Anilkumar Gingade updated GEODE-10003:
--
Labels: jdk17  (was: )

> Support JDK 17 on Geode
> ---
>
> Key: GEODE-10003
> URL: https://issues.apache.org/jira/browse/GEODE-10003
> Project: Geode
>  Issue Type: Improvement
>Reporter: Owen Nichols
>Priority: Major
>  Labels: jdk17
>
> JDK 17 Schedule: [https://openjdk.java.net/projects/jdk/17/]
> The first ticket to suggest going beyond JDK8 was GEODE-3.  Now that JDK17 is 
> LTS, and Spring and other users are embracing JDK17, Geode should at least 
> run (if not compile) on JDK17.
> This may take a lot of work, possibly requiring a new major release.  Please 
> add subtasks to this ticket as necessary.



--
This message was sent by Atlassian Jira
(v8.20.1#820001)


[jira] [Updated] (GEODE-10036) Joining Locators in a cluster is not possible on Java 17

2022-04-05 Thread Anilkumar Gingade (Jira)


 [ 
https://issues.apache.org/jira/browse/GEODE-10036?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Anilkumar Gingade updated GEODE-10036:
--
Labels: jdk17  (was: needsTriage)

> Joining Locators in a cluster is not possible on Java 17
> 
>
> Key: GEODE-10036
> URL: https://issues.apache.org/jira/browse/GEODE-10036
> Project: Geode
>  Issue Type: Sub-task
>Affects Versions: 1.14.3
>Reporter: John Blum
>Priority: Critical
>  Labels: jdk17
>
> When trying to add multiple _Locators_ to the same cluster, particularly when 
> "_direct_" {{ByteBuffers}} are used, the following Exception is thrown:
> {code:java}
> Caused by: org.apache.geode.SystemConnectException: One or more peers 
> generated exceptions during connection attempt
>   at 
> org.apache.geode.distributed.internal.ClusterDistributionManager.sendStartupMessage(ClusterDistributionManager.java:1634)
>   at 
> org.apache.geode.distributed.internal.ClusterDistributionManager.create(ClusterDistributionManager.java:361)
>   at 
> org.apache.geode.distributed.internal.InternalDistributedSystem.initialize(InternalDistributedSystem.java:756)
>   at 
> org.apache.geode.distributed.internal.InternalDistributedSystem.access$200(InternalDistributedSystem.java:132)
>   at 
> org.apache.geode.distributed.internal.InternalDistributedSystem$Builder.build(InternalDistributedSystem.java:3011)
>   at 
> org.apache.geode.distributed.internal.InternalDistributedSystem.connectInternal(InternalDistributedSystem.java:282)
>   at 
> org.apache.geode.distributed.internal.InternalLocator.startDistributedSystem(InternalLocator.java:746)
>   at 
> org.apache.geode.distributed.internal.InternalLocator.startLocator(InternalLocator.java:392)
>   at 
> org.apache.geode.distributed.LocatorLauncher.start(LocatorLauncher.java:716)
>   at 
> org.springframework.data.gemfire.LocatorFactoryBean.init(LocatorFactoryBean.java:120)
>   ... 44 more
> Caused by: org.apache.geode.InternalGemFireException: unable to retrieve 
> underlying byte buffer
>   at 
> org.apache.geode.internal.net.BufferPool.getPoolableBuffer(BufferPool.java:346)
>   at 
> org.apache.geode.internal.net.BufferPool.releaseBuffer(BufferPool.java:310)
>   at 
> org.apache.geode.internal.net.BufferPool.releaseSenderBuffer(BufferPool.java:213)
>   at 
> org.apache.geode.internal.tcp.MsgStreamer.release(MsgStreamer.java:101)
>   at 
> org.apache.geode.internal.tcp.MsgStreamer.writeMessage(MsgStreamer.java:258)
>   at 
> org.apache.geode.distributed.internal.direct.DirectChannel.sendToMany(DirectChannel.java:318)
>   at 
> org.apache.geode.distributed.internal.direct.DirectChannel.sendToOne(DirectChannel.java:187)
>   at 
> org.apache.geode.distributed.internal.direct.DirectChannel.send(DirectChannel.java:525)
>   at 
> org.apache.geode.distributed.internal.DistributionImpl.directChannelSend(DistributionImpl.java:348)
>   at 
> org.apache.geode.distributed.internal.DistributionImpl.send(DistributionImpl.java:293)
>   at 
> org.apache.geode.distributed.internal.ClusterDistributionManager.sendViaMembershipManager(ClusterDistributionManager.java:2064)
>   at 
> org.apache.geode.distributed.internal.ClusterDistributionManager.sendOutgoing(ClusterDistributionManager.java:1991)
>   at 
> org.apache.geode.distributed.internal.StartupOperation.sendStartupMessage(StartupOperation.java:75)
>   at 
> org.apache.geode.distributed.internal.ClusterDistributionManager.sendStartupMessage(ClusterDistributionManager.java:1631)
>   ... 56 more
> Caused by: java.lang.reflect.InaccessibleObjectException: Unable to make 
> public java.lang.Object java.nio.DirectByteBuffer.attachment() accessible: 
> module java.base does not "opens java.nio" to unnamed module @2e0fa5d3
>   at 
> java.base/java.lang.reflect.AccessibleObject.checkCanSetAccessible(AccessibleObject.java:354)
>   at 
> java.base/java.lang.reflect.AccessibleObject.checkCanSetAccessible(AccessibleObject.java:297)
>   at 
> java.base/java.lang.reflect.Method.checkCanSetAccessible(Method.java:199)
>   at java.base/java.lang.reflect.Method.setAccessible(Method.java:193)
>   at 
> org.apache.geode.internal.net.BufferPool.getPoolableBuffer(BufferPool.java:343)
>   ... 69 more
> {code}



--
This message was sent by Atlassian Jira
(v8.20.1#820001)


[jira] [Updated] (GEODE-10037) Make o.a.geode.internal.cache.wan.spi.WANFactory SPI part of the non-internal, public API

2022-04-05 Thread Anilkumar Gingade (Jira)


 [ 
https://issues.apache.org/jira/browse/GEODE-10037?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Anilkumar Gingade updated GEODE-10037:
--
Labels:   (was: needsTriage)

> Make o.a.geode.internal.cache.wan.spi.WANFactory SPI part of the 
> non-internal, public API
> -
>
> Key: GEODE-10037
> URL: https://issues.apache.org/jira/browse/GEODE-10037
> Project: Geode
>  Issue Type: Improvement
>Affects Versions: 1.14.3
>Reporter: John Blum
>Priority: Minor
>
> For the same reasons that were stated in GEODE-10007.



--
This message was sent by Atlassian Jira
(v8.20.1#820001)


[jira] [Updated] (GEODE-10115) org.apache.geode.security.SecurityManager.init() declaration is inconsistent with method documentation

2022-04-05 Thread Anilkumar Gingade (Jira)


 [ 
https://issues.apache.org/jira/browse/GEODE-10115?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Anilkumar Gingade updated GEODE-10115:
--
Labels:   (was: needsTriage)

> org.apache.geode.security.SecurityManager.init() declaration is inconsistent 
> with method documentation
> --
>
> Key: GEODE-10115
> URL: https://issues.apache.org/jira/browse/GEODE-10115
> Project: Geode
>  Issue Type: Bug
>Reporter: Lynn Gallinat
>Priority: Major
>
> org.apache.geode.security.SecurityManager.init() does not include a throws 
> clause, but the method documentation says it throws 
> AuthenticationFailedException.
> {noformat}
> /**
>  * Initialize the SecurityManager. This is invoked when a cache is created
>  *
>  * @param securityProps the security properties obtained using a call to
>  *{@link DistributedSystem#getSecurityProperties}
>  * @throws AuthenticationFailedException if some exception occurs during the 
> initialization
>  */
> default void init(Properties securityProps) {}{noformat}
> Either the throws documentation needs to be deleted, or a throws clause needs 
> to be added.



--
This message was sent by Atlassian Jira
(v8.20.1#820001)


[jira] [Updated] (GEODE-10132) org.apache.geode.management.internal.cli.commands.StopServerAcceptanceTest Failed

2022-04-05 Thread Anilkumar Gingade (Jira)


 [ 
https://issues.apache.org/jira/browse/GEODE-10132?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Anilkumar Gingade updated GEODE-10132:
--
Labels:   (was: needsTriage)

> org.apache.geode.management.internal.cli.commands.StopServerAcceptanceTest 
> Failed
> -
>
> Key: GEODE-10132
> URL: https://issues.apache.org/jira/browse/GEODE-10132
> Project: Geode
>  Issue Type: Bug
>  Components: tests
>Reporter: Eric Shu
>Priority: Major
>
> cannotStopServerByNameWhenNotConnected
> org.opentest4j.AssertionFailedError: [Exit value from process started by 
> [fcdfe561432081bf: gfsh -e start locator --name=locator -e start server 
> --name=server --server-port=0]] 
> expected: 0
>  but was: 1
>   at sun.reflect.NativeConstructorAccessorImpl.newInstance0(Native Method)
>   at 
> sun.reflect.NativeConstructorAccessorImpl.newInstance(NativeConstructorAccessorImpl.java:62)
>   at 
> sun.reflect.DelegatingConstructorAccessorImpl.newInstance(DelegatingConstructorAccessorImpl.java:45)
>   at 
> org.apache.geode.test.junit.rules.gfsh.GfshExecution.awaitTermination(GfshExecution.java:103)
>   at 
> org.apache.geode.test.junit.rules.gfsh.GfshRule.execute(GfshRule.java:154)
>   at 
> org.apache.geode.test.junit.rules.gfsh.GfshRule.execute(GfshRule.java:163)
>   at 
> org.apache.geode.test.junit.rules.gfsh.GfshRule.execute(GfshRule.java:129)
>   at 
> org.apache.geode.management.internal.cli.commands.StopServerAcceptanceTest.startCluster(StopServerAcceptanceTest.java:32)
>   at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
>   at 
> sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:62)
>   at 
> sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:43)
>   at java.lang.reflect.Method.invoke(Method.java:498)
>   at 
> org.junit.runners.model.FrameworkMethod$1.runReflectiveCall(FrameworkMethod.java:59)
>   at 
> org.junit.internal.runners.model.ReflectiveCallable.run(ReflectiveCallable.java:12)
>   at 
> org.junit.runners.model.FrameworkMethod.invokeExplosively(FrameworkMethod.java:56)
>   at 
> org.junit.internal.runners.statements.RunBefores.invokeMethod(RunBefores.java:33)
>   at 
> org.junit.internal.runners.statements.RunBefores.evaluate(RunBefores.java:24)
>   at org.junit.rules.ExternalResource$1.evaluate(ExternalResource.java:54)
>   at org.junit.runners.ParentRunner$3.evaluate(ParentRunner.java:306)
>   at 
> org.junit.runners.BlockJUnit4ClassRunner$1.evaluate(BlockJUnit4ClassRunner.java:100)
>   at org.junit.runners.ParentRunner.runLeaf(ParentRunner.java:366)
>   at 
> org.junit.runners.BlockJUnit4ClassRunner.runChild(BlockJUnit4ClassRunner.java:103)
>   at 
> org.junit.runners.BlockJUnit4ClassRunner.runChild(BlockJUnit4ClassRunner.java:63)
>   at org.junit.runners.ParentRunner$4.run(ParentRunner.java:331)
>   at org.junit.runners.ParentRunner$1.schedule(ParentRunner.java:79)
>   at org.junit.runners.ParentRunner.runChildren(ParentRunner.java:329)
>   at org.junit.runners.ParentRunner.access$100(ParentRunner.java:66)
>   at org.junit.runners.ParentRunner$2.evaluate(ParentRunner.java:293)
>   at org.junit.runners.ParentRunner$3.evaluate(ParentRunner.java:306)
>   at org.junit.runners.ParentRunner.run(ParentRunner.java:413)
>   at org.junit.runner.JUnitCore.run(JUnitCore.java:137)
>   at org.junit.runner.JUnitCore.run(JUnitCore.java:115)
>   at 
> org.junit.vintage.engine.execution.RunnerExecutor.execute(RunnerExecutor.java:42)
>   at 
> org.junit.vintage.engine.VintageTestEngine.executeAllChildren(VintageTestEngine.java:80)
>   at 
> org.junit.vintage.engine.VintageTestEngine.execute(VintageTestEngine.java:72)
>   at 
> org.junit.platform.launcher.core.EngineExecutionOrchestrator.execute(EngineExecutionOrchestrator.java:108)
>   at 
> org.junit.platform.launcher.core.EngineExecutionOrchestrator.execute(EngineExecutionOrchestrator.java:88)
>   at 
> org.junit.platform.launcher.core.EngineExecutionOrchestrator.lambda$execute$0(EngineExecutionOrchestrator.java:54)
>   at 
> org.junit.platform.launcher.core.EngineExecutionOrchestrator.withInterceptedStreams(EngineExecutionOrchestrator.java:67)
>   at 
> org.junit.platform.launcher.core.EngineExecutionOrchestrator.execute(EngineExecutionOrchestrator.java:52)
>   at 
> org.junit.platform.launcher.core.DefaultLauncher.execute(DefaultLauncher.java:96)
>   at 
> org.junit.platform.launcher.core.DefaultLauncher.execute(DefaultLauncher.java:75)
>   at 
> 

[jira] [Updated] (GEODE-10143) ClusterStartupRule.withLogFile does not seem to work as described in comment

2022-04-05 Thread Anilkumar Gingade (Jira)


 [ 
https://issues.apache.org/jira/browse/GEODE-10143?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Anilkumar Gingade updated GEODE-10143:
--
Labels: GeodeOperationAPI  (was: GeodeOperationAPI needsTriage)

> ClusterStartupRule.withLogFile does not seem to work as described in comment
> 
>
> Key: GEODE-10143
> URL: https://issues.apache.org/jira/browse/GEODE-10143
> Project: Geode
>  Issue Type: Bug
>  Components: tests
>Affects Versions: 1.14.4
>Reporter: Joris Melchior
>Priority: Minor
>  Labels: GeodeOperationAPI
>
> ClusterStartupRule.withLogFile method comment describes that using the flag 
> will redirect output from standard out to log files. When used this does not 
> seem to be the case.



--
This message was sent by Atlassian Jira
(v8.20.1#820001)


[jira] [Updated] (GEODE-10156) RollingUpgradeWithGfshDUnitTest > testRollingUpgradeWithDeployment[1.12.0] FAILED

2022-04-05 Thread Anilkumar Gingade (Jira)


 [ 
https://issues.apache.org/jira/browse/GEODE-10156?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Anilkumar Gingade updated GEODE-10156:
--
Labels:   (was: needsTriage)

> RollingUpgradeWithGfshDUnitTest > testRollingUpgradeWithDeployment[1.12.0] 
> FAILED
> -
>
> Key: GEODE-10156
> URL: https://issues.apache.org/jira/browse/GEODE-10156
> Project: Geode
>  Issue Type: Bug
>  Components: tests
>Reporter: Eric Shu
>Priority: Major
>
> org.opentest4j.AssertionFailedError: [Exit value from process started by 
> [4f44cb7de3b710f1: gfsh -e start locator --name=loc1 --port=20846 
> --http-service-port=0 --J=-Dgemfire.jmx-manager-port=20847 -e start locator 
> --name=loc2 --port=20848 --http-service-port=0 --locators=localhost[20846] 
> --J=-Dgemfire.jmx-manager-port=20849 -e start server --name=server1 
> --server-port=20850 --locators=localhost[20846] -e start server 
> --name=server2 --server-port=20851 --locators=localhost[20846] -e deploy 
> --dir=/tmp/junit4947263696282150978/junit6257623670577599443]] 
> expected: 0
>  but was: 1
>   at sun.reflect.NativeConstructorAccessorImpl.newInstance0(Native Method)
>   at 
> sun.reflect.NativeConstructorAccessorImpl.newInstance(NativeConstructorAccessorImpl.java:62)
>   at 
> sun.reflect.DelegatingConstructorAccessorImpl.newInstance(DelegatingConstructorAccessorImpl.java:45)
>   at 
> org.apache.geode.test.junit.rules.gfsh.GfshExecution.awaitTermination(GfshExecution.java:103)
>   at 
> org.apache.geode.test.junit.rules.gfsh.GfshRule.execute(GfshRule.java:154)
>   at 
> org.apache.geode.test.junit.rules.gfsh.GfshRule.execute(GfshRule.java:163)
>   at 
> org.apache.geode.test.junit.rules.gfsh.GfshScript.execute(GfshScript.java:153)
>   at 
> org.apache.geode.management.RollingUpgradeWithGfshDUnitTest.testRollingUpgradeWithDeployment(RollingUpgradeWithGfshDUnitTest.java:93)
>   at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
>   at 
> sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:62)
>   at 
> sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:43)
>   at java.lang.reflect.Method.invoke(Method.java:498)
>   at 
> org.junit.runners.model.FrameworkMethod$1.runReflectiveCall(FrameworkMethod.java:59)
>   at 
> org.junit.internal.runners.model.ReflectiveCallable.run(ReflectiveCallable.java:12)
>   at 
> org.junit.runners.model.FrameworkMethod.invokeExplosively(FrameworkMethod.java:56)
>   at 
> org.junit.internal.runners.statements.InvokeMethod.evaluate(InvokeMethod.java:17)
>   at org.junit.rules.ExternalResource$1.evaluate(ExternalResource.java:54)
>   at org.junit.rules.ExternalResource$1.evaluate(ExternalResource.java:54)
>   at org.junit.rules.ExternalResource$1.evaluate(ExternalResource.java:54)
>   at org.junit.runners.ParentRunner$3.evaluate(ParentRunner.java:306)
>   at 
> org.junit.runners.BlockJUnit4ClassRunner$1.evaluate(BlockJUnit4ClassRunner.java:100)
>   at org.junit.runners.ParentRunner.runLeaf(ParentRunner.java:366)
>   at 
> org.junit.runners.BlockJUnit4ClassRunner.runChild(BlockJUnit4ClassRunner.java:103)
>   at 
> org.junit.runners.BlockJUnit4ClassRunner.runChild(BlockJUnit4ClassRunner.java:63)
>   at org.junit.runners.ParentRunner$4.run(ParentRunner.java:331)
>   at org.junit.runners.ParentRunner$1.schedule(ParentRunner.java:79)
>   at org.junit.runners.ParentRunner.runChildren(ParentRunner.java:329)
>   at org.junit.runners.ParentRunner.access$100(ParentRunner.java:66)
>   at org.junit.runners.ParentRunner$2.evaluate(ParentRunner.java:293)
>   at org.junit.runners.ParentRunner.run(ParentRunner.java:413)
>   at org.junit.runners.Suite.runChild(Suite.java:128)
>   at org.junit.runners.Suite.runChild(Suite.java:27)
>   at org.junit.runners.ParentRunner$4.run(ParentRunner.java:331)
>   at org.junit.runners.ParentRunner$1.schedule(ParentRunner.java:79)
>   at org.junit.runners.ParentRunner.runChildren(ParentRunner.java:329)
>   at org.junit.runners.ParentRunner.access$100(ParentRunner.java:66)
>   at org.junit.runners.ParentRunner$2.evaluate(ParentRunner.java:293)
>   at org.junit.runners.ParentRunner$3.evaluate(ParentRunner.java:306)
>   at org.junit.runners.ParentRunner.run(ParentRunner.java:413)
>   at org.junit.runner.JUnitCore.run(JUnitCore.java:137)
>   at org.junit.runner.JUnitCore.run(JUnitCore.java:115)
>   at 
> org.junit.vintage.engine.execution.RunnerExecutor.execute(RunnerExecutor.java:42)
>   at 
> org.junit.vintage.engine.VintageTestEngine.executeAllChildren(VintageTestEngine.java:80)
>   at 
> 

[jira] [Updated] (GEODE-9789) CI Failure: ConnectCommandWithSSLMultiKeyTest > classMethod FAILED

2022-04-05 Thread Anilkumar Gingade (Jira)


 [ 
https://issues.apache.org/jira/browse/GEODE-9789?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Anilkumar Gingade updated GEODE-9789:
-
Labels:   (was: needsTriage)

> CI Failure: ConnectCommandWithSSLMultiKeyTest > classMethod FAILED
> --
>
> Key: GEODE-9789
> URL: https://issues.apache.org/jira/browse/GEODE-9789
> Project: Geode
>  Issue Type: Bug
>  Components: membership
>Reporter: Jens Deppe
>Priority: Major
>
> https://concourse.apachegeode-ci.info/teams/main/pipelines/apache-support-1-12-main/jobs/windows-gfsh-distributed-test-openjdk8/builds/49
>  
> {noformat}
> org.apache.geode.management.internal.cli.commands.ConnectCommandWithSSLMultiKeyTest
>  > classMethod FAILED
> 19:03:00org.apache.geode.test.dunit.RMIException: While invoking 
> org.apache.geode.test.dunit.internal.IdentifiableCallable.call in VM 0 
> running on Host 
> heavy-lifter-b81c90cf-1834-5764-b07d-52bb4df9d090.c.apachegeode-ci.internal 
> with 4 VMs
> 19:03:00at 
> org.apache.geode.test.dunit.VM.executeMethodOnObject(VM.java:610)
> 19:03:00at org.apache.geode.test.dunit.VM.invoke(VM.java:450)
> 19:03:00at 
> org.apache.geode.test.dunit.rules.ClusterStartupRule.startLocatorVM(ClusterStartupRule.java:225)
> 19:03:00at 
> org.apache.geode.test.dunit.rules.ClusterStartupRule.startLocatorVM(ClusterStartupRule.java:218)
> 19:03:00at 
> org.apache.geode.management.internal.cli.commands.ConnectCommandWithSSLMultiKeyTest.beforeClass(ConnectCommandWithSSLMultiKeyTest.java:80)
> 19:03:00
> 19:03:00Caused by:
> 19:03:00org.apache.geode.GemFireConfigException: Unable to join the 
> distributed system.  Operation either timed out, was stopped or Locator does 
> not exist. {noformat}
>  
> =-=-=-=-=-=-=-=-=-=-=-=-=-=-= Test Results URI 
> =-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=
> http://files.apachegeode-ci.info/builds/apache-support-1-12-main/1.12.6-build.0297/test-results/distributedTest/1635475233/
> =-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=
> Test report artifacts from this job are available at:
> http://files.apachegeode-ci.info/builds/apache-support-1-12-main/1.12.6-build.0297/test-artifacts/1635475233/windows-gfshdistributedtest-openjdk8-1.12.6-build.0297.tgz



--
This message was sent by Atlassian Jira
(v8.20.1#820001)


[jira] [Updated] (GEODE-9989) add a few info level logs in PersistenceAdvisorImpl to identify splitbrain issue

2022-04-05 Thread Anilkumar Gingade (Jira)


 [ 
https://issues.apache.org/jira/browse/GEODE-9989?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Anilkumar Gingade updated GEODE-9989:
-
Labels:   (was: needsTriage)

> add a few info level logs in PersistenceAdvisorImpl to identify splitbrain 
> issue
> 
>
> Key: GEODE-9989
> URL: https://issues.apache.org/jira/browse/GEODE-9989
> Project: Geode
>  Issue Type: Bug
>Reporter: Xiaojian Zhou
>Priority: Major
>
> In scenario like:
> {code:java}
> 03:33:03.644 dataStoregemfire4_4494 recovered from disk
> 03:33:03.732 dataStoregemfire4_4494 closing
> 03:33:03.735 dataStoregemfire4_4494 Initialization of region replicate_5 
> completed, send newId(let’s name it 4494) to gemfire2
> 03:33:03.754 dataStoregemfire2_4493 recovered from disk
> 03:33:03.770 dataStoregemfire2_4493 closing
> 03:33:03.792 dataStoregemfire2_4493 Initialization of region replicate_5 
> completed. send newId(let’s name is 4493) to gemfire4, but gemfire4 is 
> offline. So gemfire4 does not know gemfire2’s newId 4493.
> 03:34:11.247 gemfire4_9779 restarted, it does not know 4493
> 03:34:11.269 gemfire2_9856 restarted, it sends oldId=4493, newId=9856 to 
> gemfire4, but gemfire4 does not know either of gemfire2’s oldId and newId
> When gemfire2_9856 asked gemfire4_9779 for its state, gemfire4_9779 replied 
> "I don't know you", then gemfire2_9856's starting ends with 
> ConflictingPersistentDataException.
> {code}
> We need more log to identify the issue. 



--
This message was sent by Atlassian Jira
(v8.20.1#820001)


[jira] [Updated] (GEODE-9777) CI failure: QueryConfigurationServiceConstraintsDistributedTest queriesInFlightExecutedByClientsShouldNotBeAffectedWhenMethodAuthorizerIsChanged(RegionType:PARTITION) FAI

2022-04-05 Thread Anilkumar Gingade (Jira)


 [ 
https://issues.apache.org/jira/browse/GEODE-9777?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Anilkumar Gingade updated GEODE-9777:
-
Labels:   (was: needsTriage)

> CI failure: QueryConfigurationServiceConstraintsDistributedTest 
> queriesInFlightExecutedByClientsShouldNotBeAffectedWhenMethodAuthorizerIsChanged(RegionType:PARTITION)
>  FAILED
> -
>
> Key: GEODE-9777
> URL: https://issues.apache.org/jira/browse/GEODE-9777
> Project: Geode
>  Issue Type: Bug
>  Components: querying
>Reporter: Darrel Schneider
>Priority: Major
>
> cause of the failure was: AuthenticationRequiredException: Failed to find the 
> authenticated user



--
This message was sent by Atlassian Jira
(v8.20.1#820001)


[jira] [Updated] (GEODE-9785) CI Failure: RedundancyLevelPart2DUnitTest fails due to PoolImpl.getRedundantNames() not returning expected servers

2022-04-05 Thread Anilkumar Gingade (Jira)


 [ 
https://issues.apache.org/jira/browse/GEODE-9785?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Anilkumar Gingade updated GEODE-9785:
-
Labels:   (was: needsTriage)

> CI Failure: RedundancyLevelPart2DUnitTest fails due to 
> PoolImpl.getRedundantNames() not returning expected servers
> --
>
> Key: GEODE-9785
> URL: https://issues.apache.org/jira/browse/GEODE-9785
> Project: Geode
>  Issue Type: Bug
>  Components: regions
>Reporter: Eric Shu
>Priority: Major
>
> {noformat}
> org.junit.ComparisonFailure: expected:<[2]> but was:<[0]>
>   at sun.reflect.NativeConstructorAccessorImpl.newInstance0(Native Method)
>   at 
> sun.reflect.NativeConstructorAccessorImpl.newInstance(NativeConstructorAccessorImpl.java:62)
>   at 
> sun.reflect.DelegatingConstructorAccessorImpl.newInstance(DelegatingConstructorAccessorImpl.java:45)
>   at 
> org.apache.geode.internal.cache.tier.sockets.RedundancyLevelPart2DUnitTest.testRedundancySpecifiedEPFails(RedundancyLevelPart2DUnitTest.java:215)
>   at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
>   at 
> sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:62)
>   at 
> sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:43)
>   at java.lang.reflect.Method.invoke(Method.java:498)
>   at 
> org.junit.runners.model.FrameworkMethod$1.runReflectiveCall(FrameworkMethod.java:50)
>   at 
> org.junit.internal.runners.model.ReflectiveCallable.run(ReflectiveCallable.java:12)
>   at 
> org.junit.runners.model.FrameworkMethod.invokeExplosively(FrameworkMethod.java:47)
>   at 
> org.junit.internal.runners.statements.InvokeMethod.evaluate(InvokeMethod.java:17)
>   at 
> org.junit.internal.runners.statements.RunBefores.evaluate(RunBefores.java:26)
>   at 
> org.junit.internal.runners.statements.RunAfters.evaluate(RunAfters.java:27)
>   at org.junit.rules.TestWatcher$1.evaluate(TestWatcher.java:55)
>   at org.junit.rules.RunRules.evaluate(RunRules.java:20)
>   at org.junit.runners.ParentRunner.runLeaf(ParentRunner.java:325)
>   at 
> org.junit.runners.BlockJUnit4ClassRunner.runChild(BlockJUnit4ClassRunner.java:78)
>   at 
> org.junit.runners.BlockJUnit4ClassRunner.runChild(BlockJUnit4ClassRunner.java:57)
>   at org.junit.runners.ParentRunner$3.run(ParentRunner.java:290)
>   at org.junit.runners.ParentRunner$1.schedule(ParentRunner.java:71)
>   at org.junit.runners.ParentRunner.runChildren(ParentRunner.java:288)
>   at org.junit.runners.ParentRunner.access$000(ParentRunner.java:58)
>   at org.junit.runners.ParentRunner$2.evaluate(ParentRunner.java:268)
>   at 
> org.junit.internal.runners.statements.RunBefores.evaluate(RunBefores.java:26)
>   at org.junit.runners.ParentRunner.run(ParentRunner.java:363)
>   at 
> org.gradle.api.internal.tasks.testing.junit.JUnitTestClassExecutor.runTestClass(JUnitTestClassExecutor.java:110)
>   at 
> org.gradle.api.internal.tasks.testing.junit.JUnitTestClassExecutor.execute(JUnitTestClassExecutor.java:58)
>   at 
> org.gradle.api.internal.tasks.testing.junit.JUnitTestClassExecutor.execute(JUnitTestClassExecutor.java:38)
>   at 
> org.gradle.api.internal.tasks.testing.junit.AbstractJUnitTestClassProcessor.processTestClass(AbstractJUnitTestClassProcessor.java:62)
>   at 
> org.gradle.api.internal.tasks.testing.SuiteTestClassProcessor.processTestClass(SuiteTestClassProcessor.java:51)
>   at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
>   at 
> sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:62)
>   at 
> sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:43)
>   at java.lang.reflect.Method.invoke(Method.java:498)
>   at 
> org.gradle.internal.dispatch.ReflectionDispatch.dispatch(ReflectionDispatch.java:35)
>   at 
> org.gradle.internal.dispatch.ReflectionDispatch.dispatch(ReflectionDispatch.java:24)
>   at 
> org.gradle.internal.dispatch.ContextClassLoaderDispatch.dispatch(ContextClassLoaderDispatch.java:32)
>   at 
> org.gradle.internal.dispatch.ProxyDispatchAdapter$DispatchingInvocationHandler.invoke(ProxyDispatchAdapter.java:93)
>   at com.sun.proxy.$Proxy2.processTestClass(Unknown Source)
>   at 
> org.gradle.api.internal.tasks.testing.worker.TestWorker.processTestClass(TestWorker.java:118)
>   at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
>   at 
> sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:62)
>   at 
> sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:43)
>   at 

[jira] [Updated] (GEODE-10065) Fix regexp in gnmsg for handshake messages

2022-04-05 Thread Anilkumar Gingade (Jira)


 [ 
https://issues.apache.org/jira/browse/GEODE-10065?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Anilkumar Gingade updated GEODE-10065:
--
Labels: pull-request-available  (was: needsTriage pull-request-available)

> Fix regexp in gnmsg for handshake messages
> --
>
> Key: GEODE-10065
> URL: https://issues.apache.org/jira/browse/GEODE-10065
> Project: Geode
>  Issue Type: Bug
>  Components: native client
>Reporter: Michael Martell
>Priority: Major
>  Labels: pull-request-available
>
> gnmsg tool has a bug in the regexp for handshake messages.



--
This message was sent by Atlassian Jira
(v8.20.1#820001)


[jira] [Updated] (GEODE-10065) Fix regexp in gnmsg for handshake messages

2022-04-05 Thread Anilkumar Gingade (Jira)


 [ 
https://issues.apache.org/jira/browse/GEODE-10065?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Anilkumar Gingade updated GEODE-10065:
--
Component/s: native client

> Fix regexp in gnmsg for handshake messages
> --
>
> Key: GEODE-10065
> URL: https://issues.apache.org/jira/browse/GEODE-10065
> Project: Geode
>  Issue Type: Bug
>  Components: native client
>Reporter: Michael Martell
>Priority: Major
>  Labels: needsTriage, pull-request-available
>
> gnmsg tool has a bug in the regexp for handshake messages.



--
This message was sent by Atlassian Jira
(v8.20.1#820001)


[jira] [Updated] (GEODE-9804) Both registerAllKeys and registerRegex always fetch initial entries.

2022-04-05 Thread Anilkumar Gingade (Jira)


 [ 
https://issues.apache.org/jira/browse/GEODE-9804?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Anilkumar Gingade updated GEODE-9804:
-
Labels: pull-request-available  (was: needsTriage pull-request-available)

> Both registerAllKeys and registerRegex always fetch initial entries.
> 
>
> Key: GEODE-9804
> URL: https://issues.apache.org/jira/browse/GEODE-9804
> Project: Geode
>  Issue Type: Bug
>  Components: native client
>Reporter: Jacob Barrett
>Priority: Major
>  Labels: pull-request-available
>
> A inconsistency and bug in how the two regex interest methods configure the 
> initial interest policy results in the both registerAllKeys and registerRegex 
> fetching all initial entries despite the boolean parameter to get initial 
> entries. Furthermore the misconfiguration results in none of the entries 
> actually getting sent to the cache listener. The result is unnecessarily long 
> registration times, network traffic and load on the servers. On servers with 
> say millions of keys this can result in long GC pauses to unintentionally 
> iterate over all those keys.



--
This message was sent by Atlassian Jira
(v8.20.1#820001)


[jira] [Updated] (GEODE-9804) Both registerAllKeys and registerRegex always fetch initial entries.

2022-04-05 Thread Anilkumar Gingade (Jira)


 [ 
https://issues.apache.org/jira/browse/GEODE-9804?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Anilkumar Gingade updated GEODE-9804:
-
Component/s: native client

> Both registerAllKeys and registerRegex always fetch initial entries.
> 
>
> Key: GEODE-9804
> URL: https://issues.apache.org/jira/browse/GEODE-9804
> Project: Geode
>  Issue Type: Bug
>  Components: native client
>Reporter: Jacob Barrett
>Priority: Major
>  Labels: needsTriage, pull-request-available
>
> A inconsistency and bug in how the two regex interest methods configure the 
> initial interest policy results in the both registerAllKeys and registerRegex 
> fetching all initial entries despite the boolean parameter to get initial 
> entries. Furthermore the misconfiguration results in none of the entries 
> actually getting sent to the cache listener. The result is unnecessarily long 
> registration times, network traffic and load on the servers. On servers with 
> say millions of keys this can result in long GC pauses to unintentionally 
> iterate over all those keys.



--
This message was sent by Atlassian Jira
(v8.20.1#820001)


[jira] [Updated] (GEODE-9711) Fix the typos in ConflationDUnitTestHelper

2022-04-05 Thread Anilkumar Gingade (Jira)


 [ 
https://issues.apache.org/jira/browse/GEODE-9711?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Anilkumar Gingade updated GEODE-9711:
-
Labels: pull-request-available  (was: needsTriage pull-request-available)

> Fix the typos in ConflationDUnitTestHelper
> --
>
> Key: GEODE-9711
> URL: https://issues.apache.org/jira/browse/GEODE-9711
> Project: Geode
>  Issue Type: Bug
>  Components: tests
>Reporter: Nabarun Nag
>Priority: Major
>  Labels: pull-request-available
>




--
This message was sent by Atlassian Jira
(v8.20.1#820001)


[jira] [Updated] (GEODE-9880) Cluster with multiple locators in an environment with no host name resolution, leads to null pointer exception

2022-04-05 Thread Anilkumar Gingade (Jira)


 [ 
https://issues.apache.org/jira/browse/GEODE-9880?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Anilkumar Gingade updated GEODE-9880:
-
Labels: blocks-1.12.10 blocks-1.15.0 membership  (was: blocks-1.12.10 
blocks-1.15.0 blocks-1.15.0​ membership)

> Cluster with multiple locators in an environment with no host name 
> resolution, leads to null pointer exception
> --
>
> Key: GEODE-9880
> URL: https://issues.apache.org/jira/browse/GEODE-9880
> Project: Geode
>  Issue Type: Bug
>  Components: locator, membership
>Affects Versions: 1.12.5
>Reporter: Tigran Ghahramanyan
>Priority: Major
>  Labels: blocks-1.12.10, blocks-1.15.0, membership
>
> In our use case we have two locators that are initially configured with IP 
> addresses, but _AutoConnectionSourceImpl.UpdateLocatorList()_ flow keeps on 
> adding their corresponding host names to the locators list, while these host 
> names are not resolvable.
> Later in {_}AutoConnectionSourceImpl.queryLocators(){_}, whenever a client 
> tries to use such non resolvable host name to connect to a locator it tries 
> to establish a connection to {_}socketaddr=0.0.0.0{_}, as written in 
> {_}SocketCreator.connect(){_}. Which seems strange.
> Then, if there is no locator running on the same host, the next locator in 
> the list is contacted, until reaching a locator contact configured with IP 
> address - which succeeds eventually.
> But, when there happens to be a locator listening on the same host, then we 
> have a null pointer exception in the second line below, because _inetadd=null_
> _socket.connect(sockaddr, Math.max(timeout, 0)); // sockaddr=0.0.0.0, 
> connects to a locator listening on the same host_
> _configureClientSSLSocket(socket, inetadd.getHostName(), timeout); // inetadd 
> = null_
>  
> As a result, the cluster comes to a failed state, unable to recover.



--
This message was sent by Atlassian Jira
(v8.20.1#820001)


[jira] [Updated] (GEODE-9880) Cluster with multiple locators in an environment with no host name resolution, leads to null pointer exception

2022-04-05 Thread Anilkumar Gingade (Jira)


 [ 
https://issues.apache.org/jira/browse/GEODE-9880?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Anilkumar Gingade updated GEODE-9880:
-
Labels: blocks-1.12.10 blocks-1.15.0 blocks-1.15.0​ membership  (was: 
blocks-1.12.10 blocks-1.15.0​ membership)

> Cluster with multiple locators in an environment with no host name 
> resolution, leads to null pointer exception
> --
>
> Key: GEODE-9880
> URL: https://issues.apache.org/jira/browse/GEODE-9880
> Project: Geode
>  Issue Type: Bug
>  Components: locator, membership
>Affects Versions: 1.12.5
>Reporter: Tigran Ghahramanyan
>Priority: Major
>  Labels: blocks-1.12.10, blocks-1.15.0, blocks-1.15.0​, membership
>
> In our use case we have two locators that are initially configured with IP 
> addresses, but _AutoConnectionSourceImpl.UpdateLocatorList()_ flow keeps on 
> adding their corresponding host names to the locators list, while these host 
> names are not resolvable.
> Later in {_}AutoConnectionSourceImpl.queryLocators(){_}, whenever a client 
> tries to use such non resolvable host name to connect to a locator it tries 
> to establish a connection to {_}socketaddr=0.0.0.0{_}, as written in 
> {_}SocketCreator.connect(){_}. Which seems strange.
> Then, if there is no locator running on the same host, the next locator in 
> the list is contacted, until reaching a locator contact configured with IP 
> address - which succeeds eventually.
> But, when there happens to be a locator listening on the same host, then we 
> have a null pointer exception in the second line below, because _inetadd=null_
> _socket.connect(sockaddr, Math.max(timeout, 0)); // sockaddr=0.0.0.0, 
> connects to a locator listening on the same host_
> _configureClientSSLSocket(socket, inetadd.getHostName(), timeout); // inetadd 
> = null_
>  
> As a result, the cluster comes to a failed state, unable to recover.



--
This message was sent by Atlassian Jira
(v8.20.1#820001)


[jira] [Updated] (GEODE-10210) [CI Failure] : InternalConfigurationPersistenceServiceDeploymentTest > addSemanticAndPlainJarToThisLocator FAILED

2022-04-01 Thread Anilkumar Gingade (Jira)


 [ 
https://issues.apache.org/jira/browse/GEODE-10210?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Anilkumar Gingade updated GEODE-10210:
--
Labels: windows  (was: needsTriage windows)

> [CI Failure] : InternalConfigurationPersistenceServiceDeploymentTest > 
> addSemanticAndPlainJarToThisLocator FAILED
> -
>
> Key: GEODE-10210
> URL: https://issues.apache.org/jira/browse/GEODE-10210
> Project: Geode
>  Issue Type: Bug
>Affects Versions: 1.15.0
>Reporter: Nabarun Nag
>Priority: Major
>  Labels: windows
>
> {code:java}
> InternalConfigurationPersistenceServiceDeploymentTest > 
> addSemanticAndPlainJarToThisLocator FAILED
> java.lang.AssertionError: 
> Expecting actual:
>   ["abc-1.0.jar", "abc.jar", "def-1.0.jar"]
> to contain exactly in any order:
>   ["abc.jar", "def-1.0.jar"]
> but the following elements were unexpected:
>   ["abc-1.0.jar"]
> at 
> org.apache.geode.distributed.internal.InternalConfigurationPersistenceServiceDeploymentTest.addSemanticAndPlainJarToThisLocator(InternalConfigurationPersistenceServiceDeploymentTest.java:173)
> Note: Some input files use or override a deprecated API.
> Note: Recompile with -Xlint:deprecation for details.
> Note: Some input files use unchecked or unsafe operations.
> Note: Recompile with -Xlint:unchecked for details.
> Note: 
> C:\\Users\\geode\\geode\\extensions\\geode-modules-test\\src\\main\\java\\org\\apache\\geode\\modules\\session\\AbstractSessionsTest.java
>  uses or overrides a deprecated API.
> Note: Recompile with -Xlint:deprecation for details.
> Note: 
> C:\\Users\\geode\\geode\\extensions\\geode-modules-test\\src\\main\\java\\org\\apache\\geode\\modules\\session\\catalina\\AbstractDeltaSessionManagerTest.java
>  uses unchecked or unsafe operations.
> Note: Recompile with -Xlint:unchecked for details.
> Note: Some input files use unchecked or unsafe operations.
> Note: Recompile with -Xlint:unchecked for details. {code}



--
This message was sent by Atlassian Jira
(v8.20.1#820001)


[jira] [Updated] (GEODE-10209) [CI Failure] : bind exception in InternalCacheForClientAccessDistributedTest

2022-04-01 Thread Anilkumar Gingade (Jira)


 [ 
https://issues.apache.org/jira/browse/GEODE-10209?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Anilkumar Gingade updated GEODE-10209:
--
Labels:   (was: needsTriage)

> [CI Failure] : bind exception in InternalCacheForClientAccessDistributedTest
> 
>
> Key: GEODE-10209
> URL: https://issues.apache.org/jira/browse/GEODE-10209
> Project: Geode
>  Issue Type: Bug
>  Components: ci
>Affects Versions: 1.15.0
>Reporter: Nabarun Nag
>Priority: Major
>
> This test class needs to be refactored to use AvailablePortHelper
> {code:java}
> InternalCacheForClientAccessDistributedTest > serverUsesFilteredCache FAILED
> org.gradle.internal.exceptions.DefaultMultiCauseException: Multiple 
> Failures (2 failures)
>   org.apache.geode.test.dunit.RMIException: While invoking 
> org.apache.geode.internal.cache.InternalCacheForClientAccessDistributedTest$$Lambda$402/0x000100357c40.run
>  in VM 0 running on Host 
> heavy-lifter-48c728b8-c947-5573-8aaa-909b857e1986.c.apachegeode-ci.internal 
> with 4 VMs
>   org.apache.geode.test.dunit.RMIException: While invoking 
> org.apache.geode.internal.cache.InternalCacheForClientAccessDistributedTest$$Lambda$404/0x000100354840.run
>  in VM 1 running on Host 
> heavy-lifter-48c728b8-c947-5573-8aaa-909b857e1986.c.apachegeode-ci.internal 
> with 4 VMs
> at 
> org.junit.vintage.engine.execution.TestRun.getStoredResultOrSuccessful(TestRun.java:196)
> at 
> org.junit.vintage.engine.execution.RunListenerAdapter.fireExecutionFinished(RunListenerAdapter.java:226)
> at 
> org.junit.vintage.engine.execution.RunListenerAdapter.testFinished(RunListenerAdapter.java:192)
> at 
> org.junit.vintage.engine.execution.RunListenerAdapter.testFinished(RunListenerAdapter.java:79)
> at 
> org.junit.runner.notification.SynchronizedRunListener.testFinished(SynchronizedRunListener.java:87)
> at 
> org.junit.runner.notification.RunNotifier$9.notifyListener(RunNotifier.java:225)
> at 
> org.junit.runner.notification.RunNotifier$SafeNotifier.run(RunNotifier.java:72)
> at 
> org.junit.runner.notification.RunNotifier.fireTestFinished(RunNotifier.java:222)
> at 
> org.junit.internal.runners.model.EachTestNotifier.fireTestFinished(EachTestNotifier.java:38)
> at org.junit.runners.ParentRunner.runLeaf(ParentRunner.java:372)
> at 
> org.junit.runners.BlockJUnit4ClassRunner.runChild(BlockJUnit4ClassRunner.java:103)
> at 
> org.junit.runners.BlockJUnit4ClassRunner.runChild(BlockJUnit4ClassRunner.java:63)
> at org.junit.runners.ParentRunner$4.run(ParentRunner.java:331)
> at org.junit.runners.ParentRunner$1.schedule(ParentRunner.java:79)
> at org.junit.runners.ParentRunner.runChildren(ParentRunner.java:329)
> at org.junit.runners.ParentRunner.access$100(ParentRunner.java:66)
> at org.junit.runners.ParentRunner$2.evaluate(ParentRunner.java:293)
> at org.junit.runners.ParentRunner$3.evaluate(ParentRunner.java:306)
> at org.junit.runners.ParentRunner.run(ParentRunner.java:413)
> at org.junit.runner.JUnitCore.run(JUnitCore.java:137)
> at org.junit.runner.JUnitCore.run(JUnitCore.java:115)
> at 
> org.junit.vintage.engine.execution.RunnerExecutor.execute(RunnerExecutor.java:42)
> at 
> org.junit.vintage.engine.VintageTestEngine.executeAllChildren(VintageTestEngine.java:80)
> at 
> org.junit.vintage.engine.VintageTestEngine.execute(VintageTestEngine.java:72)
> at 
> org.junit.platform.launcher.core.EngineExecutionOrchestrator.execute(EngineExecutionOrchestrator.java:108)
> at 
> org.junit.platform.launcher.core.EngineExecutionOrchestrator.execute(EngineExecutionOrchestrator.java:88)
> at 
> org.junit.platform.launcher.core.EngineExecutionOrchestrator.lambda$execute$0(EngineExecutionOrchestrator.java:54)
> at 
> org.junit.platform.launcher.core.EngineExecutionOrchestrator.withInterceptedStreams(EngineExecutionOrchestrator.java:67)
> at 
> org.junit.platform.launcher.core.EngineExecutionOrchestrator.execute(EngineExecutionOrchestrator.java:52)
> at 
> org.junit.platform.launcher.core.DefaultLauncher.execute(DefaultLauncher.java:96)
> at 
> org.junit.platform.launcher.core.DefaultLauncher.execute(DefaultLauncher.java:75)
> at 
> org.gradle.api.internal.tasks.testing.junitplatform.JUnitPlatformTestClassProcessor$CollectAllTestClassesExecutor.processAllTestClasses(JUnitPlatformTestClassProcessor.java:99)
> at 
> org.gradle.api.internal.tasks.testing.junitplatform.JUnitPlatformTestClassProcessor$CollectAllTestClassesExecutor.access$000(JUnitPlatformTestClassProcessor.java:79)
> at 
> 

[jira] [Updated] (GEODE-7710) CI Failure: JMXMBeanReconnectDUnitTest aka JmxServerReconnectDistributedTest fails intermittently because one locator is missing the LockServiceMXBean

2022-04-01 Thread Anilkumar Gingade (Jira)


 [ 
https://issues.apache.org/jira/browse/GEODE-7710?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Anilkumar Gingade updated GEODE-7710:
-
Labels: GeodeOperationAPI flaky pull-request-available  (was: 
GeodeOperationAPI flaky needsTriage pull-request-available)

> CI Failure: JMXMBeanReconnectDUnitTest aka JmxServerReconnectDistributedTest 
> fails intermittently because one locator is missing the LockServiceMXBean
> --
>
> Key: GEODE-7710
> URL: https://issues.apache.org/jira/browse/GEODE-7710
> Project: Geode
>  Issue Type: Bug
>  Components: tests
>Affects Versions: 1.13.0, 1.14.0, 1.15.0
>Reporter: Kirk Lund
>Priority: Major
>  Labels: GeodeOperationAPI, flaky, pull-request-available
>  Time Spent: 5h
>  Remaining Estimate: 0h
>
> This test fails due to GEODE-7739. Enter new failures against that ticket.
> Multiple tests in JMXMBeanReconnectDUnitTest may fail an await due to one of 
> the locators missing the LockServiceMXBean for the cluster config service.
> {noformat}
> but could not find:
>  
> <[GemFire:service=LockService,name=__CLUSTER_CONFIG_LS,type=Member,member=locator1]>
> {noformat}
> These test failures are caused by *GEODE-7739*.



--
This message was sent by Atlassian Jira
(v8.20.1#820001)


[jira] [Assigned] (GEODE-10200) [CI Failure] : SocketCreatorUpgradeTest > upgradingToNewGeodeAndNewJavaWithProtocolsAny[1.14.0] FAILED

2022-04-01 Thread Anilkumar Gingade (Jira)


 [ 
https://issues.apache.org/jira/browse/GEODE-10200?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Anilkumar Gingade reassigned GEODE-10200:
-

Assignee: Jacob Barrett

> [CI Failure] :  SocketCreatorUpgradeTest > 
> upgradingToNewGeodeAndNewJavaWithProtocolsAny[1.14.0] FAILED
> ---
>
> Key: GEODE-10200
> URL: https://issues.apache.org/jira/browse/GEODE-10200
> Project: Geode
>  Issue Type: Bug
>  Components: security
>Affects Versions: 1.15.0
>Reporter: Nabarun Nag
>Assignee: Jacob Barrett
>Priority: Major
>  Labels: needsTriage
>
>  
> {code:java}
> SocketCreatorUpgradeTest > 
> upgradingToNewGeodeAndNewJavaWithProtocolsAny[1.14.0] FAILED
> org.opentest4j.AssertionFailedError: [Exit value from process started by 
> [1fa9fcaebd8c018e: gfsh -e start locator --connect=false 
> --http-service-port=0 --name=locator2 
> --bind-address=heavy-lifter-5c2a1d0b-5930-5788-97d6-3ca24d2f026a.c.apachegeode-ci.internal
>  --port=21172 --J=-Dgemfire.jmx-manager-port=21173 
> --security-properties-file=/tmp/junit1876902159761664930/junit7901411307157053608.tmp
>  
> --locators=heavy-lifter-5c2a1d0b-5930-5788-97d6-3ca24d2f026a.c.apachegeode-ci.internal[21170]]]
>  
> expected: 0
>  but was: 1
> at 
> jdk.internal.reflect.NativeConstructorAccessorImpl.newInstance0(Native Method)
> at 
> jdk.internal.reflect.NativeConstructorAccessorImpl.newInstance(NativeConstructorAccessorImpl.java:62)
> at 
> jdk.internal.reflect.DelegatingConstructorAccessorImpl.newInstance(DelegatingConstructorAccessorImpl.java:45)
> at 
> org.apache.geode.test.junit.rules.gfsh.GfshExecution.awaitTermination(GfshExecution.java:103)
> at 
> org.apache.geode.test.junit.rules.gfsh.GfshRule.execute(GfshRule.java:154)
> at 
> org.apache.geode.test.junit.rules.gfsh.GfshRule.execute(GfshRule.java:133)
> at 
> org.apache.geode.internal.net.SocketCreatorUpgradeTest.upgradingToNewGeodeAndNewJavaWithProtocolsAny(SocketCreatorUpgradeTest.java:450)
>  {code}
> In the logs we can see that there are a lot SSLv2Hello not supported errors. 
> {code:java}
> [warn 2022/03/30 11:49:45.067 UTC locator2  
> tid=0x38] SSL handshake exception
> javax.net.ssl.SSLHandshakeException: SSLv2Hello is not enabled
>   at 
> sun.security.ssl.SSLEngineInputRecord.handleUnknownRecord(SSLEngineInputRecord.java:366)
>   at 
> sun.security.ssl.SSLEngineInputRecord.decode(SSLEngineInputRecord.java:193)
>   at 
> sun.security.ssl.SSLEngineInputRecord.decode(SSLEngineInputRecord.java:160)
>   at sun.security.ssl.SSLTransport.decode(SSLTransport.java:108)
>   at sun.security.ssl.SSLEngineImpl.decode(SSLEngineImpl.java:575)
>   at sun.security.ssl.SSLEngineImpl.readRecord(SSLEngineImpl.java:531)
>   at sun.security.ssl.SSLEngineImpl.unwrap(SSLEngineImpl.java:398)
>   at sun.security.ssl.SSLEngineImpl.unwrap(SSLEngineImpl.java:377)
>   at javax.net.ssl.SSLEngine.unwrap(SSLEngine.java:626)
>   at 
> org.apache.geode.internal.net.NioSslEngine.handshake(NioSslEngine.java:147)
>   at 
> org.apache.geode.internal.net.SocketCreator.handshakeSSLSocketChannel(SocketCreator.java:436)
>   at 
> org.apache.geode.internal.tcp.Connection.createIoFilter(Connection.java:1775)
>   at 
> org.apache.geode.internal.tcp.Connection.readMessages(Connection.java:1563)
>   at org.apache.geode.internal.tcp.Connection.run(Connection.java:1500)
>   at 
> java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1149)
>   at 
> java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:624)
>   at java.lang.Thread.run(Thread.java:748) {code}



--
This message was sent by Atlassian Jira
(v8.20.1#820001)


[jira] [Updated] (GEODE-10186) CI failure: RedundancyLevelPart1DUnitTest > testRedundancySpecifiedNonPrimaryEPFailsDetectionByCCU times out waiting for getClientProxies() to return more than 0 objects

2022-04-01 Thread Anilkumar Gingade (Jira)


 [ 
https://issues.apache.org/jira/browse/GEODE-10186?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Anilkumar Gingade updated GEODE-10186:
--
Labels:   (was: needsTriage)

> CI failure: RedundancyLevelPart1DUnitTest > 
> testRedundancySpecifiedNonPrimaryEPFailsDetectionByCCU times out waiting for 
> getClientProxies() to return more than 0 objects
> -
>
> Key: GEODE-10186
> URL: https://issues.apache.org/jira/browse/GEODE-10186
> Project: Geode
>  Issue Type: Bug
>  Components: client queues
>Affects Versions: 1.15.0
>Reporter: Bill Burcham
>Priority: Major
>
> Failed here: [https://hydradb.hdb.gemfire-ci.info/hdb/testresult/14277358]
>  
> {noformat}
> > Task :geode-core:distributedTest
> RedundancyLevelPart1DUnitTest > 
> testRedundancySpecifiedNonPrimaryEPFailsDetectionByCCU FAILED
> org.apache.geode.test.dunit.RMIException: While invoking 
> org.apache.geode.internal.cache.tier.sockets.RedundancyLevelPart1DUnitTest$$Lambda$543/510122765.run
>  in VM 2 running on Host 
> heavy-lifter-f58561da-caf9-5bc0-a7fa-f938c3fd1e51.c.apachegeode-ci.internal 
> with 4 VMs
> at org.apache.geode.test.dunit.VM.executeMethodOnObject(VM.java:631)
> at org.apache.geode.test.dunit.VM.invoke(VM.java:448)
> at 
> org.apache.geode.internal.cache.tier.sockets.RedundancyLevelPart1DUnitTest.testRedundancySpecifiedNonPrimaryEPFailsDetectionByCCU(RedundancyLevelPart1DUnitTest.java:284)
> Caused by:
> org.awaitility.core.ConditionTimeoutException: Assertion condition 
> defined as a lambda expression in 
> org.apache.geode.internal.cache.tier.sockets.RedundancyLevelPart1DUnitTest 
> that uses org.apache.geode.internal.cache.tier.sockets.CacheClientNotifier 
> Expecting actual:
>   0
> to be greater than:
>   0
>  within 5 minutes.
> at 
> org.awaitility.core.ConditionAwaiter.await(ConditionAwaiter.java:167)
> at 
> org.awaitility.core.AssertionCondition.await(AssertionCondition.java:119)
> at 
> org.awaitility.core.AssertionCondition.await(AssertionCondition.java:31)
> at 
> org.awaitility.core.ConditionFactory.until(ConditionFactory.java:985)
> at 
> org.awaitility.core.ConditionFactory.untilAsserted(ConditionFactory.java:769)
> at 
> org.apache.geode.internal.cache.tier.sockets.RedundancyLevelPart1DUnitTest.verifyInterestRegistration(RedundancyLevelPart1DUnitTest.java:505)
> Caused by:
> java.lang.AssertionError: 
> Expecting actual:
>   0
> to be greater than:
>   0
> at 
> org.apache.geode.internal.cache.tier.sockets.RedundancyLevelPart1DUnitTest.lambda$verifyInterestRegistration$19(RedundancyLevelPart1DUnitTest.java:506)
> 8352 tests completed, 1 failed, 414 skipped
> =-=-=-=-=-=-=-=-=-=-=-=-=-=-=  Test Results URI 
> =-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=
> http://files.apachegeode-ci.info/builds/apache-develop-mass-test-run/1.15.0-build.1033/test-results/distributedTest/1648331031/
> =-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=
> Test report artifacts from this job are available at:
> http://files.apachegeode-ci.info/builds/apache-develop-mass-test-run/1.15.0-build.1033/test-artifacts/1648331031/distributedtestfiles-openjdk8-1.15.0-build.1033.tgz
> {noformat}



--
This message was sent by Atlassian Jira
(v8.20.1#820001)


[jira] [Updated] (GEODE-10192) CI hang: testRollingEnabledRecoverValuesTruePersistWithOverFlowWithEarlyTerminationOfRoller

2022-04-01 Thread Anilkumar Gingade (Jira)


 [ 
https://issues.apache.org/jira/browse/GEODE-10192?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Anilkumar Gingade updated GEODE-10192:
--
Labels:   (was: needsTriage)

> CI hang: 
> testRollingEnabledRecoverValuesTruePersistWithOverFlowWithEarlyTerminationOfRoller
> ---
>
> Key: GEODE-10192
> URL: https://issues.apache.org/jira/browse/GEODE-10192
> Project: Geode
>  Issue Type: Bug
>  Components: persistence
>Affects Versions: 1.15.0
>Reporter: Bill Burcham
>Priority: Major
>
> Hung here: 
> [https://concourse.apachegeode-ci.info/teams/main/pipelines/apache-develop-main/jobs/integration-test-openjdk8/builds/246#C]
>  
>  
> {noformat}
> > Task :geode-for-redis:integrationTest
> timeout exceeded
> =-=-=-=-=-=-=-=-=-=-=-=-=-=-=  Test Results URI 
> =-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=
> http://files.apachegeode-ci.info/builds/apache-develop-main/1.15.0-build.1035/test-results/integrationTest/1648477166/
> =-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=
> Test report artifacts from this job are available at:
> http://files.apachegeode-ci.info/builds/apache-develop-main/1.15.0-build.1035/test-artifacts/1648477166/integrationtestfiles-openjdk8-1.15.0-build.1035.tgz{noformat}
> The only test in the "started" state is:
>  
> {noformat}
>   |2.3.1| bburcham-a01 in 
> ~/Downloads/integrationtestfiles-openjdk8-1.15.0-build.1035
> ○ → progress -s started
> org.apache.geode.internal.cache.DiskRandomOperationsAndRecoveryJUnitTest.testRollingEnabledRecoverValuesTruePersistWithOverFlowWithEarlyTerminationOfRoller
>     Iteration: 1
>     Start:     2022-03-28 13:41:07.109 +
>     End:       0001-01-01 00:00:00.000 +
>     Duration:  0s
>     Status:    started
> {noformat}
> That JUnit test takes about 20s to run on a Macbook Pro.



--
This message was sent by Atlassian Jira
(v8.20.1#820001)


[jira] [Updated] (GEODE-10147) CI Failure: [ benchmark-with-ssl ] FAILURE: Build failed with an exception while trying to run ssl benchmarks

2022-04-01 Thread Anilkumar Gingade (Jira)


 [ 
https://issues.apache.org/jira/browse/GEODE-10147?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Anilkumar Gingade updated GEODE-10147:
--
Labels:   (was: needsTriage)

> CI Failure: [ benchmark-with-ssl ]  FAILURE: Build failed with an exception 
> while trying to run ssl benchmarks
> --
>
> Key: GEODE-10147
> URL: https://issues.apache.org/jira/browse/GEODE-10147
> Project: Geode
>  Issue Type: Bug
>  Components: benchmarks
>Affects Versions: 1.15.0
>Reporter: Nabarun Nag
>Priority: Major
>
> {code:java}
> This is ITERATION 1 of benchmarking against baseline.
> > Task :geode-benchmarks:benchmark
> P2pPartitionedPutBenchmark > run() FAILED
> net.schmizz.sshj.transport.TransportException: Connection reset
> at 
> net.schmizz.sshj.transport.TransportImpl.init(TransportImpl.java:194)
> at net.schmizz.sshj.SSHClient.onConnect(SSHClient.java:793)
> at net.schmizz.sshj.SocketClient.connect(SocketClient.java:178)
> at 
> org.apache.geode.perftest.infrastructure.ssh.SshInfrastructure.getSSHClient(SshInfrastructure.java:74)
> at 
> org.apache.geode.perftest.infrastructure.ssh.SshInfrastructure.copyFromNode(SshInfrastructure.java:185)
> at 
> org.apache.geode.perftest.jvms.RemoteJVMs.copyResults(RemoteJVMs.java:87)
> at 
> org.apache.geode.perftest.runner.DefaultTestRunner.runTest(DefaultTestRunner.java:112)
> at 
> org.apache.geode.perftest.runner.DefaultTestRunner.runTest(DefaultTestRunner.java:65)
> at 
> org.apache.geode.benchmark.tests.P2pPartitionedPutBenchmark.run(P2pPartitionedPutBenchmark.java:52)
> Caused by:
> java.net.SocketException: Connection reset
> at java.net.SocketInputStream.read(SocketInputStream.java:186)
> at java.net.SocketInputStream.read(SocketInputStream.java:140)
> at java.net.SocketInputStream.read(SocketInputStream.java:200)
> at 
> net.schmizz.sshj.transport.TransportImpl.receiveServerIdent(TransportImpl.java:211)
> at 
> net.schmizz.sshj.transport.TransportImpl.init(TransportImpl.java:187)
> ... 8 more
> 4 tests completed, 1 failed
> This is ITERATION 2 of benchmarking against baseline.
> > Task :geode-benchmarks:benchmark
> ReplicatedPutBenchmark > run() FAILED
> java.util.concurrent.CompletionException: java.io.UncheckedIOException: 
> net.schmizz.sshj.transport.TransportException: Server closed connection 
> during identification exchange
> at 
> java.util.concurrent.CompletableFuture.encodeThrowable(CompletableFuture.java:314)
> at 
> java.util.concurrent.CompletableFuture.completeThrowable(CompletableFuture.java:319)
> at 
> java.util.concurrent.CompletableFuture$AsyncRun.run(CompletableFuture.java:1739)
> at 
> java.util.concurrent.CompletableFuture$AsyncRun.exec(CompletableFuture.java:1728)
> at java.util.concurrent.ForkJoinTask.doExec(ForkJoinTask.java:290)
> at 
> java.util.concurrent.ForkJoinPool$WorkQueue.topLevelExec(ForkJoinPool.java:1020)
> at java.util.concurrent.ForkJoinPool.scan(ForkJoinPool.java:1656)
> at java.util.concurrent.ForkJoinPool.runWorker(ForkJoinPool.java:1594)
> at 
> java.util.concurrent.ForkJoinWorkerThread.run(ForkJoinWorkerThread.java:183)
> Caused by:
> java.io.UncheckedIOException: 
> net.schmizz.sshj.transport.TransportException: Server closed connection 
> during identification exchange
> at 
> org.apache.geode.perftest.infrastructure.ssh.SshInfrastructure.lambda$copyToNodes$1(SshInfrastructure.java:176)
> at 
> java.util.concurrent.CompletableFuture$AsyncRun.run(CompletableFuture.java:1736)
> ... 6 more
> Caused by:
> net.schmizz.sshj.transport.TransportException: Server closed 
> connection during identification exchange
> at 
> net.schmizz.sshj.transport.TransportImpl.init(TransportImpl.java:194)
> at net.schmizz.sshj.SSHClient.onConnect(SSHClient.java:793)
> at 
> net.schmizz.sshj.SocketClient.connect(SocketClient.java:178)
> at 
> org.apache.geode.perftest.infrastructure.ssh.SshInfrastructure.getSSHClient(SshInfrastructure.java:74)
> at 
> org.apache.geode.perftest.infrastructure.ssh.SshInfrastructure.lambda$copyToNodes$1(SshInfrastructure.java:158)
> ... 7 more
> Caused by:
> net.schmizz.sshj.transport.TransportException: Server closed 
> connection during identification exchange
> at 
> net.schmizz.sshj.transport.TransportImpl.receiveServerIdent(TransportImpl.java:214)
> at 
> 

[jira] [Resolved] (GEODE-10107) CI Failure: ReplicateRegionNetsearchDistributedTest > proxyReplicateDoesNetsearchFromOnlyOneFullReplicate

2022-04-01 Thread Anilkumar Gingade (Jira)


 [ 
https://issues.apache.org/jira/browse/GEODE-10107?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Anilkumar Gingade resolved GEODE-10107.
---
Resolution: Won't Fix

Closing i as infrastructure issue. The week this was reproduced was flaky. Will 
open if it surfaces again.

> CI Failure: ReplicateRegionNetsearchDistributedTest > 
> proxyReplicateDoesNetsearchFromOnlyOneFullReplicate
> -
>
> Key: GEODE-10107
> URL: https://issues.apache.org/jira/browse/GEODE-10107
> Project: Geode
>  Issue Type: Bug
>  Components: core, regions
>Affects Versions: 1.15.0
>Reporter: Jens Deppe
>Priority: Major
>  Labels: needsTriage
>
> [https://concourse.apachegeode-ci.info/teams/main/pipelines/apache-develop-mass-test-run/jobs/distributed-test-openjdk8/]
>  
> {noformat}
> ReplicateRegionNetsearchDistributedTest > 
> proxyReplicateDoesNetsearchFromOnlyOneFullReplicate FAILED
>     org.opentest4j.AssertionFailedError: 
>     expected: 1L
>      but was: 0L
>         at sun.reflect.NativeConstructorAccessorImpl.newInstance0(Native 
> Method)
>         at 
> sun.reflect.NativeConstructorAccessorImpl.newInstance(NativeConstructorAccessorImpl.java:62)
>         at 
> sun.reflect.DelegatingConstructorAccessorImpl.newInstance(DelegatingConstructorAccessorImpl.java:45)
>         at 
> org.apache.geode.cache.ReplicateRegionNetsearchDistributedTest.proxyReplicateDoesNetsearchFromOnlyOneFullReplicate(ReplicateRegionNetsearchDistributedTest.java:391)
>  {noformat}
>  
> =-=-=-=-=-=-=-=-=-=-=-=-=-=-=  Test Results URI 
> =-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=
> http://files.apachegeode-ci.info/builds/apache-develop-mass-test-run/1.16.0-build.0115/test-results/distributedTest/1646476338/
> =-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=
> Test report artifacts from this job are available at:
> http://files.apachegeode-ci.info/builds/apache-develop-mass-test-run/1.16.0-build.0115/test-artifacts/1646476338/distributedtestfiles-openjdk8-1.16.0-build.0115.tgz



--
This message was sent by Atlassian Jira
(v8.20.1#820001)


[jira] [Updated] (GEODE-10160) SizeableByteArrayList does not update the sizeInBytes for some non-overriden methods

2022-04-01 Thread Anilkumar Gingade (Jira)


 [ 
https://issues.apache.org/jira/browse/GEODE-10160?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Anilkumar Gingade updated GEODE-10160:
--
Labels: pull-request-available redis-triage  (was: blocks-1.15.0​ 
pull-request-available)

> SizeableByteArrayList does not update the sizeInBytes for some non-overriden 
> methods
> 
>
> Key: GEODE-10160
> URL: https://issues.apache.org/jira/browse/GEODE-10160
> Project: Geode
>  Issue Type: Bug
>  Components: redis
>Affects Versions: 1.15.0
>Reporter: Hale Bales
>Assignee: Steve Sienkowski
>Priority: Major
>  Labels: pull-request-available, redis-triage
>
> Some List methods aren't overriden in SizeableByteArrayList which means that 
> the memoryOverhead doesn't get updated. add(int index, byte[] element), 
> clear(), and set(int index, byte[] newElement) all need to update the size 
> and don't.
> This can be accomplished by adding overrides to SizeableByteArrayList:
> {code:java}
>   @Override
>   public byte[] set(int index, byte[] newElement) {
> byte[] replacedElement = super.set(index, newElement);
> memberOverhead -= calculateByteArrayOverhead(replacedElement);
> memberOverhead += calculateByteArrayOverhead(newElement);
> return replacedElement;
>   }
>   @Override
>   public void add(int index, byte[] element) {
> memberOverhead += calculateByteArrayOverhead(element);
> super.add(index, element);
>   }
> {code}
> The test for set could look something like:
> {code:java}
>   @Test
>   public void sizeInBytesGetsUpdatedAccurately_whenDoingSets() {
> SizeableByteArrayList list = new SizeableByteArrayList();
> byte[] element = "element".getBytes(StandardCharsets.UTF_8);
> list.addFirst(element);
> long initialSize = list.getSizeInBytes();
> assertThat(initialSize).isEqualTo(sizer.sizeof(list));
> list.set(0, "some really big new element 
> name".getBytes(StandardCharsets.UTF_8));
> assertThat(list.getSizeInBytes()).isEqualTo(sizer.sizeof(list));
> list.set(0, element);
> assertThat(list.getSizeInBytes()).isEqualTo(initialSize);
>   }
> {code}
> We need more tests than just this one. add(int, byte[]) needs to be tested as 
> well. Any method on SizeableByteArrayList that modify the data should have a 
> test that the memoryOverhead gets updated correctly.
> Clear itself isn't a problem - just set the memberOverhead to 0 - but the 
> issue is that for the version of LTRIM in 
> [PR#7403|https://github.com/apache/geode/pull/7403], we clear a sublist of 
> SizeableByteArrayList. This means that the overridden clear method does not 
> get called and the LinkedList implementation of clear does not call any other 
> methods that we can override. There needs to be some new approach to LTRIM 
> that doesn't use sublists.



--
This message was sent by Atlassian Jira
(v8.20.1#820001)


[jira] [Updated] (GEODE-10171) AbstractRedisData version being incremented for no-op operations can lead to delta not being applied on secondary

2022-04-01 Thread Anilkumar Gingade (Jira)


 [ 
https://issues.apache.org/jira/browse/GEODE-10171?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Anilkumar Gingade updated GEODE-10171:
--
Labels: pull-request-available redis-triage  (was: blocks-1.15.0​ 
pull-request-available)

> AbstractRedisData version being incremented for no-op operations can lead to 
> delta not being applied on secondary
> -
>
> Key: GEODE-10171
> URL: https://issues.apache.org/jira/browse/GEODE-10171
> Project: Geode
>  Issue Type: Bug
>  Components: redis
>Affects Versions: 1.15.0
>Reporter: Donal Evans
>Assignee: Kristen
>Priority: Major
>  Labels: pull-request-available, redis-triage
>
> For SADD, which may or may not modify the data in the region depending on 
> whether the member being added is already present in the set, the version in 
> AbstractRedisData is updated regardless of whether a Delta is sent to the 
> secondary. This can lead to the version on the primary wrapping around to be 
> equal to the version on the secondary, which means that if a delta is sent, 
> it will not be applied on the secondary, leading to potential data loss.
> Below is a test to show this behaviour:
> {code:java}
> @Test
> public void deltaVersionOnPrimary_shouldNotUpdate_ifNoDeltaSent() {
>   String originalMember = "fixedMemberName";
>   String key = clusterStartUp.getKeyOnServer("tag", 1);
>   
>   // Version of primary = 0
>   // Version of secondary = 0
>   jedis.sadd(key, originalMember);
>   // No changes are made to the set, since adding an already existing member 
> doesn't modify it
>   // Version of primary wraps around back to -1
>   // Version of secondary doesn't change because we don't send a delta
>   for (int i = 0; i < 255; ++i) {
> jedis.sadd(key, originalMember);
>   }
>   String newMember = "aNewMemberName";
>   // Version of primary = 0
>   // Version of secondary = 0, delta is not applied
>   jedis.sadd(key, newMember);
>   assertThat(jedis.smembers(key)).containsExactlyInAnyOrder(originalMember, 
> newMember);
>   clusterStartUp.crashVM(1); // kill primary server
>   assertThat(jedis.smembers(key)).containsExactlyInAnyOrder(originalMember, 
> newMember);
> } {code}
> The example here uses SADD, but potentially, any command that can be a no-op 
> and uses a versioned Delta is at risk of hitting this. All commands should be 
> checked to ensure they're not vulnerable to this bug.



--
This message was sent by Atlassian Jira
(v8.20.1#820001)


[jira] [Updated] (GEODE-10091) Benchmark instability in RedisZaddAndZremBenchmark

2022-04-01 Thread Anilkumar Gingade (Jira)


 [ 
https://issues.apache.org/jira/browse/GEODE-10091?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Anilkumar Gingade updated GEODE-10091:
--
Labels: redis-triage  (was: )

> Benchmark instability in RedisZaddAndZremBenchmark
> --
>
> Key: GEODE-10091
> URL: https://issues.apache.org/jira/browse/GEODE-10091
> Project: Geode
>  Issue Type: Bug
>  Components: benchmarks, redis
>Affects Versions: 1.15.0
>Reporter: Donal Evans
>Assignee: Eric Zoerner
>Priority: Major
>  Labels: redis-triage
>
> {noformat}
> This is ITERATION 1 of benchmarking against baseline.
>    RedisGetBenchmark avg ops/sec  
> Baseline:428358.55  Test:446241.49  Difference:   +4.2%
>  avg latency  
> Baseline:   1678809.55  Test:   1611045.34  Difference:   -4.0%
>   RedisHgetBenchmark avg ops/sec  
> Baseline:438535.25  Test:441078.97  Difference:   +0.6%
>  avg latency  
> Baseline:   1638968.92  Test:   1630999.16  Difference:   -0.5%
>    RedisHsetAndHgetBenchmark avg ops/sec  
> Baseline:191331.54  Test:186806.37  Difference:   -2.4%
>  avg latency  
> Baseline:   3759694.09  Test:   3850396.04  Difference:   +2.4%
>   RedisHsetBenchmark avg ops/sec  
> Baseline:308079.41  Test:303715.78  Difference:   -1.4%
>  avg latency  
> Baseline:   2332102.91  Test:   2367639.93  Difference:   +1.5%
>    RedisSetBenchmark avg ops/sec  
> Baseline:318147.74  Test:326996.22  Difference:   +2.8%
>  avg latency  
> Baseline:   2263687.57  Test:   2198181.68  Difference:   -2.9%
>    RedisWeightedHsetAndHgetBenchmark avg ops/sec  
> Baseline:390383.79  Test:385565.48  Difference:   -1.2%
>  avg latency  
> Baseline:   1842561.96  Test:   1864716.09  Difference:   +1.2%
>  RedisWeightedZaddAndZrangeBenchmark avg ops/sec  
> Baseline:375078.82  Test:350886.03  Difference:   -6.5%
>  avg latency  
> Baseline:   1917476.15  Test:   2049452.87  Difference:   +6.9%
>    RedisZaddAndZremBenchmark avg ops/sec  
> Baseline:124763.60  Test:118600.72  Difference:   -4.9%
>  avg latency  
> Baseline:   5766048.34  Test:   6065491.13  Difference:   +5.2%
>   RedisZaddBenchmark avg ops/sec  
> Baseline:432494.29  Test:438594.53  Difference:   +1.4%
>  avg latency  
> Baseline:   1662639.09  Test:   1639906.18  Difference:   -1.4%
> RedisZrangeBenchmark avg ops/sec  
> Baseline:295523.77  Test:340381.87  Difference:  +15.2%
>  avg latency  
> Baseline:   2433250.54  Test:   2112961.66  Difference:  -13.2%
>  RedisZrangeByScoreBenchmark avg ops/sec  
> Baseline:285836.33  Test:356142.91  Difference:  +24.6%
>  avg latency  
> Baseline:   2515708.23  Test:   2019249.68  Difference:  -19.7%
> This is ITERATION 2 of benchmarking against baseline.
>  RedisWeightedZaddAndZrangeBenchmark avg ops/sec  
> Baseline:358570.33  Test:348012.02  Difference:   -2.9%
>  avg latency  
> Baseline:   2005804.05  Test:   2066535.24  Difference:   +3.0%
>    RedisZaddAndZremBenchmark avg ops/sec  
> Baseline:122900.38  Test:115226.22  Difference:   -6.2%
>  avg latency  
> Baseline:   5858357.13  Test:   6243018.38  Difference:   +6.6%
> This is ITERATION 3 of benchmarking against baseline.
>    RedisZaddAndZremBenchmark avg ops/sec  
> Baseline:125677.36  Test:118359.94  Difference:   -5.8%
>  avg latency  
> Baseline:   5721859.26  Test:   6082944.28  Difference:   +6.3%
> This is ITERATION 4 of benchmarking against baseline.
> 

[jira] [Updated] (GEODE-10091) Benchmark instability in RedisZaddAndZremBenchmark

2022-04-01 Thread Anilkumar Gingade (Jira)


 [ 
https://issues.apache.org/jira/browse/GEODE-10091?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Anilkumar Gingade updated GEODE-10091:
--
Labels:   (was: needsTriage)

> Benchmark instability in RedisZaddAndZremBenchmark
> --
>
> Key: GEODE-10091
> URL: https://issues.apache.org/jira/browse/GEODE-10091
> Project: Geode
>  Issue Type: Bug
>  Components: benchmarks, redis
>Affects Versions: 1.15.0
>Reporter: Donal Evans
>Assignee: Eric Zoerner
>Priority: Major
>
> {noformat}
> This is ITERATION 1 of benchmarking against baseline.
>    RedisGetBenchmark avg ops/sec  
> Baseline:428358.55  Test:446241.49  Difference:   +4.2%
>  avg latency  
> Baseline:   1678809.55  Test:   1611045.34  Difference:   -4.0%
>   RedisHgetBenchmark avg ops/sec  
> Baseline:438535.25  Test:441078.97  Difference:   +0.6%
>  avg latency  
> Baseline:   1638968.92  Test:   1630999.16  Difference:   -0.5%
>    RedisHsetAndHgetBenchmark avg ops/sec  
> Baseline:191331.54  Test:186806.37  Difference:   -2.4%
>  avg latency  
> Baseline:   3759694.09  Test:   3850396.04  Difference:   +2.4%
>   RedisHsetBenchmark avg ops/sec  
> Baseline:308079.41  Test:303715.78  Difference:   -1.4%
>  avg latency  
> Baseline:   2332102.91  Test:   2367639.93  Difference:   +1.5%
>    RedisSetBenchmark avg ops/sec  
> Baseline:318147.74  Test:326996.22  Difference:   +2.8%
>  avg latency  
> Baseline:   2263687.57  Test:   2198181.68  Difference:   -2.9%
>    RedisWeightedHsetAndHgetBenchmark avg ops/sec  
> Baseline:390383.79  Test:385565.48  Difference:   -1.2%
>  avg latency  
> Baseline:   1842561.96  Test:   1864716.09  Difference:   +1.2%
>  RedisWeightedZaddAndZrangeBenchmark avg ops/sec  
> Baseline:375078.82  Test:350886.03  Difference:   -6.5%
>  avg latency  
> Baseline:   1917476.15  Test:   2049452.87  Difference:   +6.9%
>    RedisZaddAndZremBenchmark avg ops/sec  
> Baseline:124763.60  Test:118600.72  Difference:   -4.9%
>  avg latency  
> Baseline:   5766048.34  Test:   6065491.13  Difference:   +5.2%
>   RedisZaddBenchmark avg ops/sec  
> Baseline:432494.29  Test:438594.53  Difference:   +1.4%
>  avg latency  
> Baseline:   1662639.09  Test:   1639906.18  Difference:   -1.4%
> RedisZrangeBenchmark avg ops/sec  
> Baseline:295523.77  Test:340381.87  Difference:  +15.2%
>  avg latency  
> Baseline:   2433250.54  Test:   2112961.66  Difference:  -13.2%
>  RedisZrangeByScoreBenchmark avg ops/sec  
> Baseline:285836.33  Test:356142.91  Difference:  +24.6%
>  avg latency  
> Baseline:   2515708.23  Test:   2019249.68  Difference:  -19.7%
> This is ITERATION 2 of benchmarking against baseline.
>  RedisWeightedZaddAndZrangeBenchmark avg ops/sec  
> Baseline:358570.33  Test:348012.02  Difference:   -2.9%
>  avg latency  
> Baseline:   2005804.05  Test:   2066535.24  Difference:   +3.0%
>    RedisZaddAndZremBenchmark avg ops/sec  
> Baseline:122900.38  Test:115226.22  Difference:   -6.2%
>  avg latency  
> Baseline:   5858357.13  Test:   6243018.38  Difference:   +6.6%
> This is ITERATION 3 of benchmarking against baseline.
>    RedisZaddAndZremBenchmark avg ops/sec  
> Baseline:125677.36  Test:118359.94  Difference:   -5.8%
>  avg latency  
> Baseline:   5721859.26  Test:   6082944.28  Difference:   +6.3%
> This is ITERATION 4 of benchmarking against baseline.
>    RedisZaddAndZremBenchmark avg 

[jira] [Commented] (GEODE-10148) [CI Failure] : JMXMBeanFederationDUnitTest > MBeanFederationAddRemoveServer FAILED

2022-03-23 Thread Anilkumar Gingade (Jira)


[ 
https://issues.apache.org/jira/browse/GEODE-10148?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17511454#comment-17511454
 ] 

Anilkumar Gingade commented on GEODE-10148:
---

>From Barry:
>> The communication between servers and the JMX manager (locator) is async (a 
>> no-ack region). This test is most likely failing because of that.


> [CI Failure] : JMXMBeanFederationDUnitTest > MBeanFederationAddRemoveServer 
> FAILED
> --
>
> Key: GEODE-10148
> URL: https://issues.apache.org/jira/browse/GEODE-10148
> Project: Geode
>  Issue Type: Bug
>  Components: jmx
>Affects Versions: 1.15.0
>Reporter: Nabarun Nag
>Priority: Major
>  Labels: needsTriage
>
> JMXMBeanFederationDUnitTest > MBeanFederationAddRemoveServer FAILED
> java.lang.AssertionError: 
> Expecting actual:
>   ["GemFire:service=AccessControl,type=Distributed",
> "GemFire:service=CacheServer,port=20842,type=Member,member=server-1",
> "GemFire:service=CacheServer,port=20846,type=Member,member=server-2",
> 
> "GemFire:service=DiskStore,name=cluster_config,type=Member,member=locator-one",
> "GemFire:service=FileUploader,type=Distributed",
> "GemFire:service=Locator,type=Member,member=locator-one",
> 
> "GemFire:service=LockService,name=__CLUSTER_CONFIG_LS,type=Distributed",
> 
> "GemFire:service=LockService,name=__CLUSTER_CONFIG_LS,type=Member,member=locator-one",
> "GemFire:service=Manager,type=Member,member=locator-one",
> "GemFire:service=Region,name="/test-region-1",type=Distributed",
> 
> "GemFire:service=Region,name="/test-region-1",type=Member,member=server-1",
> 
> "GemFire:service=Region,name="/test-region-1",type=Member,member=server-2",
> 
> "GemFire:service=Region,name="/test-region-1",type=Member,member=server-3",
> "GemFire:service=System,type=Distributed",
> "GemFire:type=Member,member=locator-one",
> "GemFire:type=Member,member=server-1",
> "GemFire:type=Member,member=server-2",
> "GemFire:type=Member,member=server-3"]
> to contain exactly (and in same order):
>   ["GemFire:service=AccessControl,type=Distributed",
> "GemFire:service=CacheServer,port=20842,type=Member,member=server-1",
> "GemFire:service=CacheServer,port=20846,type=Member,member=server-2",
> "GemFire:service=CacheServer,port=20850,type=Member,member=server-3",
> 
> "GemFire:service=DiskStore,name=cluster_config,type=Member,member=locator-one",
> "GemFire:service=FileUploader,type=Distributed",
> "GemFire:service=Locator,type=Member,member=locator-one",
> 
> "GemFire:service=LockService,name=__CLUSTER_CONFIG_LS,type=Distributed",
> 
> "GemFire:service=LockService,name=__CLUSTER_CONFIG_LS,type=Member,member=locator-one",
> "GemFire:service=Manager,type=Member,member=locator-one",
> "GemFire:service=Region,name="/test-region-1",type=Distributed",
> 
> "GemFire:service=Region,name="/test-region-1",type=Member,member=server-1",
> 
> "GemFire:service=Region,name="/test-region-1",type=Member,member=server-2",
> 
> "GemFire:service=Region,name="/test-region-1",type=Member,member=server-3",
> "GemFire:service=System,type=Distributed",
> "GemFire:type=Member,member=locator-one",
> "GemFire:type=Member,member=server-1",
> "GemFire:type=Member,member=server-2",
> "GemFire:type=Member,member=server-3"]
> but could not find the following elements:
>   ["GemFire:service=CacheServer,port=20850,type=Member,member=server-3"]
> at 
> org.apache.geode.management.internal.JMXMBeanFederationDUnitTest.MBeanFederationAddRemoveServer(JMXMBeanFederationDUnitTest.java:130)
> 8352 tests completed, 1 failed, 414 skipped



--
This message was sent by Atlassian Jira
(v8.20.1#820001)


[jira] [Assigned] (GEODE-10121) Transactional Redis commands are not actually transactional

2022-03-22 Thread Anilkumar Gingade (Jira)


 [ 
https://issues.apache.org/jira/browse/GEODE-10121?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Anilkumar Gingade reassigned GEODE-10121:
-

Assignee: (was: Anilkumar Gingade)

> Transactional Redis commands are not actually transactional
> ---
>
> Key: GEODE-10121
> URL: https://issues.apache.org/jira/browse/GEODE-10121
> Project: Geode
>  Issue Type: Bug
>  Components: redis
>Affects Versions: 1.15.0
>Reporter: Donal Evans
>Priority: Major
>  Labels: blocks-1.15.0​
>
> The following test (if added to MSetDUnitTest) is intended to show that MSET 
> is transactional in nature by having the final key to be set throw an 
> exception, which should roll back the previous set values:
> {code:java}
>   public static final String THROWING_REDIS_STRING_EXCEPTION = "to be 
> ignored";
>   
>   @Test
>   public void mset_isTransactional() {
> IgnoredException.addIgnoredException(THROWING_REDIS_STRING_EXCEPTION);
> String hashTag = "{" + clusterStartUp.getKeyOnServer("tag", 1) + "}";
> String key1 = hashTag + "key1";
> String value1 = "value1";
> jedis.set(key1, value1);
> String key2 = hashTag + "key2";
> String value2 = "value2";
> jedis.set(key2, value2);
> String throwingRedisStringKey = hashTag + "ThrowingRedisString";
> String throwingStringValue = "ThrowingRedisStringValue";
> // Put a test version of RedisString directly into the region that throws 
> if the multi-argument set() is called on it
> clusterStartUp.getMember(1).invoke(() -> {
>   RedisKey throwingKey = new 
> RedisKey(throwingRedisStringKey.getBytes(StandardCharsets.UTF_8));
>   ThrowingRedisString throwingRedisString = new ThrowingRedisString();
>   
> throwingRedisString.set(throwingStringValue.getBytes(StandardCharsets.UTF_8));
>   
> ClusterStartupRule.getCache().getRegion(DEFAULT_REDIS_REGION_NAME).put(throwingKey,
>  throwingRedisString);
> });
> String newValue = "should_not_be_set";
> assertThatThrownBy(() -> jedis.mset(key1, newValue, key2, newValue, 
> throwingRedisStringKey, newValue)).hasMessage(SERVER_ERROR_MESSAGE);
> 
> assertThat(jedis.get(key1)).isEqualTo(value1);
> assertThat(jedis.get(key2)).isEqualTo(value2);
> 
> assertThat(jedis.get(throwingRedisStringKey)).isEqualTo(throwingStringValue);
> IgnoredException.removeAllExpectedExceptions();
>   }
>   static class ThrowingRedisString extends RedisString {
> @Override
> public void set(Region region, RedisKey key, byte[] 
> newValue,
> SetOptions options) {
>   throw new RuntimeException(THROWING_REDIS_STRING_EXCEPTION);
> }
>   }{code}
> This test fails due to {{key1}} having its value set to {{newValue}} despite 
> the fact that MSET is supposed to be atomic.
> Given the similar implementations each command uses, it is very likely that 
> MSETNX and SMOVE also show the same behaviour.



--
This message was sent by Atlassian Jira
(v8.20.1#820001)


[jira] [Assigned] (GEODE-10121) Transactional Redis commands are not actually transactional

2022-03-22 Thread Anilkumar Gingade (Jira)


 [ 
https://issues.apache.org/jira/browse/GEODE-10121?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Anilkumar Gingade reassigned GEODE-10121:
-

Assignee: Anilkumar Gingade

> Transactional Redis commands are not actually transactional
> ---
>
> Key: GEODE-10121
> URL: https://issues.apache.org/jira/browse/GEODE-10121
> Project: Geode
>  Issue Type: Bug
>  Components: redis
>Affects Versions: 1.15.0
>Reporter: Donal Evans
>Assignee: Anilkumar Gingade
>Priority: Major
>  Labels: blocks-1.15.0​
>
> The following test (if added to MSetDUnitTest) is intended to show that MSET 
> is transactional in nature by having the final key to be set throw an 
> exception, which should roll back the previous set values:
> {code:java}
>   public static final String THROWING_REDIS_STRING_EXCEPTION = "to be 
> ignored";
>   
>   @Test
>   public void mset_isTransactional() {
> IgnoredException.addIgnoredException(THROWING_REDIS_STRING_EXCEPTION);
> String hashTag = "{" + clusterStartUp.getKeyOnServer("tag", 1) + "}";
> String key1 = hashTag + "key1";
> String value1 = "value1";
> jedis.set(key1, value1);
> String key2 = hashTag + "key2";
> String value2 = "value2";
> jedis.set(key2, value2);
> String throwingRedisStringKey = hashTag + "ThrowingRedisString";
> String throwingStringValue = "ThrowingRedisStringValue";
> // Put a test version of RedisString directly into the region that throws 
> if the multi-argument set() is called on it
> clusterStartUp.getMember(1).invoke(() -> {
>   RedisKey throwingKey = new 
> RedisKey(throwingRedisStringKey.getBytes(StandardCharsets.UTF_8));
>   ThrowingRedisString throwingRedisString = new ThrowingRedisString();
>   
> throwingRedisString.set(throwingStringValue.getBytes(StandardCharsets.UTF_8));
>   
> ClusterStartupRule.getCache().getRegion(DEFAULT_REDIS_REGION_NAME).put(throwingKey,
>  throwingRedisString);
> });
> String newValue = "should_not_be_set";
> assertThatThrownBy(() -> jedis.mset(key1, newValue, key2, newValue, 
> throwingRedisStringKey, newValue)).hasMessage(SERVER_ERROR_MESSAGE);
> 
> assertThat(jedis.get(key1)).isEqualTo(value1);
> assertThat(jedis.get(key2)).isEqualTo(value2);
> 
> assertThat(jedis.get(throwingRedisStringKey)).isEqualTo(throwingStringValue);
> IgnoredException.removeAllExpectedExceptions();
>   }
>   static class ThrowingRedisString extends RedisString {
> @Override
> public void set(Region region, RedisKey key, byte[] 
> newValue,
> SetOptions options) {
>   throw new RuntimeException(THROWING_REDIS_STRING_EXCEPTION);
> }
>   }{code}
> This test fails due to {{key1}} having its value set to {{newValue}} despite 
> the fact that MSET is supposed to be atomic.
> Given the similar implementations each command uses, it is very likely that 
> MSETNX and SMOVE also show the same behaviour.



--
This message was sent by Atlassian Jira
(v8.20.1#820001)


[jira] [Resolved] (GEODE-9704) When durable clients recovers, it sends "ready for event" signal before register for interest, this might cause problem for caching_proxy regions

2022-03-22 Thread Anilkumar Gingade (Jira)


 [ 
https://issues.apache.org/jira/browse/GEODE-9704?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Anilkumar Gingade resolved GEODE-9704.
--
Resolution: Won't Fix

> When durable clients recovers, it sends "ready for event" signal before 
> register for interest, this might cause problem for caching_proxy regions
> -
>
> Key: GEODE-9704
> URL: https://issues.apache.org/jira/browse/GEODE-9704
> Project: Geode
>  Issue Type: Bug
>  Components: regions
>Affects Versions: 1.15.0
>Reporter: Jinmei Liao
>Assignee: Mark Hanson
>Priority: Major
>  Labels: GeodeOperationAPI, blocks-1.15.1, pull-request-available
>
> This is the old Geode behavior, but may or may not be the correct behavior.
> When durable clients recovers, there is a queueTimer thread that runs 
> `QueueManagerImp.recoverPrimary` method,  it 
>  * makes new connection to server
>  - sends readyForEvents (which will cause the server to start sending the 
> queued events)
>  - recovers interest
>   - clears the region of keys of interest
>   - re-registers interest
> It sends readyForEvents before it clears region of keys of interest, if 
> server sends some events of those keys in between, it will clear them, thus 
> it seems to the user that the client region doesn't have those keys. 
>  
> Run geode-core distributedTest 
> AuthExpirationDUnitTest.registeredInterest_slowReAuth_policyKeys_durableClient(),
>  change the InterestResultPolicy to NONE, you would see the test would fail 
> occasionally, Adding sleep code in QueueManagerImp.recoverPrimary between 
> `createNewPrimary` and `recoverInterest` would make the test fail more 
> consistently.



--
This message was sent by Atlassian Jira
(v8.20.1#820001)


[jira] [Updated] (GEODE-10144) Regression in geode-native test CqPlusAuthInitializeTest.reAuthenticateWithDurable

2022-03-21 Thread Anilkumar Gingade (Jira)


 [ 
https://issues.apache.org/jira/browse/GEODE-10144?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Anilkumar Gingade updated GEODE-10144:
--
Affects Version/s: 1.15.0

> Regression in geode-native test 
> CqPlusAuthInitializeTest.reAuthenticateWithDurable
> --
>
> Key: GEODE-10144
> URL: https://issues.apache.org/jira/browse/GEODE-10144
> Project: Geode
>  Issue Type: Bug
>  Components: client/server
>Affects Versions: 1.15.0
>Reporter: Blake Bender
>Assignee: Jinmei Liao
>Priority: Major
>  Labels: blocks-1.15.0, needsTriage
>
> This test is failing across the board in the `geode-native` PR pipeline.  
> Main develop pipeline is green only because nothing can get through the PR 
> pipeline to clear checkin gates.  We have green CI runs with 1.15. build 918, 
> then it started failing when we picked up build 924.  
>  
> [~moleske] tracked this back to this commit:  
> [https://github.com/apache/geode/commit/2554f42b925f2b9b8ca7eee14c7a887436b1d9db|https://github.com/apache/geode/commit/2554f42b925f2b9b8ca7eee14c7a887436b1d9db].
>   See his notes in `geode-native` PR # 947 
> ([https://github.com/apache/geode-native/pull/947])



--
This message was sent by Atlassian Jira
(v8.20.1#820001)


[jira] [Assigned] (GEODE-10144) Regression in geode-native test CqPlusAuthInitializeTest.reAuthenticateWithDurable

2022-03-21 Thread Anilkumar Gingade (Jira)


 [ 
https://issues.apache.org/jira/browse/GEODE-10144?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Anilkumar Gingade reassigned GEODE-10144:
-

Assignee: Jinmei Liao

> Regression in geode-native test 
> CqPlusAuthInitializeTest.reAuthenticateWithDurable
> --
>
> Key: GEODE-10144
> URL: https://issues.apache.org/jira/browse/GEODE-10144
> Project: Geode
>  Issue Type: Bug
>  Components: client/server
>Reporter: Blake Bender
>Assignee: Jinmei Liao
>Priority: Major
>  Labels: blocks-1.15.0, needsTriage
>
> This test is failing across the board in the `geode-native` PR pipeline.  
> Main develop pipeline is green only because nothing can get through the PR 
> pipeline to clear checkin gates.  We have green CI runs with 1.15. build 918, 
> then it started failing when we picked up build 924.  
>  
> [~moleske] tracked this back to this commit:  
> [https://github.com/apache/geode/commit/2554f42b925f2b9b8ca7eee14c7a887436b1d9db|https://github.com/apache/geode/commit/2554f42b925f2b9b8ca7eee14c7a887436b1d9db].
>   See his notes in `geode-native` PR # 947 
> ([https://github.com/apache/geode-native/pull/947])



--
This message was sent by Atlassian Jira
(v8.20.1#820001)


[jira] [Updated] (GEODE-10118) PartitionedRegionClearWithConcurrentOperationsDUnitTest will hang in clearShouldFailWhenCoordinatorMemberIsBounced

2022-03-21 Thread Anilkumar Gingade (Jira)


 [ 
https://issues.apache.org/jira/browse/GEODE-10118?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Anilkumar Gingade updated GEODE-10118:
--
Labels:   (was: needsTriage)

> PartitionedRegionClearWithConcurrentOperationsDUnitTest will hang in 
> clearShouldFailWhenCoordinatorMemberIsBounced 
> ---
>
> Key: GEODE-10118
> URL: https://issues.apache.org/jira/browse/GEODE-10118
> Project: Geode
>  Issue Type: Sub-task
>Affects Versions: 1.15.0
>Reporter: Xiaojian Zhou
>Priority: Major
>
> In PR Clear feature branch, after rebase merged in GEODE-9522, 
> clusterDistributionManager setRootCause before TCPConduit.stop. It could 
> cause restarting of server hang after force disconnect. 



--
This message was sent by Atlassian Jira
(v8.20.1#820001)


[jira] [Commented] (GEODE-9910) Failure to auto-reconnect upon network partition

2022-03-07 Thread Anilkumar Gingade (Jira)


[ 
https://issues.apache.org/jira/browse/GEODE-9910?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17502521#comment-17502521
 ] 

Anilkumar Gingade commented on GEODE-9910:
--

[~smudundi] The ticket says affected version is 1.14...Are you continue to use 
1.14; are you looking for fix immediately. 

> Failure to auto-reconnect upon network partition
> 
>
> Key: GEODE-9910
> URL: https://issues.apache.org/jira/browse/GEODE-9910
> Project: Geode
>  Issue Type: Bug
>Affects Versions: 1.14.0
>Reporter: Surya Mudundi
>Assignee: Barrett Oglesby
>Priority: Major
>  Labels: GeodeOperationAPI, blocks-1.15.0​, needsTriage, 
> pull-request-available
> Attachments: geode-logs.zip
>
>
> Two node cluster with embedded locators failed to auto-reconnect when node-1 
> experienced network outage for couple of minutes and when node-1 recovered 
> from the outage, node-2 failed to auto-reconnect.
> node-2 tried to re-connect to node-1 as:
> [org.apache.geode.distributed.internal.InternalDistributedSystem]-[ReconnectThread]
>  [] Attempting to reconnect to the distributed system.  This is attempt #1.
> [org.apache.geode.distributed.internal.InternalDistributedSystem]-[ReconnectThread]
>  [] Attempting to reconnect to the distributed system.  This is attempt #2.
> [org.apache.geode.distributed.internal.InternalDistributedSystem]-[ReconnectThread]
>  [] Attempting to reconnect to the distributed system.  This is attempt #3.
> Finally reported below error after 3 attempts as:
> INFO  
> [org.apache.geode.logging.internal.LoggingProviderLoader]-[ReconnectThread] 
> [] Using org.apache.geode.logging.internal.SimpleLoggingProvider for service 
> org.apache.geode.logging.internal.spi.LoggingProvider
> INFO  [org.apache.geode.internal.InternalDataSerializer]-[ReconnectThread] [] 
> initializing InternalDataSerializer with 0 services
> INFO  
> [org.apache.geode.distributed.internal.InternalDistributedSystem]-[ReconnectThread]
>  [] performing a quorum check to see if location services can be started early
> INFO  
> [org.apache.geode.distributed.internal.InternalDistributedSystem]-[ReconnectThread]
>  [] Quorum check passed - allowing location services to start early
> WARN  
> [org.apache.geode.distributed.internal.InternalDistributedSystem]-[ReconnectThread]
>  [] Exception occurred while trying to connect the system during reconnect
> java.lang.IllegalStateException: A locator can not be created because one 
> already exists in this JVM.
>         at 
> org.apache.geode.distributed.internal.InternalLocator.createLocator(InternalLocator.java:298)
>  ~[geode-core-1.14.0.jar:?]
>         at 
> org.apache.geode.distributed.internal.InternalLocator.createLocator(InternalLocator.java:273)
>  ~[geode-core-1.14.0.jar:?]
>         at 
> org.apache.geode.distributed.internal.InternalDistributedSystem.startInitLocator(InternalDistributedSystem.java:916)
>  ~[geode-core-1.14.0.jar:?]
>         at 
> org.apache.geode.distributed.internal.InternalDistributedSystem.initialize(InternalDistributedSystem.java:768)
>  ~[geode-core-1.14.0.jar:?]
>         at 
> org.apache.geode.distributed.internal.InternalDistributedSystem.access$200(InternalDistributedSystem.java:135)
>  ~[geode-core-1.14.0.jar:?]
>         at 
> org.apache.geode.distributed.internal.InternalDistributedSystem$Builder.build(InternalDistributedSystem.java:3034)
>  ~[geode-core-1.14.0.jar:?]
>         at 
> org.apache.geode.distributed.internal.InternalDistributedSystem.connectInternal(InternalDistributedSystem.java:290)
>  ~[geode-core-1.14.0.jar:?]
>         at 
> org.apache.geode.distributed.internal.InternalDistributedSystem.reconnect(InternalDistributedSystem.java:2605)
>  ~[geode-core-1.14.0.jar:?]
>         at 
> org.apache.geode.distributed.internal.InternalDistributedSystem.tryReconnect(InternalDistributedSystem.java:2424)
>  ~[geode-core-1.14.0.jar:?]
>         at 
> org.apache.geode.distributed.internal.InternalDistributedSystem.disconnect(InternalDistributedSystem.java:1275)
>  ~[geode-core-1.14.0.jar:?]
>         at 
> org.apache.geode.distributed.internal.ClusterDistributionManager$DMListener.membershipFailure(ClusterDistributionManager.java:2326)
>  ~[geode-core-1.14.0.jar:?]
>         at 
> org.apache.geode.distributed.internal.membership.gms.GMSMembership.uncleanShutdown(GMSMembership.java:1187)
>  ~[geode-membership-1.14.0.jar:?]
>         at 
> org.apache.geode.distributed.internal.membership.gms.GMSMembership$ManagerImpl.lambda$forceDisconnect$0(GMSMembership.java:1811)
>  ~[geode-membership-1.14.0.jar:?]
>         at java.lang.Thread.run(Thread.java:829) [?:?]
>  



--
This message was sent by Atlassian Jira
(v8.20.1#820001)


[jira] [Commented] (GEODE-10084) Gfsh ImportDataCommand causes a ClassCastException

2022-02-24 Thread Anilkumar Gingade (Jira)


[ 
https://issues.apache.org/jira/browse/GEODE-10084?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17497811#comment-17497811
 ] 

Anilkumar Gingade commented on GEODE-10084:
---

Additional detail from the discussion:

In this case:  *The server has a pool defined:*

[info 2022/02/24 14:52:16.371 CST  tid=0x1] Pool localhost:16000 started 
with multiuser-authentication=false
That locator is defined by this system property:
-DLIVE_LOCATORS=localhost:16000
This causes a ClientTypeRegistration to be created.
The import command is trying to find the existing PdxType for that id.
The ClientTypeRegistration executes a GetPDXTypeByIdOp on the pool (normally a 
server - not sure what it is in this case):
PdxType type = GetPDXTypeByIdOp.execute((ExecutablePool) pool, typeId);
And that returns null, so the exception is thrown.

*Need to address this issue; even when pool is defined on server side.*

Another case with pool: For example, a WAN gateway creates a pool, but that 
doesn't count for determining the TypeRegistry. Other server pools shouldn't 
necessarily count either.


> Gfsh ImportDataCommand causes a ClassCastException
> --
>
> Key: GEODE-10084
> URL: https://issues.apache.org/jira/browse/GEODE-10084
> Project: Geode
>  Issue Type: Bug
>  Components: gfsh
>Reporter: Jinmei Liao
>Priority: Major
>  Labels: needsTriage
>
> When `ImportDataFunction` throws an InternalGemfireError, gfsh will report a 
> `ClassCastException`
> error 2022/02/22 09:55:27.275 CST ech-10-157-129-162-locator1  Connection(10)-10.157.129.162> tid=0x84] Could not execute “import data 
> --region=UDTVersionIndex_Stage 
> --file=/apps_data_01/fraud/UDTVersionIndex_Stage.gfd 
> --member=ech-10-157-129-162-server1”.
> java.lang.ClassCastException: org.apache.geode.InternalGemFireError cannot be 
> cast to org.apache.geode.management.internal.functions.CliFunctionResult
> at 
> org.apache.geode.management.internal.cli.result.model.ResultModel.addTableAndSetStatus(ResultModel.java:214)
> at 
> org.apache.geode.management.internal.cli.result.model.ResultModel.createMemberStatusResult(ResultModel.java:377)
> at 
> org.apache.geode.management.internal.cli.result.model.ResultModel.createMemberStatusResult(ResultModel.java:357)
> at 
> org.apache.geode.management.internal.cli.commands.ImportDataCommand.importData(ImportDataCommand.java:74)
> at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
> at 
> sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:62)
> at 
> sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:43)
> at java.lang.reflect.Method.invoke(Method.java:498)
> at 
> org.springframework.util.ReflectionUtils.invokeMethod(ReflectionUtils.java:282)
> at 
> org.apache.geode.management.internal.cli.remote.CommandExecutor.callInvokeMethod(CommandExecutor.java:138)
> at 
> org.apache.geode.management.internal.cli.remote.CommandExecutor.invokeCommand(CommandExecutor.java:148)
> at 
> org.apache.geode.management.internal.cli.remote.CommandExecutor.execute(CommandExecutor.java:76)
> at 
> org.apache.geode.management.internal.cli.remote.CommandExecutor.execute(CommandExecutor.java:61)
> at 
> org.apache.geode.management.internal.cli.remote.OnlineCommandProcessor.executeCommand(OnlineCommandProcessor.java:132)
> at 
> org.apache.geode.management.internal.cli.remote.OnlineCommandProcessor.executeCommandReturningJson(OnlineCommandProcessor.java:138)
> at 
> org.apache.geode.management.internal.beans.MemberMBeanBridge.processCommand(MemberMBeanBridge.java:1252)
> at 
> org.apache.geode.management.internal.beans.MemberMBean.processCommand(MemberMBean.java:424)
> at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
> at 
> sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:62)
> at 
> sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:43)
> at java.lang.reflect.Method.invoke(Method.java:498)
> at sun.reflect.misc.Trampoline.invoke(MethodUtil.java:71)
> at sun.reflect.GeneratedMethodAccessor5.invoke(Unknown Source)
> at 
> sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:43)
> at java.lang.reflect.Method.invoke(Method.java:498)
> at sun.reflect.misc.MethodUtil.invoke(MethodUtil.java:275)
> at 
> com.sun.jmx.mbeanserver.ConvertingMethod.invokeWithOpenReturn(ConvertingMethod.java:193)
> at 
> com.sun.jmx.mbeanserver.ConvertingMethod.invokeWithOpenReturn(ConvertingMethod.java:175)
> at 
> 

[jira] [Updated] (GEODE-10009) The CacheClientProxy for a durable client can be terminated when it shouldn't be

2022-02-04 Thread Anilkumar Gingade (Jira)


 [ 
https://issues.apache.org/jira/browse/GEODE-10009?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Anilkumar Gingade updated GEODE-10009:
--
Labels: blocks-1.15.0​ pull-request-available  (was: blocks-1.15.0​ 
needsTriage pull-request-available)

> The CacheClientProxy for a durable client can be terminated when it shouldn't 
> be
> 
>
> Key: GEODE-10009
> URL: https://issues.apache.org/jira/browse/GEODE-10009
> Project: Geode
>  Issue Type: Bug
>  Components: client queues
>Affects Versions: 1.15.0
>Reporter: Barrett Oglesby
>Assignee: Barrett Oglesby
>Priority: Major
>  Labels: blocks-1.15.0​, pull-request-available
>
> When the client connection is closed but the server has not left or crashed 
> (e.g in the re-authentication failed case), its possible that two threads in 
> a durable client can interleave in a way that causes an extra durable task to 
> be created on the server that eventually causes the CacheClientProxy to be 
> terminated.



--
This message was sent by Atlassian Jira
(v8.20.1#820001)


[jira] [Updated] (GEODE-10009) The CacheClientProxy for a durable client can be terminated when it shouldn't be

2022-02-02 Thread Anilkumar Gingade (Jira)


 [ 
https://issues.apache.org/jira/browse/GEODE-10009?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Anilkumar Gingade updated GEODE-10009:
--
Labels: blocks-1.15.0​ needsTriage pull-request-available  (was: 
needsTriage pull-request-available)

> The CacheClientProxy for a durable client can be terminated when it shouldn't 
> be
> 
>
> Key: GEODE-10009
> URL: https://issues.apache.org/jira/browse/GEODE-10009
> Project: Geode
>  Issue Type: Bug
>  Components: client queues
>Affects Versions: 1.15.0
>Reporter: Barrett Oglesby
>Assignee: Barrett Oglesby
>Priority: Major
>  Labels: blocks-1.15.0​, needsTriage, pull-request-available
>
> When the client connection is closed but the server has not left or crashed 
> (e.g in the re-authentication failed case), its possible that two threads in 
> a durable client can interleave in a way that causes an extra durable task to 
> be created on the server that eventually causes the CacheClientProxy to be 
> terminated.



--
This message was sent by Atlassian Jira
(v8.20.1#820001)


[jira] [Updated] (GEODE-9980) Startup of Locator or Server should fail fast if geode.enableGlobalSerialFilter is enabled but fails configuration

2022-01-31 Thread Anilkumar Gingade (Jira)


 [ 
https://issues.apache.org/jira/browse/GEODE-9980?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Anilkumar Gingade updated GEODE-9980:
-
Labels: GeodeOperationAPI blocks-1.15.0​ pull-request-available  (was: 
GeodeOperationAPI pull-request-available)

> Startup of Locator or Server should fail fast if 
> geode.enableGlobalSerialFilter is enabled but fails configuration
> --
>
> Key: GEODE-9980
> URL: https://issues.apache.org/jira/browse/GEODE-9980
> Project: Geode
>  Issue Type: Bug
>  Components: serialization
>Affects Versions: 1.15.0
>Reporter: Kirk Lund
>Assignee: Kirk Lund
>Priority: Major
>  Labels: GeodeOperationAPI, blocks-1.15.0​, pull-request-available
>
> The following error conditions need better handling which includes handling 
> of all errors consistently and cause the startup of a Locator or Server to 
> fail if it's unable to honor the setting of 
> {{-Dgeode.enableGlobalSerialFilter=true}} for any reason. Currently, if 
> {{-Dgeode.enableGlobalSerialFilter=true}} is specified but Geode is unable to 
> create a global serial filter, then it will will log a warning and continue 
> running. A user may easily miss that log statement and believe that the JVM 
> is running with a properly configured serialization filter.
> 1) The user is trying to secure the JVM very thoroughly and accidentally 
> specifies both {{-Djdk.serialFilter}} and 
> {{-Dgeode.enableGlobalSerialFilter}}. 
> 2) The user runs some non-Geode code in the same JVM that invokes 
> {{ObjectInputFilter.Config.setFilter(...)}} directly.
> 3) The user is using a version of Java 8 prior to 8u121 (the release that 
> first added {{sun.misc.ObjectInputFilter}}) and specifies 
> {{-Dgeode.enableGlobalSerialFilter=true}}. Also, the same behavior occurs if 
> they do NOT specify enabling that property.
> 4) {{LocatorLauncher}} or {{ServerLauncher}} is started in a JVM that has 
> already created at least one {{ObjectInputStream}} which will cause 
> {{ObjectInputFilter.Config.setFilter(...)}} to fail.
> 5) {{LocatorLauncher}} or {{ServerLauncher}} is started in a Java 8 JVM that 
> is not based on OpenJDK (ie {{sun.misc.ObjectInputFilter}} does not exist).
> 6) {{LocatorLauncher}} or {{ServerLauncher}} is started in an unforeseen 
> environment that causes invocation of 
> {{ObjectInputFilter.Config.setFilter(...)}} via Java Reflection to throw 
> {{IllegalAccessException}}.



--
This message was sent by Atlassian Jira
(v8.20.1#820001)


  1   2   3   4   5   6   >