Bruce Schuchardt created GEODE-5349:
---------------------------------------
Summary: State-flush operation may terminate waiting for current
operations, allowing for cache inconsistency
Key: GEODE-5349
URL: https://issues.apache.org/jira/browse/GEODE-5349
Project: Geode
Issue Type: Bug
Components: regions
Reporter: Bruce Schuchardt
The state-flush operation relies in part on
DistributionAdvisor.waitForCurrentOperations() to stall until in-process
replication efforts have written their messages to communication channels.
This method currently has a self-imposed time limit of (2*ack-wait-threshold)-1
seconds, which defaults to 29 seconds. If a cache operation, say a transaction
commit, happens to take longer than this the waitForCurrentOperations() method
will terminate early, possibly allowing a new copy of a region to miss the
changes contained in that cache operation.
We should remove the timeout in waitForCurrentOperations and rigorously test
the change.
--
This message was sent by Atlassian JIRA
(v7.6.3#76005)