Hi, I have a query regarding data safety of replicated caches in the case of hard failure of the compute resource but where the storage resource is available when the node returns.
We are using Ignite 2.15 with the C# client. We have a number of these caches that have four nodes participating in the replicated caches, all with the default PrimarySync write synchronization mode. All data storage configurations are configured with WalMode = WalMode.Fsync. We have logic performing writes against these caches which will continue once the primary node for the replicated cache has written the data item. I am unsure of the guarantees made by Ignite at this point in the event of failure. Specifically, hard/red-button failure of compute hardware resources and/or abrupt (but recoverable) detachment of storage resources. Scenario one: Primary node returns "OK", then immediately fails (before check point). When the primary node returns should I expect the replicated value to be in the primary, and to appear in all other nodes too. Scenario two: Primary node returns "OK", then a secondary node immediately fails (before achieving the write and so before any check point). When the secondary node returns should I expect the replicated value to be in the recovered secondary node? In relation to these scenarios, does setting the cache write synchronization mode improve the safety of the write as all nodes must acknowledge the write before it returns. If there is an improvement in write safety in this instance, does this imply the Fsync WalMode write pathway has opportunities for data loss in these failure situations? Thanks, Raymond. -- <http://www.trimble.com/> Raymond Wilson Trimble Distinguished Engineer, Civil Construction Software (CCS) 11 Birmingham Drive | Christchurch, New Zealand raymond_wil...@trimble.com <https://worksos.trimble.com/?utm_source=Trimble&utm_medium=emailsign&utm_campaign=Launch>