I think I am starting to understand.
Reading the raft.proto file - what is the difference between StateMachineLogEntryProto#logData and StateMachineLogEntryProto#stateMachineEntry#stateMachineData ? In the counter example the log data is the 'increment request' and in filestore the stateMachineData is the stateMachineLogEntry is being used? What should be used where/when? ________________________________ From: Tsz Wo Sze <[email protected]> Sent: Saturday, December 18, 2021 1:36:18 AM To: [email protected] <[email protected]> Subject: Re: Raft log persistance The file store example is just a simple example. It does not intend to support all the features. It indeed does not work for the case that the data is deleted -- it will fail to read the file. You are right that, for real applications supporting multiple versions of data, it should retrieve the data from the version corresponding to the given log index. Tsz-Wo On Sat, Dec 18, 2021 at 1:05 PM Asad Awadia <[email protected]<mailto:[email protected]>> wrote: In file store example read method - it looks like it is returning the file at the path without taking the raft index in consideration i.e it would keep returning the same file for many entry indices provided the file hadn't changed? What is going on there? Shouldn't it return the data snapshot at instant entry.getIndex? ________________________________ From: Tsz Wo Sze <[email protected]<mailto:[email protected]>> Sent: Wednesday, December 15, 2021 7:37:20 PM To: [email protected]<mailto:[email protected]> <[email protected]<mailto:[email protected]>> Subject: Re: Raft log persistance > If I understand correctly, in the write method we should take the request > and apply it right there in the method into the state machine? So it only > gets persisted into the state machine and not the raft log? The write method should write the stateMachineData (i.e. LogEntryProto.getStateMachineLogEntry().getStateMachineEntry().getStateMachineData()) to its storage. The stateMachineData will be removed from the logEntryProto. The logEntryProto with removed stateMachineData will be written to the raft log. Yes, stateMachineData won't be written to the raft log. > How will syncing up and backups work then? If there is no 'raft log' anymore > since we are overriding the write method to only apply writes to the state > machine. How will the other servers be able to sync and catch up? When reading from the raft log, the server in addition calls the DataApi.read(LogEntryProto) method. The state machine should return back the stateMachineData corresponding to the given logEntryProto as a CompletableFuture<ByteString>. Then, the server will add the stateMachineData back to the logEntryProto and then send the logEntryProto to the other servers. Tsz-Wo On Thu, Dec 16, 2021 at 3:13 AM Asad Awadia <[email protected]<mailto:[email protected]>> wrote: I read through the file store example. If I understand correctly, in the write method we should take the request and apply it right there in the method into the state machine? So it only gets persisted into the state machine and not the raft log? How will syncing up and backups work then? If there is no 'raft log' anymore since we are overriding the write method to only apply writes to the state machine. How will the other servers be able to sync and catch up? ________________________________ From: Tsz Wo Sze <[email protected]<mailto:[email protected]>> Sent: Tuesday, December 14, 2021 8:16:23 PM To: [email protected]<mailto:[email protected]> <[email protected]<mailto:[email protected]>> Subject: Re: Raft log persistance Hi Asad, By default, all the write requests (including the data inside) are recorded in the raft log. This is specified in the standard RAFT algorithm so that any server can replay the log in order to sync up to the current state. If the state machine keeps another persistent copy of the data, you are right that there will be two copies. In this sense, the standard RAFT algorithm is not suitable for data intensive applications. In Ratis, we want to support data intensive applications such as Apache Ozone. State machine implementations may choose to manage the data itself. In that case, the data won't be recorded in the raft log. In order to use this feature, state machine implementations must override the read and write methods defined in StateMachine.DataApi. The FileStore example indeed has overridden these two methods. Hope it helps. Tsz-Wo On Wed, Dec 15, 2021 at 3:40 AM Asad Awadia <[email protected]<mailto:[email protected]>> wrote: Hello, Is ratis keeping a copy of all the writes made? On top of what is persisted in my own DB? So ~2x storage used I read that the file system example avoids that but I don't see how that is being done in the code? Regards, Asad
