From the previous post we have that the client sends events to the
server by PUTting them to the /changes Atom feed. But what happens next?
How are those changes applied to the snapshots for read?
One important thing to realize is that for the changes to be accepted,
only the following needs to happen:
1) Validate that the changes are consistent. This includes checking the
version of the written state, and optionally of the read state. If the
changes only include new entities, validation can be skipped.
2) The event (e.g. the XML from the previous post) needs to be
transactionally persisted. It must include a pointer to the previous event.
3) The EntityStore needs to transactionally store a pointer to the most
recent event that was posted.
See the attached image for an example. Starting from the pointer in the
EntityStore (2 in the image), the series of events forms a linked
list which can be traversed back to the beginning of the series of
changes. This list can optionally be stored outside of the changes
themselves, to optimize for traversal rather than retrieval.
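The append-and-traverse mechanics of steps 2 and 3 could be sketched roughly as below. This is a minimal in-memory sketch; UnitOfWorkEvent, EventStore, and the map-based storage are illustrative assumptions, not the actual Qi4j API, and a real store would put the event write and the head-pointer update in one transaction against durable storage.

```java
import java.util.HashMap;
import java.util.Map;

// Illustrative only: a UnitOfWork event carrying a pointer to its predecessor.
class UnitOfWorkEvent {
    final String id;          // e.g. "aG324JWH"
    final String previousId;  // pointer to the previous event, null for the first
    final String payload;     // the serialized XML from the previous post
    UnitOfWorkEvent(String id, String previousId, String payload) {
        this.id = id; this.previousId = previousId; this.payload = payload;
    }
}

class EventStore {
    private final Map<String, UnitOfWorkEvent> events = new HashMap<>();
    private String head; // the EntityStore's pointer to the most recent event

    // Steps 2 and 3: persist the event with a pointer to its predecessor,
    // then move the head pointer. In a real store both writes would share
    // one transaction.
    synchronized void append(String id, String payload) {
        events.put(id, new UnitOfWorkEvent(id, head, payload));
        head = id;
    }

    // Traverse the linked list backwards from the head, newest first.
    java.util.List<String> history() {
        java.util.List<String> ids = new java.util.ArrayList<>();
        for (String id = head; id != null; id = events.get(id).previousId) {
            ids.add(id);
        }
        return ids;
    }
}
```

Note that the writer only ever does two small writes per UnitOfWork; traversal cost is pushed entirely to the readers.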
What we want to do now is achieve eventual consistency, that is, the
read snapshots accessed by EntityStoreUnitOfWork.getEntityState() (which
internally calls /entity?id=1234) need to have the latest snapshot of
"1234" somehow. To get this, the read store goes to /changes, which is
an Atom feed. The result will be something like:
<?xml version="1.0" encoding="utf-8"?>
<feed xmlns="http://www.w3.org/2005/Atom">
  <title>Changes</title>
  <updated>2009-04-13T12:30:05Z</updated>
  <entry>
    <title>Add new task to project</title>
    <link href="http://example.org/changes/unitofwork/aG324JWH"/>
    <id>urn:uuid:aG324JWH</id>
    <updated>2009-04-13T12:30:05Z</updated>
  </entry>
  <entry>
    <title>Create project</title>
    <link href="http://example.org/changes/unitofwork/bz452HSQ"/>
    <id>urn:uuid:bz452HSQ</id>
    <updated>2009-04-10T10:21:15Z</updated>
  </entry>
</feed>
---
The feed includes the linked list of UnitOfWork events that have been
persisted, newest first. If there are lots of events, the feed can be
chunked (say, 100 entries per page) and traversed backwards in time
using /changes?start=db325JH2 to indicate which event should come first
in the page. The reader keeps track of how far back it has read, simply
traverses back to just before what it has already seen, then fetches the
UnitOfWorks one at a time and applies them locally to the Entity
snapshots. For performance we can optionally allow the feed to include
the state directly using the <content> tag. This should be indicated in
the URL though, to allow both traversal (links as entries) and retrieval
(content as entries).
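The reader's catch-up loop could look roughly like the sketch below. FeedPage, FeedSource, and catchUp are hypothetical names for illustration, not part of Qi4j; the sketch assumes pages list entry ids newest first, as in the feed above.

```java
import java.util.ArrayDeque;
import java.util.ArrayList;
import java.util.Deque;
import java.util.List;

// Illustrative abstraction over one page of GET /changes?start=...
class FeedPage {
    final List<String> entryIds;   // newest first, as in the feed above
    final String nextStart;        // start parameter of the next page, or null
    FeedPage(List<String> entryIds, String nextStart) {
        this.entryIds = entryIds; this.nextStart = nextStart;
    }
}

interface FeedSource {
    FeedPage fetch(String start); // null start means the newest page
}

class FeedReader {
    private String lastApplied; // how far back we have already read

    // Walk backwards through the pages until we hit what we have already
    // applied, then return the new UnitOfWork ids oldest-first, ready to
    // be applied to the local Entity snapshots.
    List<String> catchUp(FeedSource source) {
        Deque<String> toApply = new ArrayDeque<>();
        String start = null;
        outer:
        while (true) {
            FeedPage page = source.fetch(start);
            for (String id : page.entryIds) {
                if (id.equals(lastApplied)) break outer;
                toApply.addFirst(id); // reverse into oldest-first order
            }
            if (page.nextStart == null) break; // reached the very first event
            start = page.nextStart;
        }
        if (!toApply.isEmpty()) lastApplied = toApply.peekLast();
        return new ArrayList<>(toApply);
    }
}
```

A second call to catchUp after nothing has changed returns an empty list, since the first entry of the newest page matches what was already applied.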
Note now that there can be any number of readers here, and the writer
behind the /changes URL does not have to know who they are. Either the
readers update every once in a while, or they can get open feeds where
the server simply holds the connection open until data becomes
available, to minimize the lag between receiving a change and applying it.
Note also that these feeds can be used for all sorts of fun stuff...
more on that in later posts.
To conclude, with this simple REST-based scheme we can achieve extremely
good performance for writing (since the writer does not have to update
snapshots, only log the events), and also arbitrary reader scalability,
as all you have to do is add more readers to the feed. Either all
readers can have all state, or you can do consistent hashing for content
routing in order to do data partitioning.
This also provides an answer as to what the version is of each entity.
It is not "1,2,3,4" or something like that. Instead it is the id of the
last applied UnitOfWork!
In this scheme the reader can choose the level of consistency of data.
Either the client can just get "whatever is there" when /entity is
called, or if greater consistency is required, then the reader can call
the feed to ensure that there are no more changes to be applied to the
entity being accessed.
The reader can also choose how to interact with the change feed. If the
reader is on the same network you might want to do HTTP streaming calls,
so that whenever data is available in the writer it is pushed to the
readers. At the other end of the spectrum you have WAN access which
happens once a day to fetch the changes for the past 24h. Consistency is
lower, but there is also less demand on the network, especially if
content is included in the feed and the feed is also gzipped.
Continued in part 5.
_______________________________________________
qi4j-dev mailing list
[email protected]
http://lists.ops4j.org/mailman/listinfo/qi4j-dev