With the eventual consistency feature available through the Atom feeds,
as described in the previous post, the next question is: what kinds of
readers can make use of it?
Here is the list I have come up with so far:
* Entity readers
* Entity indexers
* Backup
* Logging
* Reporting
I will go through them and their characteristics below.
Entity readers
==============
The most obvious one is the reader that answers /entity requests. This
creates the main feedback loop between clients and servers: clients read
from /entity, use domain logic to come up with a change set, which is
then sent to /changes. Readers may either be required to be Consistent,
in that they can only answer requests once they have processed all
changes, or they can be Available and Partition tolerant, in the sense
that calls to /entity simply return what's available right now, and the
data might be slightly out of date. It all depends on the client's
requirements (response time vs accuracy vs availability).
What happens here is that the reader gets the /changes feed and applies
the changes to its local database, which contains the snapshots of the
entities to be read (either all of them or a partition). Each change can
cause the snapshot to be updated, or optionally create a new snapshot,
so that you can easily traverse the database back in time if you want to.
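To make this concrete, here is a minimal sketch of such a reader in
Java. The Change type and the in-memory snapshot map are hypothetical
stand-ins; the real feed parsing and persistence are elided.

import java.util.HashMap;
import java.util.Map;

class Change {
    final String entityId;
    final long revision;
    final String snapshot; // new snapshot data produced by this change

    Change(String entityId, long revision, String snapshot) {
        this.entityId = entityId;
        this.revision = revision;
        this.snapshot = snapshot;
    }
}

class EntityReader {
    // latest snapshot per entity; a real reader would use a database,
    // and could keep old snapshots around for traversing back in time
    private final Map<String, String> snapshots = new HashMap<>();
    private long lastProcessedRevision = 0;

    // apply one change from the /changes feed to the local snapshots
    void apply(Change change) {
        snapshots.put(change.entityId, change.snapshot);
        lastProcessedRevision = change.revision;
    }

    // a Consistent reader refuses to answer until it has processed
    // everything up to the head of the feed; an AP reader just answers
    // with whatever it has right now
    String read(String entityId, long feedHead, boolean requireConsistent) {
        if (requireConsistent && lastProcessedRevision < feedHead) {
            throw new IllegalStateException("reader has not caught up with the feed");
        }
        return snapshots.get(entityId);
    }
}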
Entity indexers
===============
The same feed could also be used to update the index behind /query REST
requests. There could be many indexers running in parallel, to cover
various needs: one reader could use RDF, one could use Lucene, and one
could use Neo4j. REST routing would then determine which one is used
where. For example, /query/queries could return a feed of named
queries, each of which points to a different backend:
<?xml version="1.0" encoding="utf-8"?>
<feed xmlns="http://www.w3.org/2005/Atom">
  <title>Named queries</title>
  <link href="http://example.org/query/queries"/>
  <entry>
    <title>User by name</title>
    <link href="http://example.org/query/rdf/User_by_name"/>
  </entry>
  <entry>
    <title>User friends</title>
    <link href="http://neo4j.example.org/query/neo4j/User_friends"/>
  </entry>
  <entry>
    <title>Message by content</title>
    <link href="http://example.org/query/lucene/Message_by_content"/>
  </entry>
</feed>
Note that both paths and hostnames can change; the client reads this
feed to find out where the specific queries are. Each of these backends
could be clustered, or changed as the system evolves, without having to
update the client.
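As a sketch of the client side, assuming the feed above, a client could
resolve a named query to its current location with nothing but the
JDK's built-in XML parser (the feed URL and query names are just the
illustrative ones from the example):

import java.io.InputStream;
import java.net.URL;
import javax.xml.parsers.DocumentBuilderFactory;
import org.w3c.dom.Document;
import org.w3c.dom.Element;
import org.w3c.dom.NodeList;

class QueryDirectory {
    private static final String ATOM = "http://www.w3.org/2005/Atom";

    // returns the href of the entry whose title matches queryName,
    // or null if the feed does not list it
    static String resolve(String feedUrl, String queryName) throws Exception {
        DocumentBuilderFactory factory = DocumentBuilderFactory.newInstance();
        factory.setNamespaceAware(true);
        try (InputStream in = new URL(feedUrl).openStream()) {
            Document doc = factory.newDocumentBuilder().parse(in);
            NodeList entries = doc.getElementsByTagNameNS(ATOM, "entry");
            for (int i = 0; i < entries.getLength(); i++) {
                Element entry = (Element) entries.item(i);
                String title = entry.getElementsByTagNameNS(ATOM, "title")
                        .item(0).getTextContent();
                if (title.equals(queryName)) {
                    Element link = (Element) entry
                            .getElementsByTagNameNS(ATOM, "link").item(0);
                    return link.getAttribute("href");
                }
            }
            return null;
        }
    }
}

So resolve("http://example.org/query/queries", "User friends") would
return the Neo4j backend's URL, and the client never has to hardcode it.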
Backup
======
In my experience, backups are a very complicated matter. The naive
developer might think that if they simply let the admin back up the
application database once a day, everything is fine. In reality I have
never met a customer who is satisfied with this: not in terms of making
the backups, but rather in terms of restoring from such nightly
snapshots. If the database breaks and a restore is requested, using a
nightly snapshot means that the customer loses on average 12 hours'
worth of operations. This is simply not acceptable. When I was working
on SiteVision we therefore had to come up with some funky tricks to do
partial restoration of databases (so that a minimal amount of the
current working database is lost), either by selecting parts of
databases to be restored, or even on an individual object basis. This
is non-trivial, and ensuring that the result is consistent is
theoretically impossible. But if you talk to an average developer who
is not familiar with deployment and administration concerns, this is
what they will suggest.
Another major problem is taking backups of the online database. Some
databases do support taking backups while the system is running, but
this still costs the system performance. In the SiteVision case I had
one customer who never got a full backup, simply because their
monitoring system restarted SiteVision every night at 2.15am for
responding too slowly, which was exactly when the backup was being
made. They therefore had a whole lot of partial backups, but not a
single completed one. This was only discovered when they eventually had
a crash... (funny note: the admin folks actually thought it was normal
to restart the server every night. That's just sad)
With the EventSourced approach this problem becomes muuuuch simpler to
handle. Simply create a separate backup server which reads the changes
feed and applies the changes locally to a database. When a backup is
requested, stop reading changes from the master, take the snapshot
backup, and include the id of the last read message. Also back up all
the changes that have been made. If the changes are stored in plain
files you can easily use incremental backups to minimize how much is
copied each time. When the backup is done, resume reading changes so
that the backup copy eventually catches up and has more or less the
current state.
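As a sketch, the essential ordering looks something like this. The
ChangeFeed and SnapshotStore abstractions are hypothetical; the point
is that no changes are applied while the copy is made, and that the
snapshot records the id of the last change it contains.

interface ChangeFeed {
    String nextChangeId(); // blocks until the next change arrives
}

interface SnapshotStore {
    void apply(String changeId);              // apply one change locally
    void snapshotTo(String dir, String upTo); // copy the db, record last change id
}

class BackupServer {
    private final Object lock = new Object();
    private String lastApplied;

    // runs continuously, mirroring the master's /changes feed
    void followFeed(ChangeFeed feed, SnapshotStore store) {
        while (true) {
            String id = feed.nextChangeId();
            synchronized (lock) { // backup() holds this while copying
                store.apply(id);
                lastApplied = id;
            }
        }
    }

    // stops change application for the duration of the copy, so the
    // snapshot is consistent and tagged with the last included change;
    // afterwards followFeed() simply catches up with the master again
    void backup(SnapshotStore store) {
        synchronized (lock) {
            store.snapshotTo("/backups/" + System.currentTimeMillis(), lastApplied);
        }
    }
}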
When restoring from backup, get the snapshot, and then apply the
changes from the time of the backup up until now; if there are changes
you don't want to include, such as "Delete the whole database", filter
those out. This way the customer will not lose any important business
data, and you can ensure that the database is in a consistent state
when resuming operation.
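A restore is then just the snapshot plus a filtered replay; something
like this sketch, again with hypothetical abstractions. The filter is
where a "Delete the whole database" change would be skipped, and the
snapshot's recorded change id is assumed to appear in the archive.

import java.util.List;
import java.util.function.Predicate;

interface RestorableStore {
    void loadFrom(String snapshotDir); // load the snapshot copy
    void apply(String changeId);       // re-apply one archived change
}

class Restore {
    // snapshotUpTo is the change id recorded with the snapshot;
    // archive is the full list of change ids, oldest first
    static void restore(RestorableStore store, String snapshotDir,
                        String snapshotUpTo, List<String> archive,
                        Predicate<String> include) {
        store.loadFrom(snapshotDir);
        // everything up to and including snapshotUpTo is already in the snapshot
        int first = archive.indexOf(snapshotUpTo) + 1;
        for (String id : archive.subList(first, archive.size())) {
            if (include.test(id)) { // e.g. skip "Delete the whole database"
                store.apply(id);
            }
        }
    }
}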
Logging
=======
In a sense this is the simplest case: have a separate server that gets
the changes and stores them locally, optionally in a format that makes
them easier to search. This can be used for finding out what has
happened in a system, whether for debugging, legal, or similar
purposes. It is also possible to let the logger listen to several
/changes feeds and get a syndicated view of everything that is going on
across multiple systems. This can also be extended into a monitoring
system where you, for example, measure the number of changes per hour,
or the number of "New object X" messages per hour, or similar. Lots of
interesting monitoring tasks become trivial.
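As an illustration, counting changes per hour is only a few lines once
the logger has each change's timestamp (the feed plumbing is elided,
and the class name is made up):

import java.time.Instant;
import java.time.temporal.ChronoUnit;
import java.util.Map;
import java.util.TreeMap;

class ChangeMonitor {
    // change counts keyed by the hour in which the change occurred
    private final Map<Instant, Long> perHour = new TreeMap<>();

    // call this for every change read from a /changes feed
    void record(Instant changeTimestamp) {
        perHour.merge(changeTimestamp.truncatedTo(ChronoUnit.HOURS), 1L, Long::sum);
    }

    long changesInHour(Instant hour) {
        return perHour.getOrDefault(hour.truncatedTo(ChronoUnit.HOURS), 0L);
    }
}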
Reporting
=========
One of the main culprits in perpetuating the "Objects in RDBMS" fallacy
is that customers want to do reporting on live data. This is so wrong I
don't even want to spend time explaining why, because that's a whole
essay on its own. With the EventSourced approach we now have a way to
take the messages and massage them into an RDBMS (yes, plain SQL:
tables, rows and columns) in a format that is suitable for whatever
report is needed. This can be done as many times as necessary, and if
the reporting needs change, simply throw away the tables and start all
over, reading /changes from day one up until the last message. This
provides a much better basis for doing advanced reports, and also
ensures that reporting does not impact domain modeling or the online
application server's performance.
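As a sketch of such a projection, assuming a change can be reduced to a
couple of columns (the table layout and the [entityId, name] change
shape are made up; any JDBC-accessible database will do):

import java.sql.Connection;
import java.sql.PreparedStatement;
import java.sql.SQLException;
import java.sql.Statement;

class ReportProjection {
    private final Connection db;

    ReportProjection(Connection db) {
        this.db = db;
    }

    // throw the old table away and replay the whole /changes feed;
    // each element is a hypothetical [entityId, name] pair
    void rebuild(Iterable<String[]> changes) throws SQLException {
        try (Statement s = db.createStatement()) {
            s.execute("DROP TABLE IF EXISTS user_report");
            s.execute("CREATE TABLE user_report (entity_id VARCHAR(64), name VARCHAR(255))");
        }
        try (PreparedStatement insert = db.prepareStatement(
                "INSERT INTO user_report (entity_id, name) VALUES (?, ?)")) {
            for (String[] change : changes) {
                insert.setString(1, change[0]);
                insert.setString(2, change[1]);
                insert.executeUpdate();
            }
        }
    }
}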
Conclusion
==========
This is a short rundown of what you can do with readers in an
EventSourced system. As you can see, a whole bunch of typically complex
problems become muuuch easier to deal with.
Continued in part 6.