Re: Archetype relational mapping - a practical openEHR persistence solution

Thomas Beale Tue, 26 Jan 2016 02:36:07 -0800


On 26/01/2016 09:51, Bert Verhees wrote:

On 26-01-16 10:38, Jan-Marc Verlinden wrote:
Our first version was Java based with a postgres DB, everything stored as path/values. Every query would take about a second. We did not even try complex queries..:-). Also the GUI side did not know what to do with the path/values.
Hi Jan-Marc,
There were some problems handling the path/values; most problems were based on giving a semantic meaning to the paths. Storing a path and a corresponding value is very, very quick. I asked database specialists, and they say this is the best way to go up to billions of records.
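As an illustration of the path/value idea (not code from this thread), here is a minimal Python sketch that flattens a nested record into path/value rows. The record layout and the key names (`blood_pressure`, `systolic`, `magnitude`, `units`) are hypothetical and are not real openEHR archetype paths.

```python
def flatten(node, prefix=""):
    """Yield (path, value) pairs for every leaf of a nested dict/list."""
    if isinstance(node, dict):
        for key, child in node.items():
            yield from flatten(child, f"{prefix}/{key}")
    elif isinstance(node, list):
        for index, child in enumerate(node):
            yield from flatten(child, f"{prefix}[{index}]")
    else:
        # A leaf: this is one row of the path/value store.
        yield prefix, node


# Hypothetical record, used only to show the shape of the output rows.
record = {
    "blood_pressure": {
        "systolic": {"magnitude": 120, "units": "mm[Hg]"},
        "diastolic": {"magnitude": 80, "units": "mm[Hg]"},
    }
}
rows = list(flatten(record))
```

Each row is cheap to insert, which fits the speed claim above; the hard part, as noted, is giving the paths semantic meaning again at query time.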

This is also what I would expect. Path-based storage does, of course, rely on very smart ways to match terms in a query to paths. There are some tricks to use here. For example, the path to the systolic BP DV_QUANTITY node from the archetype is

In the whole of CKM there are probably about 7,000 'interesting' leaf paths (if you assume that you crunch DATA_VALUE subtypes into little blobs). That's a tiny number. Assume that when they've modelled everything in medicine (outside of genomics and proteomics) we have 50,000 such 'paths of interest'. That's a very small number. These paths can be mapped in smart ways to a 64-bit number space so that finding out if a specific query term is in some EHR is very quick. When you include a coded list of archetype ids in the mix, I think querying can be made extremely quick.
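One possible reading of the 64-bit mapping, sketched in Python under stated assumptions: each (archetype id, path) pair is hashed to a 64-bit integer, and each EHR keeps a set of the keys it contains. The hashing scheme, the `PathIndex` class and the ids used below are all hypothetical; the thread does not say how the mapping would actually be done.

```python
import hashlib


def path_key(archetype_id: str, path: str) -> int:
    """Map an (archetype id, path) pair to a 64-bit integer (assumed scheme)."""
    data = f"{archetype_id}|{path}".encode("utf-8")
    digest = hashlib.blake2b(data, digest_size=8).digest()  # 8 bytes = 64 bits
    return int.from_bytes(digest, "big")


class PathIndex:
    """Per-EHR set of 64-bit path keys, for fast 'is this term present?' checks."""

    def __init__(self):
        self._keys_by_ehr = {}

    def add(self, ehr_id: str, archetype_id: str, path: str) -> None:
        self._keys_by_ehr.setdefault(ehr_id, set()).add(path_key(archetype_id, path))

    def contains(self, ehr_id: str, archetype_id: str, path: str) -> bool:
        return path_key(archetype_id, path) in self._keys_by_ehr.get(ehr_id, set())


# Hypothetical ids and paths, for illustration only.
index = PathIndex()
index.add("ehr-1", "example-archetype.bp", "/systolic")
found = index.contains("ehr-1", "example-archetype.bp", "/systolic")
missing = index.contains("ehr-1", "example-archetype.bp", "/diastolic")
```

With only tens of thousands of paths of interest, collisions in a 64-bit space are very unlikely, and a membership test is a single set lookup before any value rows are read.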

The devil is in the details. Various large DBs used path-based approaches in the past; Informix was one.

Also easy to migrate to another database, for clustering or other reasons.
But there are some problems to solve, which were harder to solve five years ago.
One problem is the GUI builders: they are looking at a difficult-to-understand database approach, in which it is also easy to create errors that are hard to debug.
They need JSON to write their datasets in.
The other problem is querying. As long as the queries are predefined, you can do anything, but then you are no different from an old monolithic system.
But writing new templates relies heavily on on-the-fly query building.
There is, however, some technological progress, also in the open source domain.
Path/value storage could get a new lease of life with the help of ANTLR, which can help to interpret AQL for this purpose. I even think this is promising.
Let engineers read The Definitive ANTLR 4 Reference by Terence Parr, and read it with path/values in the back of their minds. Both the GUI problem and the query problem can be solved.
It should be worth the time spent and the price of the book ;-)


It is.

- thomas

