Hey,

Just wanted to make a few notes on the division of responsibility in the new persistence SPI, since it has changed radically from before.

Basically there will be three layers (instead of two): state, type, and instance. Instance is the UoW layer, as before; state is the EntityStore's responsibility, and this is the part that will be pluggable. The type layer will be managed by a new thingy called EntityRegistry, which sits between the UoW and the EntityStore. Any migration hooks will live in there.
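To make the layering a bit more concrete, here is a rough sketch of where the new piece would sit (purely illustrative; the method below is hypothetical, not the actual API):

// Instance layer: the UoW works with typed entity instances.
// Type layer:     EntityRegistry knows the EntityTypes and holds the migration hooks.
// State layer:    the EntityStore only sees untyped state (the pluggable part).
public interface EntityRegistry
{
    // Hypothetical: resolve type info for a stored reference; a natural place
    // for schema-migration hooks when the stored type no longer matches.
    EntityType getEntityType(EntityTypeReference reference);
}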

To illustrate the difference between the layers, here is the representation of a property in each of them:
Instance: Property<String> foo();
Type: QualifiedName(mypackage.myinterface.foo)
State: StateName(hashed QualifiedName) = "Bar"

The Property is short-lived, as it is local to a particular UoW. The QualifiedName in the type layer lives as long as the application is up. After a restart the property might have been refactored, and hence have a new QualifiedName. The StateName = "Bar" entry represents the actual stored value, and has a lifespan equal to that of the object, or until the next schema migration.

In the state layer we expect that values will be stored using a hash (a secure hash, like SHA-1) of the property's QualifiedName rather than the name itself. So, if "foo" has hash value AFEB3241 then on disk we store AFEB3241="Bar". This ensures that the next time we load the entity we know which version of the EntityType this property belonged to. If the EntityType has changed we can do schema migration properly.
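Just to make the hashing concrete, here's a minimal sketch (the exact scheme, i.e. algorithm, truncation and encoding, isn't settled, so the helper below is illustrative only):

import java.security.MessageDigest;

public class StateNameHashing
{
    // Illustrative only: derive a stable state key from the qualified property name.
    public static String hashOf(String qualifiedName) throws Exception
    {
        MessageDigest sha1 = MessageDigest.getInstance("SHA-1");
        byte[] digest = sha1.digest(qualifiedName.getBytes("UTF-8"));
        StringBuilder hex = new StringBuilder();
        for(byte b : digest)
        {
            hex.append(String.format("%02x", b));
        }
        return hex.toString();
    }

    public static void main(String[] args) throws Exception
    {
        // "mypackage.myinterface.foo" always hashes to the same hex string,
        // and the store then keeps <hash>="Bar" rather than foo="Bar".
        System.out.println(hashOf("mypackage.myinterface.foo"));
    }
}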

Since the hashes might be fairly long it is perfectly ok for the EntityStore to replace them with e.g. a number, so that in the above example it might be stored physically as 32="Bar". It is up to the EntityStore to manage the mapping between "AFEB3241" and 32.
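For that mapping, something as simple as this (hypothetical, store-internal) interning table would do:

import java.util.HashMap;
import java.util.Map;

// Illustrative only: replaces long hash strings with compact numeric keys
// for physical storage; the EntityStore owns and persists this table.
public class StateKeyInterner
{
    private final Map<String, Integer> keys = new HashMap<String, Integer>();
    private int nextKey = 1;

    public synchronized int keyFor(String hash)
    {
        Integer key = keys.get(hash);
        if(key == null)
        {
            key = nextKey++;
            keys.put(hash, key); // e.g. "AFEB3241" -> some small number like 32
        }
        return key;
    }
}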

Because of this separation of responsibilities, the EntityState interface changes to something like this (not the finished version):
public interface EntityState
{
    // Identity and bookkeeping
    EntityReference identity();
    long version();
    long lastModified();
    void remove();
    EntityStatus status();

    // The EntityTypes this state is known to carry
    void addEntityType(EntityTypeReference type);
    void removeEntityType(EntityTypeReference type);
    boolean hasEntityType(EntityTypeReference type);
    Set<EntityTypeReference> entityTypes();

    // Properties, keyed by StateName rather than QualifiedName
    Object getProperty(StateName stateName);
    void setProperty(StateName stateName, Object newValue);

    // Associations, likewise keyed by StateName
    EntityReference getAssociation(StateName stateName);
    void setAssociation(StateName stateName, EntityReference newEntity);

    ManyAssociationState getManyAssociation(StateName stateName);

    void hasBeenApplied();

    ValueState newValueState(Map<QualifiedName, Object> values);
}
Instead of using QualifiedName, which includes the class name and the property name (both of which can change over time), the methods above use StateName, which includes both the name of the property and its hashed name (there's a small sketch of StateName after the example below). Quick&Dirty stores will use the name, but "real" stores should use the hashed name to store the value, which ensures that it can be retrieved later on even if the QualifiedName of a property has changed. EntityTypeReference likewise contains the hashed name of the EntityType(s) rather than only the name. Typical storage of a single entity in a hashmap-oriented store hence becomes:
id=123
version=5
lastModified=<somedate>
entityTypes=<hash of type 1>,<hash of type 2>,<hash of type 3, etc.>
AFEB3241="Bar"
---
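For completeness, the StateName mentioned above can be pictured as a small value object along these lines (hypothetical layout, just to show the name/hash pairing):

// Illustrative only: StateName pairs the plain property name with its hashed form,
// so Quick&Dirty stores can key on name() while "real" stores key on hash().
public final class StateName
{
    private final String name; // e.g. "foo"
    private final String hash; // e.g. "AFEB3241"

    public StateName(String name, String hash)
    {
        this.name = name;
        this.hash = hash;
    }

    public String name() { return name; }
    public String hash() { return hash; }
}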
When the instance in the UoW wants to load a property it will go through a TypedEntity, which is a decorator of the EntityState, and do pretty much the same as before:
typedEntity.getProperty(qualifiedName);

The TypedEntity translates the QualifiedName into a StateName and then calls EntityState:
entityState.getProperty(stateName);
which can do the lookup in the above hashmap.
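In code, that translation step could look roughly like this (field and method names are hypothetical; TypedEntity's real shape isn't fixed yet):

import java.util.Map;

// Illustrative only: TypedEntity decorates an EntityState and translates
// type-level QualifiedNames into state-level StateNames before delegating.
public class TypedEntity
{
    private final EntityState entityState;
    private final Map<QualifiedName, StateName> stateNames; // supplied by the type layer

    public TypedEntity(EntityState entityState, Map<QualifiedName, StateName> stateNames)
    {
        this.entityState = entityState;
        this.stateNames = stateNames;
    }

    public Object getProperty(QualifiedName qualifiedName)
    {
        StateName stateName = stateNames.get(qualifiedName);
        return entityState.getProperty(stateName); // plain lookup in the store's map
    }
}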

Apart from making persistence more change-tolerant, this should also make it much easier to implement EntityStores. There's a lot of type-related code in the Neo4j store today, for example, which I think would simply go away; the implementation becomes pretty much a straight wrapper around the underlying node, since all type-related code would sit in Qi4j.

The above isn't all of it, but it's a very important part. Do y'all think it makes sense? Any potential problems with it? The main issue I can see right now is the mapping stores, which need access to the type info somehow; that will have to be solved.

/Rickard
