Re: [Chandler-dev] [Sum] The Great Architecture Discussion of 2007

Phillip J. Eby Tue, 09 Oct 2007 18:55:45 -0700

At 05:27 PM 10/9/2007 -0700, Andi Vajda wrote:

Not throw out. Migrate to a new schema. Just like in a relational database.
If you change the low-level layout (format), core schema, or appschema (table layout) someone needs to migrate the data. It might beapparently easier in a relational schema but not so once you'vecarefully optimized it and duplicated stuff left and right to getthe desired performance. Essentially, it becomes harder once the 1-1correspondance between programmer's view (kind/class) and SQL table is broken.

Have a look at Hibernate, which is used by Cosmo: it uses an XML filethat specifies the mapping between objects and database. Thecontents of this file are never known to the application, whichsimply uses its own object model.

Hibernate maps object retrieval and queries to SQL, and applicationsuse either the collections defined by the mapping, or use "HQL",which is an SQL-like query language that queries in terms of the*object* schema, rather than the relational one. And it takes careof all the non-1-1-ness in the mapping.

Now, if you add new types to the application schema, of course youhave to add to the XML file. But in principle you could generate theXML in a logical fashion from the new piece of application schema, sothat even that step is not necessary when you are first adding to theapplication.

Now, Hibernate is not available for Python (although I suppose youcould make it so with JCC!) but it illustrates the point that ispossible to separate things in this fashion. I believe there is atleast one Python ORM that claims to be inspired by or to work likeHibernate, though. I also seem to recall that SQLAlchemy for Pythonalso has a great deal of flexibility in mapping between differentrelational schemas, such that your code can deal with a logicalschema rather than an actual one.

There is also the possibility of just rolling Yet Another Python ORM,perhaps based on EIM. But these things don't matte as much aslayering the application in such a way that it does not *care* howthings actually get stored. Chandler's domain model objects shouldnot be subclasses of a storage type, for example. (i.e., they shouldnot be repository.Items).

That way, we will be able to experiment with different mappings anddifferent back ends for optimum performance. For that matter, wecould use more than one back end if we chose, such that email bodiesmight be stored in mbox files, while their headers get indexed inSQLite. (While all being dumpable and reloadable, of course.)

And, it is likely that for some period, we will still back-end to therepository -- we just would go through a mapping layer of some sortfirst. (And that would mean that we could do some physical schematuning there, without needing to mess with the application layer.)


_ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _

Open Source Applications Foundation "chandler-dev" mailing list
http://lists.osafoundation.org/mailman/listinfo/chandler-dev

Re: [Chandler-dev] [Sum] The Great Architecture Discussion of 2007

Reply via email to