Re: [Dev] Keeping Users' Data

Morgen Sagen Fri, 03 Feb 2006 00:05:40 -0800

On Feb 2, 2006, at 12:25 PM, Phillip J. Eby wrote:

At some point, Chandler is going to be keeping users' real data,and be the only place where that data is kept. When that happens,we will be responsible for ensuring that their data remains intactacross upgrades. Even if this is done through some kind of import/export mechanism, we will need to keep track of the changes in ourschema so that this *can* be done.
There are several key questions about this that remain open:
* When will we be committing to keeping users' data safe acrossupgrades? Is it 0.7? 0.8? 1.0?
* How will we ensure (procedurally or otherwise) that each versionof Chandler will successfully upgrade from older versions?
* What support will we provide for parcel developers to ensure that*their* schemas upgrade safely, as well as the Chandler "out of thebox" schemas?
* When an upgrade has to be reverted, what guarantees should wegive the user for being able to revert safely without losing data?
While I'm taking responsibility for driving the technicalimplementation of schema evolution features in 0.7, there is a lothere that is more on the organizational commitment level. Really,the technical implementation of schema evolution features is onlythe tip of the iceberg of effort here. Tracking changes, writingupgrade code, and testing will be the biggest resource investmentshere.
If you want more technical information about the issues involved atthe implementation level, a good place to start is this post Iwrote in October:
http://lists.osafoundation.org/pipermail/dev/2005-October/003994.html
You will probably want to skip most of the first half (which dealswith developer-only code-upgrading ideas) and go straight to thesection entitled "Updating Chandler Schema". The text that followsaddresses details of various aspects of upgrading Chandler schemas,given the existing repository and schema APIs.

I was hoping to have an answer or two, but I only came up withquestions... :-)

Although I had always assumed we would have code within each parcelthat knew how to upgrade its data from the previous version to thecurrent version, a more elegant solution would be to have a log ofschema changes coupled with some transformation code that could applythose changes, as you described in your October email. So how dothose two methods compare? You say:

"With sufficient care and infrastructure support, we can relativelyeasilysupport manual schema upgrades, in the sense of having installParcel() makethe changes, if we entirely forbid certain classes of schema changethatcould not be implemented in this way. However, the amount ofdevelopercare required currently appears prohibitive, in the sense that it'sgoing
to seriously impede our flexibility to refactor."

Do you mean that if we did take the route of putting hand-builttransformation code into installParcel( ), the amount oftransformation code would be unwieldy? If we go the schema-change-log route, developers would still have to create log entries for eachchange, or are you saying we could automatically build that log?

There's always the chance that if we do the log-based transformationsystem, some parcel writer will want to be able to perform an upgradethat the transformation system doesn't support. In that case couldthey have custom upgrade code in their parcel, or would all upgradingneed to go through the transformation system?

_ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _

Open Source Applications Foundation "Dev" mailing list
http://lists.osafoundation.org/mailman/listinfo/dev

Re: [Dev] Keeping Users' Data

Reply via email to