Re: [Dev] schema api: final review before migration

Phillip J. Eby Tue, 14 Jun 2005 14:08:35 -0700

At 10:23 AM 6/14/2005 -0700, Katie Capps Parlante wrote:

Its our goal to finish the migration by the next milestone. For the mostpart, the dev platform team will handle the migration, but you need to beaware of some of the issues during the transition. Phillip will send moredetails about the migration.

During development of 0.6, Chandler is moving from defining its schemaspartly in XML and partly in Python, to defining them entirely inPython. While there are lots of positive benefits to this, there are alsosome things you should be aware of, especially during the change-over process.

Currently, because we define parts of the schema in Python (the classhierarchy) and part in XML (the parcel and kind hierarchies), it's possiblefor them to be inconsistent in some ways. For example, there are a fewclasses whose name is different from the name of the Kind that uses theclass. There are also several Kinds that share the same class, rather thancreating a new class for each Kind. Finally, there are classes thatinherit from different classes than those inherited by their Kind.

For the most part, these minor inconsistencies don't affect Chandler'soperation right now, but when we move to having only *one* place where thisinformation is specified, it will be necessary to know which representationis correct: the parcel.xml or the class? Later today, I'll be postingabout the inconsistencies I've found, and my proposed resolution forthem. These kinds and classes will need to be made consistent before theycan be migrated.

It's important that they be resolved correctly, however; bug #3242 was theresult of making the Kind match the class, when the class should've beenmade to match the Kind. Doing it the other way around can produce bugs,too. Luckily, there are only a handful of inconsistencies, and most ofthem should be straightforward to resolve.

You may have also noticed various '__parcel__' variables popping up aroundthe codebase, along with various 'import' statements in __init__.py files,and that some imports in modules are changing from this format:


    import foo.bar.Baz as Baz

to this format:

    from foo.bar import Baz

These changes are all to support the schema API, or more precisely, tosupport the *transition* to the schema API, during which we need to havethe schema API and parcel.xml-defined schemas interoperating. So, what dothese changes mean, and how do they affect you?

The '__parcel__' setting tells the schema API that the classes in thatmodule belong to the named parcel. These parcel names are Python packagenames, not repository paths or XML namespace URIs. In the long run, wewill be doing away with both repository paths and XML namespace URIs as away of identifying parcels, since they will become redundant with respectto Python package names.

However, until we have actually done away with the use of repository pathsin the Chandler code base, we need to ensure that the new schema livesunder exactly the same repository paths as the old schema, and that's where'__parcel__' comes in. After the transition, we won't care aboutrepository paths any more, because we'll be importing classes frompackages, not retrieving kinds using paths. But until then, we need'__parcel__' settings to maintain backward compatible repository paths.

Typically, the '__parcel__' is set to the name of the enclosingpackage. For example, in the 'osaf.contententmodel.ContentModel' module,'__parcel__' is set to "osaf.contentmodel", because that's the name of thepackage that contains the relevant parcel.xml file.

You do not need to add '__parcel__' strings to existing modules unlessyou're one of the people who is porting existing packages. However, if youare reorganizing a parcel's schema and want to move kinds between parcels,you may need to change this setting if it already exists. If you are notsure what to do with a '__parcel__' setting during the transition period,please contact me. Or if you're adventurous, you can run the'schema_status' command before and after your change, diffing the outputsto see if you broke anything. :) (Always remembering to run the testsbefore checkin, of course.)

The second kind of change you need to be aware of, is the addition ofimports to __init__.py files. These ensure that all of a parcel's classesare defined and present in the package's top-level module when thecorresponding parcel is loaded. This is mostly a transitional change, andmight go away in some cases if we end up flattening the package structuresa bit in a later milestone. In general, however, all of a package'sdependencies need to be imported and ready to use by the time the packageis fully imported.

Therefore, if you add a new Kind+class during the transition, please makesure that the containing package's __init__.py is updated to import the newclass. If the class has the same name as an enclosing module, you willneed to rename it in the import. For example, inosaf.contentmodel.__init__, you'll see this:


    from ItemCollection import ItemCollection as __ItemCollection

This is because the class name (ItemCollection) would clash with the modulename (also ItemCollection). Renaming it in the import prevents thiscollision. The schema API doesn't care what name it has in the package__init__, just as long as it's there. (Its original name in the definingmodule, however, *must* match the name of the Kind, and the Kind *must* bedefined in the parcel.xml of the package named in the module's '__parcel__'setting.)

By the way, you'll notice that it's a pain to have classes whose name isthe same as the module name, and we strongly suggest you don't do it infuture. There are two common practices used to avoid this. One, that isalready done many places in Chandler, is to make the module name a plural(such as 'ContainerBlocks') so that it is different from the singular classname (such as 'ContainerBlock'). Another practice, that is more commonamong large Python frameworks (e.g. Twisted, Zope, PEAK, etc.) is to useall-lowercase names for modules. For example, in Twisted, the'twisted.internet.selectreactor' module contains the 'SelectReactor'class. (In addition to avoiding name clashes, this convention also makesit obvious whether a particular piece of code is working with a module or aclass.)

The third major class of change is moving from 'import x.y.Z as Z' to 'fromx.y import Z' or just 'import Z'. This is not a change made for estheticreasons, but practical ones. A quirk of the Python import machinery makesthe old form not work, when you are importing a sibling module, while beingimported by a parent package that is in the import statement.


For example, in osaf.contentmodel.ItemCollection, the code originally did this:

    import osaf.contentmodel.ContentModel as ContentModel

However, when we added code to osaf.contentmodel that importsItemCollection, this means that this statement executes *while* theosaf.contentmodel package is still being imported, and it thereforefails. Changing the statement to:


    from osaf.contentmodel import ContentModel

fixes the problem.  We could also just say:

    import ContentModel

because the current package *is* osaf.contentmodel.

You do not need to go through your code and change these, but you should beaware of the issue, and we recommend you write new import statements in oneof the two other forms. (Note: you can still use 'as' with these forms torename a module; it's not 'as' that causes the problem, but rather theattempt to do an absolute import while the parent package is still in theprocess of being imported.)

The current plan for schema migration is to have all Kind, Attribute, andCloud definitions moved to Python by the next milestone date. In the nextmilestone, we'll be looking at getting rid of path dependencies, flatteningpackages, and beginning to take advantage of the benefits of the schema API(like being able to create and test items without needing to go through aparcel load operation). These benefits unfortunately are not available*during* the transition period, because of the need for backwardcompatibility, and because we want to disturb things as little as possibleduring the transition process. Thanks for your patience and assistance.


_ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _

Open Source Applications Foundation "Dev" mailing list
http://lists.osafoundation.org/mailman/listinfo/dev

Re: [Dev] schema api: final review before migration

Reply via email to