[Dev] Simplifying biref definition and kind extensions

Phillip J. Eby Thu, 08 Sep 2005 12:03:59 -0700

Yesterday, I posted a short and rough proposal for making it possible todefine a bidirectional reference from only one class. However, discussionon IRC and some emails I got privately made it clear that I didn't reallyprovide enough background on either the why's or how's of the proposal, andthere was also some IRC discussion that led to a better solution for one ofthe problems, than the solution I proposed here yesterday. So, I'm goingto restate the proposal to incorporate that enhancement, and also toprovide some of the background that was asked for.

Donn asked, "why is circularity a problem?" and "why is it more of aproblem now?" And the answer to both questions is that circularity breaksmodularity. Because if component A depends on component B, and B dependson A, then you can't use either one without the other, and so you no longerhave any meaningful distinction between A and B - they might as well be thesame component. You lose the ability for someone to learn A and then B,and you lose the ability to have A first and then optionally add B later.

So, the problem is that bidirectional reference definitions being splitacross parcels breaks this modularity, and the problem is popping up morenow as we try to enforce the modularization of the parcel structure. Westill want to have bidirectional references across parcels, of course, butwe need to be able to define them without making A depend on B and B dependon A. We'd like to be able to define the whole biref from *one* parcel, sothat you can have A and then add B later, and if you never add B then Astill works as-is.

Right now, however, if you define a biref with the schema API, you have todo half of it in A, and half in B. This is fundamentally broken because itmeans you can't have *any* modularity and still relate things in differentparcels. So, we need a way to define both sides of a biref from only oneplace.

And that's the first part of my proposal, that we allow you to define abiref from only one "side". In many of the cases of birefs in our currentschema, the other side is only there because we *have to have it*; we neveractually use the "A" side of the biref, we're only really using the "B" side.

Let's take Morgen's sharing use case as an example. The sharing parcelneeds to keep a collection mapping iCal UID's to calendar events. In thiscase, calendar events are part of parcel "A" - the "pim" parcel. The pimparcel shouldn't have anything to do with sharing, or else it can't be usedindependently, or taught independently. (That is, if you have tounderstand sharing before you can fully understand the pim parcel, we havea learning curve problem as well as an inability to deploy them separately.)

But, the only way Morgen can have a bidirectional reference between thesharing parcel and calendar events, is if he *modifies* the calendar eventmixin class to add an attribute, which then makes the calendar parceldepend on sharing - making A depend on B, in other words. This approachdoesn't scale very well, and it definitely doesn't work for third-partyparcels. And at our current team and application size, we are starting torun into problems because effectively we are all "third-party" with respectto one another's code. That is, in this example, Morgen is "third-party"when it comes to the calendar parcel.

So the first part of what I'm proposing, then, is that in the sharingparcel, Morgen should be able to do this:


    items = schema.Sequence(pim.CalendarEventMixin, inverse=schema.One())

and *not* have to go edit the CalendarEventMixin class, just to add thebackward reference that he never uses anyway. He just specifies that hewants a new 'One()' reference to be added to the CalendarEventMixin kind inthe repository, and this will happen as soon as his parcel isinstalled. The calendar parcel, meanwhile, can be loaded and used*without* the sharing parcel, because it doesn't have any references tosharing defined in its code. The calendar developers don't have to ask,"what's this sharing thing in our code?", and so they are happy. Morgendoesn't have to worry about annoying the calendar developers, or what tocall the extra attribute he doesn't want anyway, and so Morgen ishappy. Life is good. :)

There's an additional detail to this idea, which is how it's implementedinternally. When you create a "one-way biref" like this, it will actuallyadd a new attribute to CalendarEventMixin for you. You just don't have togive it a name, or add it to the class by hand. The name this attributewill be automatically given is "osaf.sharing.UIDMap.items.inverse", whichof course cannot collide with any of the calendar-specific attributesdefined by the calendar parcel. It does mean that it's more awkward toaccess that attribute, if you really need to access it for some reason,because you have to use getattr(ob,name) or ob.getAttributeValue(name)(where 'name' is "osaf.sharing.UIDMap.items.inverse"). You can't just say'ob.name' the way you can with attributes that are created explicitly.

This is a feature, though, not a bug. The fact that you can't access itvia 'ob.name' means that the calendar parcel can never *accidentally* usethis attribute, or define a conflicting attribute. This is a good thing,because it means that no matter what other parcels do to the kind, thecalendar parcel never needs to know about it. It can define whateverattributes it wants, and everybody else can have whatever attributes theywant, and everybody is happy. Life is still good. :)

Okay, so what about the case where you really want to be able to use thatattribute? Or what if you just want to add an attribute to an existingkind, like in the AbstractCollection.color case?

Well, that's what the second proposed feature is for, and this part of theproposal is a bit different today, based on the IRC discussionsyesterday. It's an API to allow you to define these additional attributes,and to access them conveniently, without having to spell out attributenames like "osaf.sharing.UIDMap.items.inverse". Here's an example, looselybased on a suggestion by Alec on IRC yesterday:


    class SidebarInfo(schema.Annotation):
        schema.annotates(pim.AbstractCollection)
        calendarColor = schema.One(blocks.ColorType)
        alertSound    = schema.One(schema.Lob)

If this class were defined in "some_module", then loading that module intothe repository would add two new attributes to the AbstractCollection kind:"some_module.SidebarInfo.calendarColor", and"some_module.SidebarInfo.alertSound".

But, it also does one other thing, which makes it much more useful. TheSidebarInfo class is actually an "annotation wrapper" class that you canapply to an item, in order to access the attributes "normally". That is,the Annotation subclass would have automatically-defined properties thatlook up the corresponding attributes on an underlying item.


So, if you wanted to get the calendar color of a collection, you would do this:

    the_color = SidebarInfo(some_collection).calendarColor

And if you wanted to set a collection's calendar color, you would do this:

    SidebarInfo(some_collection).calendarColor = the_color

And in each case, the attribute being get or set on the annotation objectwould cause the attribute to be get or set (using its full, dotted,internal name) on the wrapped item.

If you are doing lots of things with a particular annotation, you can ofcourse save it in a variable, and use it more than once:


   sbi = SideBarInfo(some_collection)
   MessageBox(("Your color is %s" % sbi.calendarColor), sound=sbi.alertSound)

However, annotation wrappers aren't persistent and shouldn't be stored inthe repository -- although they could be later if we have theneed. They're really just a convenience for Python code, at the moment,though, and things like attribute editors should probably just use theattributes' full dotted names, rather than using a wrapper to access them.

In addition to annotation attributes, you can also define methods onAnnotation classes, and then use these methods on the instances, e.g.:


    class SidebarInfo(schema.Annotation):

        schema.annotates(pim.AbstractCollection)

        calendarColor = schema.One(blocks.ColorType)
        alertSound    = schema.One(schema.Lob)

        def alert(self):
            MessageBox(
                 ("Your color is %s" % self.calendarColor),
                 sound = self.alertSound
             )

    # Alert about some_collection:
    SidebarInfo(some_collection).alert()

Thus, you get a kind of "dynamic mixin" capability that's ideal for addingextra information and behavior needed by "third party" parcels. (Exceptthat third party is a misleading name, since most of our parcels are "thirdparty" relative to some other parcel).

There are a couple more examples I need to present, in order to show howthe two proposals above (i.e. "one-way" birefs and annotation classes) worktogether. First, I'll revisit yesterday's Contact likers/likees example:


    class Friends(schema.Annotation):
        schema.annotates(pim.Contact)
        likes = schema.Many(pim.Contact)
        isLikedBy = schema.Many(pim.Contact, inverse=likes)

    Friends(somebody).likes       # get the contacts who somebody likes
    Friends(somebody).isLikedBy   # get the contacts who like somebody
    you in Friends(me).isLikedBy  # do you like me?
    me in Friends(you).likes      # no, really, do you like me?  :)

    Friends(everybody).likes.add(somebody)  # everybody likes somebody!
    Friends(me).likes.remove(you)           # I don't like you any more  :(

This of course is the special case where both attributes are annotating thesame existing kind. If we wanted to create a biref between two differentexisting kinds, we might have something like:


    class Favorites(schema.Annotation):

        schema.annotates(pim.Contact)

        favorite_feeds = schema.Many(feeds.Feed)
        favorite_movies = schema.Many(movies.Movie)

        # ... other 'favorite things' attributes here


    class FavoriteFeed(schema.Annotation):

        schema.annotates(feeds.Feed)

        favorite_of = schema.Many(
            pim.Contact, inverse=Favorites.favorite_feeds
        )

We could then use 'FavoriteFeed(some_feed).favorite_of' to find the peoplewho consider 'some_feed' a favorite, and we can use'Favorites(some_contact).favorite_feeds' to find a person's favoritefeeds. (And we can do all this without modifying either the pim or feedsparcels.)

The last example covers the case where a parcel wants to create a two-waylink between an existing kind and a new kind:


    class SoccerMatch(pim.ContentItem):
        # ... various other attributes here
        referee = schema.One(pim.Contact)
        # ... more attributes here

    class SoccerReferee(schema.Annotation):
        schema.annotates(pim.Contact)
        refereed_games = schema.Sequence(
            SoccerMatch, inverse=SoccerMatch.referee
        )

We can now use some_match.referee to find a match's referee, and we canfind out if a contact has refereed any games using'SoccerReferee(some_contact).refereed_games'. We could also add methods tothe SoccerReferee class to do things like compute statistics about therefereed games, etc.

Now, you could make an argument that this last use case should beimplemented by creating a SoccerReferee kind, and I wouldn't necessarilydisagree with you. However, as the number of roles an individual playsincreases, the number of mixin kinds is O(2^N). That is, every time amixin is added, the total number of kinds doubles. Having just threemixins means eight kinds (what we have now for stamping), 4 mixins means 16kinds, and by the time you get to twenty mixins there are over a millionpotential kinds. That's an awful lot of repository space just to store allthe different kind mixtures. :)

The annotation approach, on the other hand, doesn't create any new kinds,but instead allows items to be of multiple "virtual" kinds at once. AsDonn pointed out in an email this morning, this means that annotationsmight end up being a better way to implement extensible stamping in futureversions of Chandler.


Anyway, that's the updated proposal.  Comments?  Questions?

_ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _

Open Source Applications Foundation "Dev" mailing list
http://lists.osafoundation.org/mailman/listinfo/dev

[Dev] Simplifying biref definition and kind extensions

Reply via email to