Re: [Chandler-dev] [Sum] The Great Architecture Discussion of 2007

Phillip J. Eby Tue, 09 Oct 2007 16:02:45 -0700

At 11:08 AM 10/9/2007 -0700, Philippe Bossut wrote:

On the idea of having a "small group attack[ing] architecture on the side", my concern is to make sure that the progress of this work is correctly tracked and focus on the right thing. My experience with this kind of project (2 in my past life as a manager and both unsuccessful) makes me prudent and wary: it's easy for the "rearchitecture guys" and the "product maintenance guys" to be so disconnected that the 2 projects suffer and fail, the "rearchitecture project" going on a wild goose chase with grandiose objectives, the "product maintenance" seeing its immediate needs (say testability or performance) not addressed in any foreseeable future and growing downright negative on the rearchitecture. This is a very serious risk.

In a vacuum, it would certainly be reasonable to be wary of such projects. However, it should also be pointed out that in my time serving OSAF, we've successfully completed no less than four rearchitecture projects under my guidance, including the removal of parcel XML, the transition to "stamping as annotations", EIM-based sharing, replacing parcel discovery with eggs, replacing the old timer system with osaf.startup, and others.

All of these (not to mention the various greenfield architecture projects I worked on) were completed on or ahead of schedule, with high approval ratings for the results -- even from people who at first thought a particular project was unfeasible, unnecessary, or just a bad idea.

So, I think it's only fair to match your experience with two unsuccessful projects in other environments than this one, with my experience of 4+ successful ones in this environment.

Now, that's not to say that this effort can't fail -- anything can fail, of course. It's just that where Chandler is concerned, I haven't done so yet, and don't intend to start now. :) This is not braggadocio on my part, as it is not really a question of Chandler needing an especially *good* or "elegant" architecture; it would do quite well with a mediocre one -- as long as it was one reasonably *appropriate* to the tasks Chandler actually needs to perform.

Unfortunately, where Chandler has had any architecture at all (e.g. CPIA and the repository), it has typically been aimed at building an entirely different sort of application than the one we ended up developing!

Thus, Chandler's performance, reliability, code size, and testability have all been burdened by five-year-old assumptions about goals we have long ago stopped chasing. It is time to stop paying (and paying, and paying) for that legacy.

To mitigate that risk we'll need to get a monthly formal status point, reviewing what has been learned in that month, what's the next action and how much more clarity the recent progress gives us on the overall schedule of the project. The rearchitecture project will have to be very open so that the rest of the team stay engaged and on board with the new architecture. We also have to have enough visibility that, if the goals seem to slip further away every month, we should be able to call the project off and cut our losses. This is what could be called an "accountability" clause for the rearchitecture project.

Of course, accountability is a must. I myself would like to see a demo-capable version of Chandler on the new architecture (minus certain features such as sharing and email) by year-end, that offers significantly improved memory footprint, startup time, and UI responsiveness compared to its big brother.

That is, the improvements in the product should be visible to an end user, not just a developer. This is key for PR and funding reasons, to answer the inevitable (and misguided) "why are you rewriting" questions in situations where a nuanced reply won't be anywhere near as convincing as a side-by-side comparison.

If it works, we have much to gain, and if it fails, we lose only the time spent on the pilot by a limited group of people.

Testability
-----------
PJE: Testability is a requirement for the goals laid out in Katie's original email, and is therefore a "wedge issue" for the whole discussion. Testability is also necessary to build a developer community.

Morgen: Testability has been vital for development and ongoing maintenance and feature development of syncing code.

Heikki: Testability has not been a requirement in the development of a well-known family of open source web browsers whose process he happens to be familiar with.

There was some discussion of testability in the thread following Aparna's "Desktop Test Automation Project" email. In the standard layered approach, we could tackle testability of presentation layer code with a mock wx.

I can't bring myself to say that testability in and of itself should be the driver. IOW, if testability was our only issue with the architecture, we couldn't justify overhauling the architecture just for that and would rather think about other approaches to test (test by community, use off the shelf testing frameworks, etc...).

One must look beyond the surface meaning of the word "testability" to see what I mean. A component that cannot be unit tested is a component that has undesirable couplings to globals or other components.

Undesirable coupling, in turn, means that a unit cannot be worked on separately, nor can it be relied upon as a solid base for development of other components. A unit that can't be treated as a black box becomes a development bottleneck, as progress relies on a limited number of "experts" whose breadth and depth of knowledge is substituted for adequate tests and documentation.
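
To make the coupling point concrete, here is a minimal sketch in Python. It is not Chandler code: the names `CURRENT_TIME`, `is_overdue_coupled` and `is_overdue` are hypothetical, chosen only to contrast a function that reads a global with one that takes everything it needs as arguments.

```python
import time

# Hypothetical module-level global, standing in for any shared application state.
CURRENT_TIME = time.time


def is_overdue_coupled(due):
    # Coupled version: reads a global, so a test cannot choose what "now" is
    # without patching module state.
    return CURRENT_TIME() > due


def is_overdue(due, now):
    # Decoupled version: both inputs are explicit, so the function is a black
    # box that can be tested with plain values.
    return now > due


# A unit test for the decoupled version needs no setup at all.
assert is_overdue(due=100.0, now=101.0)
assert not is_overdue(due=100.0, now=99.0)
```

The second function can be handed to another developer along with its tests; the first can only be understood by someone who also knows what else touches the global.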

So, there is a lot more to the consequences on the project than the mere literal meaning of being able to test something. The lack of testability is the direct cause of many systemic project issues, as previously mentioned.

CPIA/Persistence in the UI
--------------------------
Katie's original email suggested that we shouldn't be persisting so much of the UI structure of the app. John noted that transparent persistence was a refreshing contrast to other systems he'd worked on, where you had to write SQL queries every time you needed to persist something. Philippe pointed out that having everything be persistent is confusing for new developers: it's hard to make sense of all the attributes that get tied to even simple items when displayed in the UI.

PJE: What we need is to separate out visual presentation (not persistent) from application logic (e.g. which items are selected in which collections, etc). That leads to greater testability (you can test the application logic without the UI). Ideally, you could separate out persistence as well, which means you can run tests of the application without the repository. Greater separation means more opportunity for parallel development (i.e. of views vs interaction model vs persistence) of features.

*Agreement* (Philippe, PJE, Reid, Andi, John, Mikeal): Cleaner separation of UI from the rest of the app.
*Agreement* (Philippe, PJE, Andi): Not persisting redundant/constant UI data.
*Agreement* (John, PJE): The current template mechanism in Blocks is confusing and mostly a historical artifact, so it should be removed.

The CPIA remnants and what Reid described as the "wall of abstraction" must go. If we could organize the rearchitecture project so that this part was done first and merged with the trunk before the rest (full testability, performance and scalability), I'd vote for that.
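
As a rough illustration of the separation described in the summary above (application logic such as "which items are selected" kept apart from visual presentation and from persistence), a sketch might look like the following. The class `SelectionModel` and its methods are hypothetical and are not part of Chandler or wx; a real view would subscribe to the model rather than own the state.

```python
class SelectionModel:
    """Hypothetical application-logic object: tracks which items of a
    collection are selected. It imports no UI toolkit and no repository."""

    def __init__(self, items):
        self.items = list(items)
        self.selected = set()
        self._listeners = []

    def subscribe(self, callback):
        # A view (or a test) registers here to be told about changes.
        self._listeners.append(callback)

    def select(self, item):
        if item not in self.items:
            raise ValueError("item is not in this collection")
        self.selected.add(item)
        self._notify()

    def deselect(self, item):
        self.selected.discard(item)
        self._notify()

    def _notify(self):
        # Listeners receive an immutable snapshot of the current selection.
        snapshot = frozenset(self.selected)
        for callback in self._listeners:
            callback(snapshot)


# The logic can be exercised with a plain list standing in for the view.
seen = []
model = SelectionModel(["task", "event"])
model.subscribe(seen.append)
model.select("task")
assert seen == [frozenset({"task"})]
```

Because nothing here is persistent or visual, the view, the interaction model and the storage code could in principle be developed and tested in parallel.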

Key to the approach that Grant and I envision, is that we are first and foremost *removing* code, rather than changing it in-place. After all, if we are changing existing code that does not have tests, how do we know whether it's working? So, it has to be done test-first, which means starting with no code, and adding tests. Once a test exists, only then is it safe to add in code. So in truth, the "walls of abstraction" in CPIA and the repository would be gone the moment the pilot project begins. :) (That is, they will be gone by simple virtue of not adding them to the branch.)

However, there would be no merging to the trunk. Instead, any new tests added on the trunk (and where applicable, the implementation of the corresponding fixes and feature additions) migrate to the branch, to keep it up to date. Then, when the time is right, the branch simply replaces the trunk.

In fact, calling it a branch is a bit of a misnomer, as even if we put it under branches/ in SVN, it'll likely be started with an empty directory, rather than a copy of the trunk (although where possible, we'll "svn cp" in files as they become useful, so that revision history remains.)

Performance, Scalability
------------------------
*Open Issue*: (Grant, Andi, Brian K) Unclear what the goals are w.r.t. email beyond what we have today. Need measurements of how Chandler performs in the presence of many items (and/or collections), as well as explicit performance goals.

As an objective, we should aim at handling thousands of items in Chandler: emails, events, tasks, snippets of all kinds (see Katie's email in that thread). So far though, it doesn't seem to me the most urgent issue, as I feel the one listed previously (CPIA architecture obscurity) is a road block on any effort. Also, one could think about addressing performance and scalability at the repo level, without changing the whole architecture.

One could think about it, but one would be unlikely to make much headway. :) As Andi says, the repository has come a long way, but this has mostly been through micro-optimization rather than looking at the overall approach, and we are running out of things to micro-optimize. Any major improvement will have to involve more than just the repository (as Andi has also pointed out).

Unfortunately, the overall approach we are using with the repository is actually an anti-pattern for this type of application, in my experience. In fact, we have what might be called anti-patterns all the way down, including:


1. application-level code meddling in storage-level details

2. lack of sufficient domain-specific query APIs

3. no indirection between the application's logical schema and its physical storage schema


4. implementing a generic database inside another generic database

5. implementing generic indexes inside of generic indexes

6. reimplementing all the guts of a relational database in Python, only without getting any of the benefits of actually using a relational data model (such as query transparency, or index/table definition independence at the application layer), or the performance/maintenance benefits of using an RDBMS written in C and maintained by someone else.

Note in particular, that getting rid of #6 eliminates #4 and #5, while making the other three a lot easier to fix.
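
For readers who want a picture of what fixing items 1 through 3 could look like once storage is delegated to an existing RDBMS, here is a minimal sketch. It assumes SQLite via Python's standard `sqlite3` module purely for illustration; the message above does not name a database. The class `ItemStore`, its methods, and the `item` table layout are all hypothetical.

```python
import sqlite3


class ItemStore:
    """Hypothetical storage layer. Table names, column names and indexes
    are known only here, never to application-level code."""

    def __init__(self, conn):
        self._conn = conn
        conn.execute(
            "CREATE TABLE IF NOT EXISTS item ("
            "id INTEGER PRIMARY KEY, title TEXT NOT NULL, due REAL)"
        )
        # The index is a physical detail; callers never mention it.
        conn.execute("CREATE INDEX IF NOT EXISTS item_due ON item (due)")

    def add_item(self, title, due=None):
        cur = self._conn.execute(
            "INSERT INTO item (title, due) VALUES (?, ?)", (title, due)
        )
        return cur.lastrowid

    def titles_due_before(self, cutoff):
        # A domain-specific query: the caller asks a question about items,
        # not about rows or indexes.
        rows = self._conn.execute(
            "SELECT title FROM item "
            "WHERE due IS NOT NULL AND due < ? ORDER BY due",
            (cutoff,),
        )
        return [row[0] for row in rows]


store = ItemStore(sqlite3.connect(":memory:"))
store.add_item("file taxes", due=5.0)
store.add_item("plan trip", due=20.0)
store.add_item("someday idea")
assert store.titles_due_before(10.0) == ["file taxes"]
```

In a layout like this the physical schema can change (a renamed column, a different index) without touching callers, and the generic database and index machinery is someone else's C code rather than application-maintained Python.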

_ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _

Open Source Applications Foundation "chandler-dev" mailing list
http://lists.osafoundation.org/mailman/listinfo/chandler-dev
