Re: [Chandler-dev] [Sum] The Great Architecture Discussion of 2007

Andi Vajda Tue, 09 Oct 2007 14:39:07 -0700


On Tue, 9 Oct 2007, Grant Baillie wrote:

Also, one could think about addressing performance and scalability at the repo level, without changing the whole architecture.
While Chandler was developed with an infinitely scalable and infinitely fast repository in mind, it might be time to let reality sink in. The repository has come a long way in terms of performance and could still be improved, for sure, but couldn't one think about addressing performance and scalability at the app level as well, without changing the whole repository architecture?
Well, one can think about anything, so sure :). But as things stand, there isn't really an "app level" to speak of: the repository is intertwined with everything, and its API shapes the app layer in ways that aren't always so effective. (The current indexing situation is one concrete example.)

In other words, it's up to the app to dis-intertwine itself from the repository. I don't think that just tackling repository performance in isolation, as has been the approach until now, is the right solution anymore.

If, for instance, when importing 100,000 mail messages, we tell the UI about every itsy bitsy change one attribute at a time, no amount of repository performance improvement is going to get us to the performance we expect.
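To make the point concrete, here is a minimal Python sketch of coalescing per-attribute change notifications into batches before they reach the UI. It is not Chandler code: `BatchingNotifier`, `import_messages`, the `deliver` callback and the `batch_size` value are all hypothetical names and numbers chosen for illustration.

```python
from collections import defaultdict


class BatchingNotifier:
    """Hypothetical helper: collects attribute changes and delivers them in batches."""

    def __init__(self, deliver):
        self._deliver = deliver            # callback standing in for "tell the UI"
        self._pending = defaultdict(set)   # item id -> names of changed attributes

    def changed(self, item_id, attribute):
        # Record the change only; nothing is sent to the UI yet.
        self._pending[item_id].add(attribute)

    def flush(self):
        # Deliver everything recorded so far as a single notification.
        if not self._pending:
            return
        batch = {item_id: sorted(attrs) for item_id, attrs in self._pending.items()}
        self._pending.clear()
        self._deliver(batch)


def import_messages(messages, notifier, batch_size=1000):
    """Record changes for each (item_id, attribute_names) pair, flushing per batch."""
    for count, (item_id, attrs) in enumerate(messages, start=1):
        for name in attrs:
            notifier.changed(item_id, name)
        if count % batch_size == 0:
            notifier.flush()
    notifier.flush()  # deliver whatever is left after the last full batch
```

With three attributes per message, 100,000 messages would produce 300,000 individual notifications one attribute at a time, but only 100 calls to `deliver` with a batch size of 1,000.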

About mail import performance, I need to point out that the message in the status bar at the bottom of the UI is misleading. It says "committing <n> messages", implying that it's spending time inserting item records into the repository.

Since this conversation is now in the mode where we're throwing around row insert timing numbers, how about changing the message to say something like "converting mail messages to chandler items"? The actual repo insert part, the repo commit() part, is pretty small, even negligible, when compared to the time spent "chandlerizing" the mail messages into items with a live UI. I sure don't want people to think that it takes half an hour to write 7,000 mail message items into the repository.

Earlier today, Heikki proposed using multiple processes to better take advantage of multi-core hardware. Berkeley DB and the Chandler repository already fully support multiple processes accessing the same repository concurrently. It should be fairly easy for the application to split off some tasks into separate processes without any code changes in the task or repository components themselves. Importing a large amount of mail in a different process or background syncing collections in a different process could yield some interesting results.
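As a rough illustration of splitting a task off into its own process, the Python sketch below hands a batch of messages to a child interpreter and reads back a summary. Everything in it is an assumption for illustration: `WORKER_SOURCE` and `import_in_child_process` are hypothetical, and the worker only pretends to convert messages. In a real setup the child would open the same repository itself and commit there.

```python
import json
import subprocess
import sys

# Hypothetical worker: a stand-in for "convert mail messages to items and commit".
# It reads messages as JSON on stdin and writes a JSON summary to stdout.
WORKER_SOURCE = r"""
import json, sys
messages = json.load(sys.stdin)
converted = [{"subject": m["subject"].strip(), "size": len(m["body"])} for m in messages]
json.dump({"imported": len(converted)}, sys.stdout)
"""


def import_in_child_process(messages):
    """Run the worker in a separate Python process and return its summary dict."""
    result = subprocess.run(
        [sys.executable, "-c", WORKER_SOURCE],
        input=json.dumps(messages),
        capture_output=True,
        text=True,
        check=True,  # raise CalledProcessError if the child exits with an error
    )
    return json.loads(result.stdout)
```

`subprocess.run` waits for the child to finish, which keeps the sketch short. An application with a live UI would more likely start the child with `subprocess.Popen` and poll it, so the main process stays responsive while the import runs on another core.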


Andi..
_ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _

Open Source Applications Foundation "chandler-dev" mailing list
http://lists.osafoundation.org/mailman/listinfo/chandler-dev
