Re: dirty reads - update strategies

Damien Katz Thu, 13 Nov 2008 10:30:34 -0800


On Nov 13, 2008, at 1:10 PM, ara.t.howard wrote:

On Nov 13, 2008, at 10:39 AM, Damien Katz wrote:
My answer is "Don't do that". Values in documents shouldn't dependon values in other documents, that's a better fit for a relationalor OO DB. In your example though, CouchDB's views could be used tocompute the sums.
i don't think that's realistic. consider something like thefollowing:
let's say we write a publishing system, users can create documentswith content and tags. at the end of the month the editor is goingto write a summary of the content from that month, obviously thissummary should be tagged with the union of the tags from allsummarized content - for later searching. regardless of whether westore the tags inside the document or outside of it we have quite atask - we need to get a consistent read of all content for themonth, with all it's tags, in order to properly construct thesummary document with it's aggregate tags. this isn't strictdependence - it's merely a read/write consistency issue which nearlyany application is going to face. we can argue that it's notimportant that the summary of tags exactly mirrors the tags of it'sconstituent parts, but that kind of thinking results not in aninformation store, but a collection of valueless data.

CouchDB views are a consistent snapshot of the database, your reportsare generated from the views. The view APIs are the place to look forbetter reporting capabilties.

anyhow, i think it's important to be able to agree upon bestpractices for this kind of operation. saying that values shouldn'tdepend on values in other documents is quite a statement - it meanscouch should no be used for any information store where theinformation value needs to grow recursively.

What I mean is you should never depend on the accuracy of the computedvalues in documents that are based on other documents. Particularlywith replication.

in my case we're modeling financial information which gets processedin increasingly sophisticated ways - where documents are inputs toprocesses which produce other documents. i can't think of anapplication that does not do the same thing: a blog comment dependson the blog post, a 'friends list' depends on the users, etc.



are you referring to 'values' as different from 'ids' ?

Yes, I mean values as computed values. The main post shouldn't beupdated with a comment count or anything computed like that. It's fineif comments have a reference to their parent, and its fine if thecomments are tagged as children of the post. This way, when the mainpost is opened, the comment count can be computed from a view, or whenviewing a comment, the user is also shown the parent, and maybesubcomments if its a threaded discussion.


-Damien

kind regards.

a @ http://codeforpeople.com/
--
we can deny everything, except that we have the possibility of beingbetter. simply reflect on that.
h.h. the 14th dalai lama

Re: dirty reads - update strategies

Reply via email to