Re: Designing a database

Troy Kruthoff Mon, 28 Dec 2009 22:44:44 -0800

I made an app that does exactly what you described, although it was a"for-fun" hack just to showcase couch to my buddies (it was calledtweetmesexy, so you can imagine how much fun it actually was)... WhatI ended up doing was:


1)  Each new comment is a new doc that references the tweet id

2) Use view collation to get the tweet(s) and comments via singlehttp call (http://wiki.apache.org/couchdb/View_collation)3) Run a script (via cron or whatever) to move the comments to thetweet (and delete the comments) when the tweet is no longer "hot".This is not required, but in our case it allowed us to do some niftyanalytics thanks to couch's incremental map/reduce

As for if couch is a good fit or "update-heavy" applications, I thinkan RDBMS has advantages in a true "update" scenario (like 'updatestats set counter=counter+1'). But remember, you are only using theword "update" because couch's awesomeness allows you to even considerstoring the comments inline with the doc. Technically you can do thesame with an SQL database, using a serialized blob and have the sameconflict issues (without built-in revision love).

So assuming I'm correct that the structure of your data will besimilar if using a SQL database or couch, you would be well servedwith couch:

1) You can archive the comments inline, as I mentioned above and runcool map/reduce on the tweet and comments together2) Simple master-master, allowing you to scale writes to your heart'scontent3) With SQL you'll need multiple queries (or go the ugly join route)to get the comments and the tweet, vs a single http call

Bottom line, just because you find yourself structuring your data likeyou would in an SQL database, does not negate the other advantages ofcouch.


Troy

On Dec 28, 2009, at 10:09 PM, Sean Clark Hess wrote:

Our system will have comments related to live data - imagine people
commenting on tweets right after they are written.
I'm having trouble deciding how to model it. It makes a lot of senseto makeone document containing all the comments for each data segment, butwe could
theoretically have hundreds of users commenting on the same segment at
once.
Would data consistency become a nightmare? With an RDBMS you wouldhave a
comments table, and insert a new row for each comment - preventing
conflicts. I could do the same thing with couch, by adding a separate
document for each comment, but it seems to violate a fundamentalprinciple
of couch.
Is Couch DB a bad fit for an update-heavy system? Updates will onlybe heavy
within the first minute or so after the data is released, then it will
switch to a very read-heavy system.

Thanks for your help

Re: Designing a database

Reply via email to