Re: MIME dump/load and implications

Nitin Borwankar Thu, 06 Aug 2009 16:07:07 -0700

Adam Kocoloski wrote:

Hi Nitin, sure. I think Paul just meant that if we wrote it forreplication, anyone could use the same facility to build somethinglike what you're talking about.


Hi Adam, Paul,

Yes certainly agreed in theory but in practice I see two majorobstacles, one social and the other technical to this approach.

a) The number of people who can work on the replication code is ahandful and this hugely raises the bar and puts all this in the domainof "someday-maybe" - "it will need to get on the release schedule" etc.which is less attractive to me.

On the other hand doing it in userland via couchapp throws open thegates for anyone to start experimenting with this and then the beststuff will emerge and get traction, one hopes. I couldn't possibly makeany useful contribution at the replication level at my current level ofknowledge of Couch/Erlang/... but hacking at the couchapp level is notout of the range of my capabilities. I suspect most users of Couch arein the same boat as me.

So I deliberately focus on a REST based MIME-payload protocol that couldrun in parallel with the REST based JSON-payload protocol.At this level of abstraction it is easy to understand and talk about fora much huger audience. Moreover the amount of work needed to bridge thegap between the already working python code and a couchapp/javascriptimpl of this is much smaller.

b) This will all also be possible in the replication layer but amessaging system has additional requirements such as keeping cumulativetrack of where the message came from - adding a MIME-header (JSONattribute) i.e modifying the doc at each hop etc. These concerns arenot fundamental to point-point doc replication and need to be layered ontop/in addition to replication. Replication needs to faithfullyreproduce a doc from one endpoint to another without the kinds ofmodification that happen to headers (metadata attributes) in messagetransport.

So thinking of messaging as "merely" replication glosses over somefundamental issues that distinguish message documents in motion, frommore general documents. Message documents have container levelattributes that get modified during the process of store-and-forward totrack the motion of the doc.Generic documents in CouchDB have no such requirement and it may becounterproductive to intermingle the requirements.

In fact it may be possible to make a stronger statement - Couchreplication should focus single mindedly on replication of documents ina pt-to-pt fashion.Messaging semantics can then be layered on top of replication, in anupper layer if desired - or not. Injecting messaging concerns(especially modification of container attributes) into replication is adistraction of focus for replication, IMHO and possibly counter to therequirements of faithful replication.

Hope this clarifies why I reacted somewhat strongly to conflating RESTbased messaging with replication - they overlap but should not beequated if we want to build a messaging system, != a peer to peer docsharing system.

After all, the replicator just speaks to CouchDB servers using thesame HTTP requests as everyone else.
I'm sure jchris knows better whether mimeparse would be suitable forthis. Cheers,

Mimeparse.js - looking at the code - seems focused on making the bestmatch of content-type to provide given an accept header so it may not bethe ...er best match for this.


Cheers,

Nitin

Adam

On Aug 6, 2009, at 5:26 PM, Nitin Borwankar wrote:
Hi Paul,
I never used the word replication - it should be possible to create aREST based couchapp driven MIME transfer p2p web quiteindependent ofreplication which is also cool. Front the couchapp with the usualauth-proxy stuff for now so only auth'ed people can communicate withyou.
Just replace JSON with MIME in all the reference docs and make theURL's point to a design doc that does the transformations.
On the way out it could be just _shows or an _list that takesmultiple objects and wraps them as a mime multipart.
On the way in set up some REST endpoints that take POST's, parse mimemultiparts ( jchris's mimeparser?) convert to Couch docs, manageattachments and puts them in _attachments ... and we're off to theraces - free user controlled, MIME-and-Mail-as-aplatform drivenwebmail apps for all.
Yay Couch!

Nitin


Paul Davis wrote:
Definitely some interesting points here. There have been discussions
on using multipart-mime messaging in the replication protocol which
could setup for some interesting prospects like this. I'm not sure on
specifics in terms of replication, but having an endpoint that allows
edits via multipart-mime could be a very fun thing to play with.

Also, AFAIK there's nothing that prevents an isomorphic
representation. As you point out, couchdb-python handles everything
just fine here.

Paul Davis
On Thu, Aug 6, 2009 at 4:46 PM, Nitin Borwankar<[email protected]>wrote:
Hi guys,
I see that the python based dump/load uses MIME multipart docs asan on-disk
serialisation format for couchdb databases.
An overall question then arises - can CouchDB be considered a MIMEdatabase
which oh also happens to talk JSON?
So before that - is there a 1-1 strong correspondence between aCouchDBdocument and a MIME multipart, or are there things around the edgesthat arecrufty - I would assume a strong correspondence since dump/loaduses it andI haven't seen any caveast about document content that is notdumpable.
So assuming the 1-1 correspondence - could one use some"translation layer"couchapp that accepts arbitrary content/type + multipart-mixed MIMEobject
over HTTP and then transparenty serialise them to JSON underneath.
Given that dump/load already does this - it would see that thereare noobvious glaring flaws in this logic - but I have been known to bewrong,
once :-).
If this is indeed feasible - then each CouchDB + MIME-trans becomesa webmail node - and Couch begins to be the platform for a messagingrevolutionas well as an application revolution. I am thinking now not asCouchDB forbacking up your email - but CouchDB as your mail client/server forp2p MIME
based "email".
Permissions etc are important to avoid complete disaster of course- but
private high quality communication that just reuses existing message
formats, with better storage and transport would seem like an ideawhose
time has come a long time ago and has been knocking at the door for a
decade.

Yes, yes, there's the issue of spam - so see the P.S.

Just a few idle thoughts,

Nitin
P.S. Back in 1998 I tried to convince Sybase to have MIME as anative typein the db and it even got speced out ( I have the spec with thedate on it!) but got canned becous ethe VP of enginnering wanted to know "whatwas themarket exactly for this kind of stuff". Other than that I wasgranted apatent for doing p2p discussions over email back in 2003 - I let itexpirefor multiple reasons. So I am somewhat non-naive about and awareof theissues and pitfalls around this sort of thinking. At the same timeI am ofthe strong belief that when one looks at messages as data to bemoved aroundbetween endpoints with well defined addressing schemes, and oneignores the
protocols for a bit, then all sorts of fun things start to happen.


37% of all statistics are made up on the spot
-------------------------------------------------------------------------------------
Nitin Borwankar
[email protected]

Re: MIME dump/load and implications

Reply via email to