The 1.0 Thread

Damien Katz Thu, 18 Jun 2009 14:35:13 -0700

Okay, time to ask the question, what features do we need to get to 1.0?


I'm going to list my must haves, and my nice to haves.

Must have:

- Document integrity checking: Using some sort of hashing scheme for end-to-end integrity checking of documents and attachments. Reusing the revision ID as the hash of the document might work, and has the benefit of allowing writing the same changes to 2 different servers and not causing a conflict. Also, multiple clients can write the same change to a document and not get unnecessary conflicts.
- Reader/writer access for databases and servers: Allow/disallow anonymous, users, groups.
- Continuous replication: Keeping a constant connection and being able to replicate changes as soon as they happen.
- Better testing: We really need some performance and stress testing as part of the source. And we need much better code coverage in general with the testing.
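
A minimal sketch of the "revision ID as the hash of the document" idea, in Python for illustration only. The function names (content_rev, verify), the use of SHA-256, and the canonical JSON encoding are all assumptions made for this example, not anything decided in this thread. The point it shows: if the revision ID is derived only from the parent revision and the document body, two servers (or two clients) that apply the same change arrive at the same revision ID, so no conflict is needed, and the same hash doubles as an integrity check.

```python
import hashlib
import json


def content_rev(doc, parent_rev=""):
    """Hypothetical helper: derive a revision ID from the document content.

    The body is serialized canonically (sorted keys, fixed separators) so
    that the same logical content always hashes to the same value.
    """
    # Exclude any stored revision field so the hash covers content only.
    body = {k: v for k, v in doc.items() if k != "_rev"}
    canonical = json.dumps(body, sort_keys=True, separators=(",", ":"))
    payload = parent_rev + "\n" + canonical
    return hashlib.sha256(payload.encode("utf-8")).hexdigest()


def verify(doc, rev, parent_rev=""):
    """Hypothetical end-to-end check: does the content still match its rev?"""
    return content_rev(doc, parent_rev) == rev


# Two servers receive the same change independently.
first = content_rev({"title": "1.0", "status": "open"})
on_server_a = content_rev({"title": "1.0", "status": "closed"}, parent_rev=first)
on_server_b = content_rev({"status": "closed", "title": "1.0"}, parent_rev=first)
same_rev = on_server_a == on_server_b  # identical change, identical revision
```

Because the parent revision is part of the hashed payload in this sketch, the same content written on top of a different history still produces a different revision ID.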



Nice to have:

- Hashing/CRC everything written to disk: data, metadata, index structures, etc. But optional, since many filesystems actively integrity-check disk data.
- Better full text integration: Out of the box integration and the ability to intersect results with views, for easier result formatting. Lucene would be the primary FT engine, but we make it pluggable, much like the view engines are.
- Attachment level replication: By tracking the revision when an attachment was modified, the replicator can avoid copying unchanged attachments to the target. The same can apply to json fields, but it's much less of a win there.
- Partitioning/sharding support: Ideally would be nice to have something that "just works" without a lot of setup.
- Built-in authentication: A plug-in that authenticates HTTP users and assigns them roles. It would use a couch database as a directory that contains user documents, etc.
- Selective replication: The ability to replicate a subset of documents, using a javascript function as a selector.
- Server side doc processing: The ability to POST data and have arbitrary server-side processing. The simplest case is posting a document to a Js handler that can do some data cleanup and add default values to the document before saving it. But ideally it would be able to interact with the full database.
- Scheduled replication: The ability to schedule replication every so often, like a cron job. But this can be done with an actual cron job and CURL, so it's not critical to have it built-in.
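
A rough sketch of what selective replication could look like, with a Python predicate standing in for the javascript selector function mentioned above. The replicate function, the use of plain dicts as stand-ins for databases, and the "type" field are all hypothetical, chosen only to show the shape of the idea: the replicator asks the selector about each document and copies only those it accepts.

```python
def replicate(source, target, selector=None):
    """Hypothetical replicator: copy documents from source to target.

    source and target are plain dicts keyed by document ID, standing in
    for databases. selector is an optional predicate; when given, only
    documents for which it returns True are copied.
    """
    copied = []
    for doc_id, doc in source.items():
        if selector is not None and not selector(doc):
            continue  # the selector rejected this document, so skip it
        target[doc_id] = dict(doc)  # shallow copy so the two sides stay independent
        copied.append(doc_id)
    return copied


source = {
    "a": {"type": "invoice", "total": 40},
    "b": {"type": "note", "text": "hi"},
    "c": {"type": "invoice", "total": 75},
}
target = {}

# Replicate only the invoices.
copied = replicate(source, target, selector=lambda doc: doc.get("type") == "invoice")
```

With no selector the sketch falls back to copying everything, which keeps ordinary replication as the default behaviour.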


There are probably a bunch of things I forgot about.

Respond to this with your must haves and nice to haves. No promises you'll get your way (no guarantee for me for that matter), but let's start talking about it.

And anyone who wants to take on any of these issues: mine, yours or anyone else's, just do it. Read code, mail dev@ with questions and advice, write some code, repeat.


-Damien
