CouchDB HTTP refactoring and new extension support

Damien Katz Thu, 02 Oct 2008 12:49:24 -0700

I just checked in the first round of refactoring of the front-end HTTPd code. Previously, nearly 100% of the CouchDB httpd was in couch_httpd.erl; now it's been divided up into couch_httpd_db.erl, couch_httpd_view.erl and couch_httpd_misc_handlers.erl.

The couch_httpd module now mostly provides a wrapper around mochiweb and implements a dispatch facility for finding the correct module to handle incoming HTTP requests.

On startup, CouchDB reads the ini config files to figure out what module and function, if any, should be invoked in response to special HTTP requests.


Here is an example of a default.ini read at startup:
[httpd_global_handlers]
/ = {couch_httpd_misc_handlers, handle_welcome_req, <<"Welcome">>}

_utils = {couch_httpd_misc_handlers, handle_utils_dir_req, "../../share/www"}

_all_dbs = {couch_httpd_misc_handlers, handle_all_dbs_req}
_config = {couch_httpd_misc_handlers, handle_config_req}
_replicate = {couch_httpd_misc_handlers, handle_replicate_req}
_uuids = {couch_httpd_misc_handlers, handle_uuids_req}
_restart = {couch_httpd_misc_handlers, handle_restart_req}

[httpd_db_handlers]
_view = {couch_httpd_view, handle_view_req}
_temp_view = {couch_httpd_view, handle_temp_view_req}

[daemons]
view_manager={couch_view, start_link, []}
db_update_notifier={couch_db_update_notifier_sup, start_link, []}
full_text_query={couch_ft_query, start_link, []}
query_servers={couch_query_servers, start_link, []}
httpd={couch_httpd, start_link, []}

/ end
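
To make the shape of these entries concrete, here is a rough Python sketch that reads a cut-down copy of the handler sections and splits each value into its module, function and optional extra argument. It is only an illustration: CouchDB does this in Erlang, the names SAMPLE, parse_spec and load_handlers are made up for this example, and the naive comma split is an assumption that only works for simple values like the ones shown.

```python
import configparser

# A shortened copy of the handler sections from the example above.
SAMPLE = """\
[httpd_global_handlers]
/ = {couch_httpd_misc_handlers, handle_welcome_req, <<"Welcome">>}
_all_dbs = {couch_httpd_misc_handlers, handle_all_dbs_req}

[httpd_db_handlers]
_view = {couch_httpd_view, handle_view_req}
"""

def parse_spec(value):
    """Naively split "{Module, Function[, Extra]}" into a tuple of strings.

    This is not an Erlang term parser; it only strips the outer braces and
    splits on the first two commas.
    """
    inner = value.strip()[1:-1]
    return tuple(part.strip() for part in inner.split(",", 2))

def load_handlers(text, section):
    """Return {url_key: (module, function[, extra])} for one ini section."""
    parser = configparser.ConfigParser(interpolation=None)
    parser.optionxform = str  # keep keys exactly as written
    parser.read_string(text)
    return {key: parse_spec(value) for key, value in parser.items(section)}

global_handlers = load_handlers(SAMPLE, "httpd_global_handlers")
db_handlers = load_handlers(SAMPLE, "httpd_db_handlers")
```

Entries with two elements carry no extra argument; entries with three keep the raw text of the third term.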

The [httpd_global_handlers] are the modules and function names (plus an optional third argument) that get invoked for special URLs. Once the ini key/values have been read into a dictionary in memory, every request URL that comes in is parsed to see if the first URL path segment matches a special key. For example, for a request like "GET /_utils/images/image.gif", CouchDB will parse the URL to get the "_utils" part, then find a matching "_utils" handler in the handler dictionary, then invoke the handler with the couch_http request object.
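
The lookup on the first path segment can be modeled roughly as follows. This is a Python sketch of the behavior described above, not the actual Erlang code; the handler bodies, the GLOBAL_HANDLERS name and the (method, path) calling convention are assumptions made for illustration.

```python
def first_segment(path):
    """Return the first URL path segment, or "/" for the root URL."""
    parts = [p for p in path.split("/") if p]
    return parts[0] if parts else "/"

# Hypothetical stand-ins for the real handler functions.
def handle_welcome_req(method, path):
    return "welcome"

def handle_utils_dir_req(method, path):
    return "static file for " + path

# Models the in-memory dictionary built from [httpd_global_handlers].
GLOBAL_HANDLERS = {
    "/": handle_welcome_req,
    "_utils": handle_utils_dir_req,
}

def dispatch_global(method, path):
    """Invoke the matching global handler, or return None if there is none."""
    handler = GLOBAL_HANDLERS.get(first_segment(path))
    if handler is None:
        return None  # no global handler matched this request
    return handler(method, path)
```

With this model, dispatch_global("GET", "/_utils/images/image.gif") selects the "_utils" handler, while a path such as "/db/_view/foo" matches nothing at this level.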

If there is no matching httpd_global_handler, then CouchDB hands the request off to the couch_httpd_db module, where it might invoke one of the [httpd_db_handlers] for it. The couch_httpd_db module first looks at the second URL path segment (example: in "GET /db/_view/foo", "_view" is the second path segment), and if it finds a db handler for it, it opens the database and invokes the handler with the HTTP request and database passed in as the context. But if no handler matches, the couch_httpd_db module attempts to serve the request itself (including some special URLs, like _all_docs and _compact).
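
The database-level step can be sketched the same way. Again this is a simplified Python model under stated assumptions: open_db, serve_builtin and the return values are hypothetical placeholders, and only the order of the checks follows the description above.

```python
# Hypothetical stand-in for a configured db handler.
def handle_view_req(method, segments, db):
    return ("view", db, segments[2:])

# Models the dictionary built from [httpd_db_handlers].
DB_HANDLERS = {"_view": handle_view_req}

def open_db(name):
    # Placeholder for opening the named database.
    return "<db %s>" % name

def serve_builtin(method, segments):
    # Placeholder for requests the db module serves itself
    # (for example _all_docs or _compact).
    return ("builtin", segments)

def dispatch_db(method, path):
    """Use the second path segment to pick a db handler, else fall back."""
    segments = [p for p in path.split("/") if p]
    second = segments[1] if len(segments) > 1 else None
    handler = DB_HANDLERS.get(second)
    if handler is not None:
        db = open_db(segments[0])  # the db is opened only when a handler matched
        return handler(method, segments, db)
    return serve_builtin(method, segments)
```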

This will allow for custom CouchDB database extensions. A simple example that's currently disabled by default is couch_httpd_misc_handlers:increment_update_seq_req/2. Its purpose is to allow a client to increment the database update seq# and have it returned to the client. This was needed by someone using CouchDB as an IMAP storage backend, but probably isn't generally useful. Therefore it stays off by default, and anyone who wants it can enable this extension by adding this to their local.ini file:


[httpd_db_handlers]

_increment_update_seq = {couch_httpd_misc_handlers, increment_update_seq_req}

Once enabled, whenever a client does a "POST /db/_increment_update_seq", CouchDB will invoke the handler.

The handlers can have a 3rd argument, which must be a valid Erlang term and will always be passed to the handler as an extra arg. In the main example, we pass the welcome message as an argument like this:

/ = {couch_httpd_misc_handlers, handle_welcome_req, <<"Welcome">>}
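
One way to picture the optional argument is a handler spec that is either just a function or a function plus one extra value. The Python sketch below is only an analogy: the function bodies and the invoke helper are hypothetical and are not part of CouchDB.

```python
# Hypothetical stand-ins for handler functions; names are illustrative only.
def handle_welcome_req(req, message):
    return "%s: %s" % (req, message)

def handle_all_dbs_req(req):
    return "%s: all dbs" % req

def invoke(spec, req):
    """Call spec[0] with req, appending the extra argument when one is configured."""
    function = spec[0]
    extra_args = spec[1:]  # empty tuple, or a single extra term
    return function(req, *extra_args)

# A spec with an extra argument, and one without.
welcome_spec = (handle_welcome_req, "Welcome")
all_dbs_spec = (handle_all_dbs_req,)
```

Here invoke(welcome_spec, req) passes "Welcome" to the handler on every call, while invoke(all_dbs_spec, req) calls the handler with the request alone.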

The daemon support causes CouchDB to load up new OTP server processes. You provide a name as the key and the module, function and start arguments as the value, and CouchDB will attempt to load the modules and start the subprocesses. If the subprocesses crash, CouchDB will restart them just like any other OTP server process. These OTP processes can also spawn external child OS processes if necessary.

By combining daemons and new HTTP handlers, it is possible to create new CouchDB services, like a full text search engine written in Erlang. The search engine daemon will keep the indexes up to date, and the HTTP handlers will process incoming requests and query the indexes, likely by interacting with the daemon.

I think we might still need to provide a way for CouchDB to find 3rd party extension modules. I'm thinking that should probably be an ini setting, with multiple directories for Erlang to search for modules.

Feedback welcome. Remember, nothing is set in stone, and much still can be done to further organize the code. Fire away.


-Damien
