Re: Per-DB Auth Ideas and Proposal

Jason Davies Mon, 14 Sep 2009 13:10:30 -0700

Hi all,

Thanks for all the excellent responses!

With Chris Anderson's "simplest thing that could possibly work" ideain mind, here's a quick summary of what I plan to implement as a firstcut. I've taken ideas from multiple responses on this thread, so Iwasn't sure which message to reply to, but this plan is mostlyinspired by Adam's ACL ideas so I've included that message below thisone for reference.

The simplest idea is that we have a special doc in each database,"_local/_acl" or similar, containing a list of [role(s), "read" or"write"] pairs. By default everything is denied to everyone (except_admin). The most common use case would be to then have ["username","read"] and ["username", "write"] to give a user read and writepermissions to that particular database. (In this example, I'veassumed that in the _users database we map the "username" user to the"username" role for simplicity). If we want to give particular access(e.g. read-only) to *everyone*, we can use the special "*" string todenote a wildcard, which matches any role, including no role at alle.g. ["*", "read"].

I envisage this default "deny all" behaviour being a switch inthe .ini file, so people will only turn it on once they have users and/or ACLs set up.

OK, so that is the simplest implementation. Possible extensions wouldbe to make this more rule-like (c.f. Apache) and have ["grant" or"deny", role(s), "read" or "write"]. This would let you do morecomplex setups, although I'm not sure this is necessary unless weintroduce more advanced pattern matching. validate_doc_update alreadylets us do things like denying new docs from being created, etc, so noneed to concern ourselves with that level of write granularity (i.e.we only need a "write" permission, no need for "create" or "update").Pattern matching is a possible extension too, letting us grant/denycertain URL paths instead of the whole db. I'm not convinced tyingourselves to HTTP verbs is a good idea, it could be that read/write issufficient for us if we pass this to couch_db:open and throw anexception if any handlers attempt to write when they have opened thedb in read-only mode.

Another extension would be to have per-doc ACLs, but I think we stillneed more discussion about this on the mailing list. It couldpotentially look like having a similar ACL structure for each doc in aspecial _acl member. We can potentially push this information intothe fulldoc B-tree (instead of looking it up in a separate view) tosave having to do two view lookups when performing authz.

One decision that we might want to achieve more consensus on is aboutwhere the ACL itself is stored - either in the db that it applies to,or in the users db. Several on the mailing list thread have expressedthat it's better to keep the ACL with the db that it applies to, whichseems to make the most sense to me (yes, I've switched my thinking onthis): for example if you delete a db then its ACL doc automaticallygets deleted. As it is a local doc it won't get replicated anyway.This seems more intuitive as it's more similar to how UNIX permissionson a directory work.

My thoughts are that I should get down and implement this, as even ifwe change our minds about some things, like where the ACL gets stored,the majority of the code will be the same and trying it out as codemay give us valuable feedback about other design choices.


Thanks,
--
Jason Davies

www.jasondavies.com

On 8 Sep 2009, at 23:41, Adam Kocoloski wrote:

Hi Jason, nice proposal. We were discussing it today at Cloudantand had a few comments:
On Sep 7, 2009, at 6:50 PM, Jason Davies wrote:
Hi all,
There have been sporadic discussions about various granularities ofauthorization. The most simple level to tackle is per-dbauthorization. What follows is a summary of discussions and ideasso far.
I should point out that this is primarily to flesh out the defaultauthorization modules that address the needs of the majority ofusers. We probably will have an authorization_handlers settings,analagous to authentication_handlers, allowing custom authorizationmodules to be used.
1. Where are the permission "objects" themselves stored? Thepermissions determine which users can do what with each database.I think storing these in the per-node users database (called"users" by default) makes the most sense. We are talking about per-db auth so it wouldn't make any sense to store this information inthe affected databases themselves.
I think it's actually pretty sensible to store some authzinformation in the DB itself, for many of the same reasons outlinedby Brian and Benoit. The big exception there is the ability tocreate new DBs. That's traditionally the task of a server admin,but perhaps we could come up with some special role that could begranted to users to allow them to do that.
2. What types of operations do we need to support? I think themajority of users will only care about being able to makeparticular databases read-only, read/write, or write-only (not sureabout the latter one).
I think write-only is a keeper. It may also be useful todistinguish between creating new documents and updating existingones. For instance, SQL GRANT tables distinguish between INSERT andUPDATE.
On the other hand, our REST interface doesn't have that cleandistinction; PUT and POST can both be used for create and update.And I agree with Chris; mapping authz to REST verbs is a goodsmell. In the rest of the discussion I've assumed that mapping.
3. How do we implement these operations using the existing user_ctx{name=..., roles=[...]} object? I don't think we necessarily needto set any special roles, although this was my initial thought e.g.['_read', '_write'] on a per-db basis. As authorization is aseparate module, we can simply pass the appropriate permission(read and/or write) through when opening the db internally in thehttpd db handler function. The db-opening function will then needto throw an error if writes are attempted and it is in read-onlymode. Using actual roles is potentially more elegant, as customroles could also be set using the permission objects andimplementation might be easier.
+1 for adding elements to the roles array in the #user_ctx. More onthis in 5.1
4. One use-case we need to bear in mind is being able to grant/denyaccess to sets of databases at a time. One way to do this would beto allow patterns to be specified, for example:
{
  "_id": "foo",
  "type": "permission",
  "username": "jason"
  "match": "jason/*",
  "operations": ["_read"]
}
This would grant the user "jason" read-only access to any databasethat has the prefix "jason/".
5. Permissions per roles vs permissions per users? Although theabove example specifies access for a particular user, it might bemore elegant and efficient to do this per role instead. If peruser is needed this can be done by giving the user a special roleunique to them. If a user has multiple roles then we would takethe union of the resulting permission set.
+1 for roles here. I think it makes sense for the users DB todefine roles for each user, either by adding roles to a userdocument or users to a role document (or both). But the actualspecification of privileges for a role in a given DB should go inthe DB.
I realize this doesn't allow for easy configuration of privilegesacross multiple DBs.
5. Default settings: we already have the require_valid_usersetting, which forces a node to authenticate users. We would needto support certain access permissions for non-logged-in users i.e.anonymous users. This could be done using a special "_anonymous"string in the permission to override the default, which wouldprobably be read/write for everyone as it is now.
6. Future work: thisfred suggested that the pattern-matching couldbe extended to the full URL instead of just the database name.This seems like a simple way to extend authorization. Of course,it's dependent on a particular node's URL mappings (these can bechanged in the .ini). This then brings up the question of what theoperations should be, it would make the most sense to let them beHTTP verbs, so that one could restrict access to certain URLs tobeing only GET and HEAD for example. This seems a bit too tied toHTTP for my liking, but I guess CouchDB is very much a RESTful andtherefore HTTP-reliant database. Any further ideas would bewelcomed.
So, after giving this some thought I'm partial to the idea of AccessControl Lists. Instead of directly granting privileges on databasesin the users DB, we'd store an ordered list in the DB in a specialdocument that would allow|deny requests that match a rule. Forinstance, if I wanted to make a read-only DB where only I couldaccess the _design documents I could upload a document like
{
   _id: "_authorization",
   _rev: "1-1340514305943",
   _acl: [
{"access":"allow", "role":"kocolosk", "method":"*","path":"*"},{"access":"deny", "role":"*", "method":"*","path":"_design*"}{"access":"allow", "role":"*", "method":"GET","path":"*"},{"access":"deny", "role":"*", "method":"*","path":"*"}
   ]
}
The rules in the ACL array are applied in order, and the first ruleto match wins. Here I've assumed that my user has a correspondingrole, like a UNIX group.
I explicitly listed the deny rule at the end, but we could make thatthe default if we wished. CouchDB has historically been prettyopen, but sysadmins would probably prefer it if things were secureout-of-the-box. I think the right default setting will become clearduring the implementation.
Benoit mentioned that he wanted authz to replicate. If we decidethat's the way we want to go, storing the ACL in a regular documentwith a reserved ID would allow for that. If we didn't want it toreplicate, we could just change that docid to something like _local/authorization
We might take this one step further and allow additional AccessControl Elements in individual documents. These ACEs would beprepended to the DB ACL and would allow you to specify custom authzfor a subset of documents in a DB without having to resort to path-based regex and editing the DB ACL every time.
Finally, there's the issue of authz in views. What privileges doesthe view indexer have? If a user who is only allowed to read someof the documents in the DB is allowed to upload a _design document,it seems to me that the views generated from that _design documentmust exclude any forbidden documents. I guess this can work if the_design doc stores the roles of the user who saved it. It seemslike a tricky, but solvable problem.
Best, Adam

Re: Per-DB Auth Ideas and Proposal

Reply via email to