Re: author ids

Janne Jalkanen Fri, 22 Aug 2008 11:30:55 -0700


On 22 Aug 2008, at 18:31, Andrew Jaquith wrote:

1) storing of attachments (I haven't figured out yet whether it's better to store them in a separate workspace, because then you can leverage faster local filesystems instead of putting really big binaries into the database)
Probably better to store these in a second database -- sort of how, today, we allow you to use a different directory for attachments v. wikipages.

I think I was more convinced by Florian; the repository should deal with large items. This is also faster, since you don't need to do two lookups for each object.

The ID associated with documents and revisions and whatnot should be the unique ID number. That's classic normal-form stuff.


That was my initial reaction as well, but...

Storing the id alone brings in the following problems:
* Imports/exports break, since the repo model would only export the ID, and there would be no binding of that to real identity
You'd rely on the user/group managers to tie the user identity back to the IDs.

Here lies the problem - the JCR API does not know about this at all. Once you say Session.exportSystemView(), you get *exactly* what is in the repo. IDs and all.

Also, the JCR best practice seems to be to stay away from IDs. If you need to reference something intra-workspace, you should use a Path or Name property.


http://wiki.apache.org/jackrabbit/DavidsModel

The approach I've seen elsewhere is to have a place where you map the user IDs to the identifiers used on the "identity system of record," whether that be LDAP, a relational database or whatnot. This adds another level of indirection, of course, which sort of sucks, but it's really just one more table that would get stored in JCR.

Well, JCR does not exactly store tables, but it stores trees (with arbitrary built-in references to make the graph circular, if you want to).
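
To make the extra level of indirection concrete, here is a minimal Java sketch. The class name IdentityMap, its methods, and the LDAP-style identifier in the usage note are all hypothetical; none of them come from JSPWiki or the JCR API. The sketch keeps the mapping in memory purely for illustration and says nothing about how it would be laid out as nodes in a repository.

```java
import java.util.HashMap;
import java.util.Map;
import java.util.Optional;

// Hypothetical sketch: binds internal wiki user IDs to the identifier
// used by an external "identity system of record" (LDAP, RDBMS, ...).
final class IdentityMap {
    // uid -> external identifier; in-memory only, for illustration.
    private final Map<Long, String> externalByUid = new HashMap<>();

    // Record which external identity a wiki UID stands for.
    void bind(long uid, String externalId) {
        externalByUid.put(uid, externalId);
    }

    // Look up the external identity; empty if the UID was never bound.
    // An exported revision that carries only a UID hits this empty case
    // whenever the mapping was not exported along with it.
    Optional<String> resolve(long uid) {
        return Optional.ofNullable(externalByUid.get(uid));
    }
}
```

After bind(42L, "uid=janne,ou=people,dc=example,dc=org"), resolve(42L) yields that identifier, while resolve(7L) yields an empty Optional. The second case is the import/export problem in miniature: the ID survives, but its binding to a real identity does not.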

We probably should keep the interfaces the way they are, but make the default implementation ("JCRUserDatabase") use the JCR back-end. Do we keep the XML and JDBC implementations for those who want them, or maybe even get rid of them?

This is feasible, though then access to the Repository object needs to be centralized somewhere (which means that it will leak upwards).

Migration, I think, is the key. We need to keep those around at least to help in the conversion process.

Another problem with the UIDs is that then you can never really delete someone's user account; you must retain the mapping. Otherwise you lose info about who changed what. This means that it is also impossible to re-register using the same account name, even though it has been "removed" (which isn't necessarily a bad thing, since it could open up some security holes if you could guess a deleted UID, and some pages still had ACLs referring to it). But what this means is that in practice, the login name becomes a unique key throughout the wiki's lifetime, so you can't actually change the UID - you really have to go and create a new account.

Dunno. There is a security problem brewing in allowing people to change their account id's, because ACLs refer to those IDs, not the UIDs. We should at least keep a trace of all old loginnames and wikinames. Or just not allow changing them at all. I'm starting to see Terry's point of view on all this...
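
The "removed but never forgotten" behaviour described above could look roughly like this in Java. AccountRegistry and all of its methods are hypothetical names made up for this sketch, not existing JSPWiki classes, and the in-memory maps stand in for whatever store would really hold the data.

```java
import java.util.HashMap;
import java.util.HashSet;
import java.util.Map;
import java.util.Set;

// Hypothetical sketch: accounts are tombstoned rather than deleted, so
// old revisions and ACLs can still be traced back to a login name.
final class AccountRegistry {
    private final Map<String, Long> uidByLogin = new HashMap<>(); // never shrinks
    private final Map<Long, String> loginByUid = new HashMap<>(); // never shrinks
    private final Set<Long> removed = new HashSet<>();            // tombstones
    private long nextUid = 1;

    // Create an account. A login name that was ever used, even by a
    // removed account, is rejected so it cannot be re-registered.
    long register(String loginName) {
        if (uidByLogin.containsKey(loginName)) {
            throw new IllegalArgumentException("login name already used: " + loginName);
        }
        long uid = nextUid++;
        uidByLogin.put(loginName, uid);
        loginByUid.put(uid, loginName);
        return uid;
    }

    // "Remove" an account: mark it inactive but keep the mapping.
    void remove(long uid) {
        if (!loginByUid.containsKey(uid)) {
            throw new IllegalArgumentException("unknown uid: " + uid);
        }
        removed.add(uid);
    }

    // True only for accounts that exist and have not been removed.
    boolean isActive(long uid) {
        return loginByUid.containsKey(uid) && !removed.contains(uid);
    }

    // Still answers for removed accounts; null if the UID never existed.
    String loginNameFor(long uid) {
        return loginByUid.get(uid);
    }
}
```

With this shape, removing an account leaves loginNameFor(uid) working for page history, while a second register() with the same name throws. That is exactly the trade-off discussed above: the login name effectively becomes a unique key for the lifetime of the wiki.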


/Janne
