On Oct 19, 2008, at 11:34 PM, Antony Blakey wrote:

If you want to ensure that the username is unique at the time the user enters it, then you need a central synchronous service. Using the username/password combination as the unique pair isn't a good idea, because it only takes two naive/lazy users choosing the same password (based, say, on their username? :) for a collision to subsequently occur.

I've been considering this in a production environment, and I saw four solutions:

1. Append some form of unique id from the server you are currently talking to, e.g. a checksum of the machine uuid + process id. Any checksum is going to have some chance of global collision, but it could be made vanishingly small. Not great for the user, because they end up with a complicated username.
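A minimal sketch of that first option, assuming Ruby and a per-machine uuid (the names here are mine, not from the post): hash the machine uuid plus process id and append a few characters of the digest to the chosen username.

```ruby
require 'digest/sha1'
require 'securerandom'

# Stand-in for a stable per-machine identifier; a real deployment would
# read a persistent machine uuid rather than generate one at boot.
MACHINE_UUID = SecureRandom.uuid

# Append a short server-derived suffix to the username, so two servers
# accepting the same name concurrently still produce distinct strings.
def suffixed_username(username)
  digest = Digest::SHA1.hexdigest("#{MACHINE_UUID}-#{Process.pid}")
  "#{username}-#{digest[0, 6]}"   # six hex chars keeps the name readable
end
```

The suffix is stable for a given server and process, so the same user always gets the same full name back; the cost, as noted above, is that the user is stuck with a name they didn't fully choose.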

2. Define your user interaction such that it can deal with subsequently needing to add some suffix to the username, e.g. when you get a replication conflict (which could involve a number of conflicts equal to the number of writable replicas), you amend some/all of the names to include a serial number and then email the user. This complicates things for the user, and they end up with a username they haven't chosen, or they may not see the email and end up calling tech support, etc. etc.

3. Use couchdb in a single-writer, multiple-reader scenario. If you only do that for the activities that require uniqueness, then you have consistency issues to deal with, because replication is asynchronous. One way to handle that is to switch a session to the writable server as soon as it needs uniqueness. The single writer becomes a bottleneck, but this is what I'm doing, because it matches my information architecture.

4. Use a central specialized server to check uniqueness and generate an opaque userid token that you would subsequently use as a key (you shouldn't use the username as a key). An ldap server or something like it. Equivalent to the option above, but the single server only needs to deal with the particular operations requiring uniqueness. It's still a single point of failure, but I don't think you can get around that if you want synchronous global uniqueness testing.
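Option 4 could be sketched, in-memory and without the network layer, as a service that synchronously checks the name and hands back an opaque token to use as the document key (class and method names here are hypothetical, not from the post):

```ruby
require 'securerandom'

# Hypothetical stand-in for the central uniqueness service: it reserves
# a username synchronously and returns an opaque userid token, which the
# application then uses as the key instead of the username itself.
class UniquenessService
  def initialize
    @taken = {}
    @mutex = Mutex.new   # reservations must be atomic under concurrency
  end

  # Returns an opaque token if the name was free, nil if already taken.
  def reserve(username)
    @mutex.synchronize do
      return nil if @taken.key?(username)
      @taken[username] = SecureRandom.uuid
    end
  end
end
```

A real deployment would back this with an ldap server or similar, as suggested above; the essential property is just check-and-reserve in one synchronous step.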



a validating and complete treatment - thanks.

so pretty much my thoughts exactly. the further advantage that #3 has is that it means *nothing* has to be done up front; it's only when the app scales out to multiple dbs that work needs to be done, but at that time it's presumably justified. i've also considered #4 heavily (a possible great web service actually...) and will probably go with some sort of hybrid. that is to say, always get ids from the single db, but in a way that would require zero code changes when getting an id means hitting some central service

  Db.next_id_for('user')

for instance. that way i can run with a single writer, and seamlessly move to #4 later. i don't even think that would need to be a single point of failure, as having a couple of those machines would be trivial if they knew about each other and generated ids with a small amount of server-based uniqueness in them, for instance

  42a
  42b
  42c

all coming from behind a set of three id generators. in the end it does seem like only #3 and #4 are appropriate for systems used by people.
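the suffixed-generator idea above could be sketched like this (class name and api are mine, just to illustrate):

```ruby
# Hypothetical id generator for the scheme above: each generator tags its
# ids with a server letter, so "42a" from server a can never collide with
# "42b" from server b even though the counters run independently.
class IdGenerator
  def initialize(server_tag)
    @server_tag = server_tag
    @counter = 0
    @mutex = Mutex.new   # keep ids unique under concurrent callers
  end

  def next_id
    @mutex.synchronize { "#{@counter += 1}#{@server_tag}" }
  end
end
```

any generator can serve any request, so losing one machine just means its letter stops being issued; the others keep going.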

cheers.


a @ http://codeforpeople.com/
--
we can deny everything, except that we have the possibility of being better. simply reflect on that.
h.h. the 14th dalai lama


