Re: Best practices in ID generation?

Jonathan Gray Wed, 08 Jul 2009 11:36:45 -0700

Right, BD.

We use incrementing IDs for most things because it gives us ordering.

If you only have random-key access, you would be better suited usingUUID-style IDs as Bryan says.


Bryan Duxbury wrote:

Not necessarily in context of hbase, but Rapleaf uses UUIDs/GUIDs, sincethey are crazy fast to generate and have no dependencies on externalresources.
In the context of hbase, a benefit of UUIDs is that they will berandomly distributed over your whole table, instead of consistentlyshowing up in the last region in the table.
-Bryan

On Jul 8, 2009, at 11:10 AM, Jonathan Gray wrote:
There are a number of different ways you could generate IDs.
Some people use GUIDs, probably the simplest way, though not myrecommendation.
ZooKeeper as a facility for ID generation.
Here, we use HBase for ID generation. Currently in production, whichis running on 0.19, I run a custom patch that works very much likeincrementColumnValue does in 0.20. When moving to 0.20 I plan onmigrating our ID assignment system to the built-in ICV.
You can expect about a 1ms end-to-end time on an increment operation,so if you need to generate more than 1000 ids/second, you need tothink about how to distribute it across multiple rows or you couldgrab them in batches (increment by 100 to generate 100 ids at a time,still takes 1ms).
Hope that helps.

JG

Vaibhav Puranik wrote:
Hi,
Does anybody have any suggestion/best practices on id/row/keygeneration for
HBase rows?
Do people use sequential ids (like rdbms - 1,2,3,4...) or people use
strings  ids?
What id server do you use? Do we have to write our own?
Any help/experiences please?.
Regards,
Vaibhav

Re: Best practices in ID generation?

Reply via email to