key hashing?

Fernando Padilla Mon, 27 Jul 2009 12:00:27 -0700

So I will be generating lots of rows into the db keyed by userId, inuserId order.

I have already learned through this mailing list that this use-case isnot ideal, since it would mean most row-inserts will be on one regionserver. I know that some people suggest to add some randomization tothe keys so that it will be spread around, but I can't do that, sinceI'll need to be able to do random access lookup on the rows via userId.

But I'm wondering if I could map/hash the real userId, into anothernumber that will spread around the inserts. And I can still do randomaccess lookups given a real userId, by calculating the hash..




1) i think i like this idea, does anyone have any experience with this?

2) assume userId is a 8byte long, what would be some good hashingfunctions? I would be lazy and use little-endian, but I bet one of youcould come up with something better. :)

key hashing?

Reply via email to