Hi Ted, Thanks for pointing me to HBASE-4218. I will take a look at it.

JM

2013/2/13 Ted Yu <[email protected]>

> My name is Ted, not Lars :-)
>
> On Wed, Feb 13, 2013 at 7:41 PM, Mehmet Simsek <[email protected]> wrote:
>
> > Thanks Lars
> >
> > M.Nurettin Şimşek
> >
> > On 14 Feb 2013, at 05:18, Ted Yu <[email protected]> wrote:
> >
> > > Jean-Marc:
> > > You can find almost all the details you need from this JIRA:
> > > HBASE-4218 Data Block Encoding of KeyValues (aka delta encoding / prefix compression)
> > >
> > > Cheers
> > >
> > > On Wed, Feb 13, 2013 at 6:09 PM, Jean-Marc Spaggiari <[email protected]> wrote:
> > >
> > >> Hi Lars,
> > >>
> > >> Can you please tell me more about key prefix block encoding? Or refer to
> > >> some blog/doc? How it works, what it is, etc.?
> > >>
> > >> Thanks,
> > >>
> > >> JM
> > >>
> > >> 2013/2/13, lars hofhansl <[email protected]>:
> > >>> Depends on your search pattern.
> > >>> If you never care about scan ordering, i.e. you only do point gets to see
> > >>> whether you've already seen an email address, do the hash part.
> > >>>
> > >>> I'd prefer #1 over #2, because it would let you do efficient key prefix
> > >>> block encoding (FAST_DIFF).
> > >>>
> > >>> -- Lars
> > >>>
> > >>> ________________________________
> > >>> From: Nurettin Şimşek <[email protected]>
> > >>> To: [email protected]
> > >>> Sent: Wednesday, February 13, 2013 12:35 AM
> > >>> Subject: RowKey design with hashing
> > >>>
> > >>> Hi All,
> > >>>
> > >>> In our project, mail addresses are the row key. Which rowkey design
> > >>> should we choose?
> > >>>
> > >>> 1) com.yahoo@xxxx (Reversed)
> > >>> 2) [email protected]
> > >>> 3) md5 hash([email protected])
> > >>> 4) Any other solution.
> > >>>
> > >>> Many thanks.
> > >>>
> > >>> --
> > >>> M. Nurettin ŞİMŞEK
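
To make the row key options from the quoted thread concrete, here is a minimal sketch in plain Python (no HBase client involved). The function names and the sample addresses are made up for illustration, and the prefix count is only a rough stand-in for what a prefix-based block encoding such as FAST_DIFF could exploit; it is not the actual HBase encoder.

```python
import hashlib


def reversed_domain_key(email: str) -> str:
    """Option 1: reversed domain first, e.g. 'alice@yahoo.com' -> 'com.yahoo@alice'."""
    local, _, domain = email.rpartition("@")
    reversed_domain = ".".join(reversed(domain.split(".")))
    return f"{reversed_domain}@{local}"


def md5_key(email: str) -> str:
    """Option 3: hex MD5 digest of the address (fixed length, no useful ordering)."""
    return hashlib.md5(email.encode("utf-8")).hexdigest()


def shared_prefix_len(a: str, b: str) -> int:
    """Number of leading characters two keys have in common."""
    n = 0
    for x, y in zip(a, b):
        if x != y:
            break
        n += 1
    return n


if __name__ == "__main__":
    # Hypothetical sample addresses, not taken from the thread.
    emails = ["alice@yahoo.com", "bob@yahoo.com", "carol@example.org"]

    for name, make_key in [("reversed", reversed_domain_key), ("md5", md5_key)]:
        keys = sorted(make_key(e) for e in emails)
        # Compare each key with its neighbour in sort order, which is how
        # rows would sit next to each other on disk.
        shared = [shared_prefix_len(a, b) for a, b in zip(keys, keys[1:])]
        print(name, keys, "shared prefix lengths:", shared)
```

With reversed keys, addresses from the same domain sort next to each other and share the whole domain part as a prefix, which is the property Lars points to for option 1. Hashed keys spread rows evenly but neighbouring keys have little in common, so they suit the point-get-only case he describes.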
