My name is Ted, not Lars :-)

On Wed, Feb 13, 2013 at 7:41 PM, Mehmet Simsek <[email protected]> wrote:
> Thanks Lars
>
> M.Nurettin Şimşek
>
> On 14 Feb 2013, at 05:18, Ted Yu <[email protected]> wrote:
>
> > Jean-Marc:
> > You can find almost all the details you need from this JIRA:
> > HBASE-4218 Data Block Encoding of KeyValues (aka delta encoding / prefix
> > compression)
> >
> > Cheers
> >
> > On Wed, Feb 13, 2013 at 6:09 PM, Jean-Marc Spaggiari <
> > [email protected]> wrote:
> >
> >> Hi Lars,
> >>
> >> Can you please tell more about key prefix block encoding? Or refer to
> >> some blog/doc? How it works, what it is, etc.?
> >>
> >> Thanks,
> >>
> >> JM
> >>
> >> 2013/2/13, lars hofhansl <[email protected]>:
> >>> Depends on your search pattern.
> >>> If you never care about scan ordering, i.e. you only do point gets to see
> >>> whether you've already seen an email address, do the hash part.
> >>>
> >>> I'd prefer #1 over #2, because it would let you do efficient key prefix
> >>> block encoding (FAST_DIFF).
> >>>
> >>> -- Lars
> >>>
> >>> ________________________________
> >>> From: Nurettin Şimşek <[email protected]>
> >>> To: [email protected]
> >>> Sent: Wednesday, February 13, 2013 12:35 AM
> >>> Subject: RowKey design with hashing
> >>>
> >>> Hi All,
> >>>
> >>> In our project, mail addresses are the row key. Which rowkey design
> >>> should we choose?
> >>>
> >>> 1) com.yahoo@xxxx (Reversed)
> >>> 2) [email protected]
> >>> 3) md5 hash([email protected])
> >>> 4) Any other solution.
> >>>
> >>> Many thanks.
> >>>
> >>> --
> >>> M. Nurettin ŞİMŞEK
> >>
> >
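
For anyone reading this thread later, here is a rough sketch of the three row key options from the original question. It is only an illustration: the function names and the example.com / example.org addresses are made up, and the "shared prefix" count is a crude stand-in for what a prefix-based block encoding such as FAST_DIFF can exploit between adjacent sorted keys, not how HBase actually measures or stores anything.

```python
import hashlib
import os.path


def reversed_domain_key(email: str) -> str:
    """Option 1: 'alice@example.com' -> 'com.example@alice' (hypothetical helper)."""
    local, _, domain = email.rpartition("@")
    return ".".join(reversed(domain.split("."))) + "@" + local


def md5_key(email: str) -> str:
    """Option 3: hex MD5 digest of the address (hypothetical helper)."""
    return hashlib.md5(email.encode("utf-8")).hexdigest()


def shared_prefix_total(keys) -> int:
    """Sum of common-prefix lengths between neighbours in sorted order.

    A crude proxy for how much a prefix encoder could skip; an assumption
    made for illustration, not an HBase metric.
    """
    ordered = sorted(keys)
    return sum(
        len(os.path.commonprefix([a, b]))
        for a, b in zip(ordered, ordered[1:])
    )


# Made-up addresses, used only for the comparison below.
emails = ["alice@example.com", "bob@example.com", "carol@example.org"]

reversed_keys = [reversed_domain_key(e) for e in emails]  # option 1
plain_keys = list(emails)                                 # option 2
hashed_keys = [md5_key(e) for e in emails]                # option 3

print(reversed_keys[0])                    # com.example@alice
print(shared_prefix_total(reversed_keys))  # 12 ("com.example@" shared once)
print(shared_prefix_total(plain_keys))     # 0
print(shared_prefix_total(hashed_keys))    # typically near 0 for hashes
```

With the reversed form, addresses from the same domain sort next to each other and share a long prefix, which is the property Lars points to for #1. Hashed keys spread evenly but lose both the shared prefixes and any meaningful scan order, so they fit the point-get-only case.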
