Regards, Jean-Marc. The best resource that I found for this is a great blog post called Apache HBase I/O - HFile from Matteo Bertozzi in Cloudera´s blog. Here´s the link: http://blog.cloudera.com/blog/2012/06/hbase-io-hfile-input-output/
2013/4/3 Jean-Marc Spaggiari <[email protected]> > Is there any documentation anywhere regarding the differences between > PREFIX, DIFF and FAST_DIFF? > > 2013/4/3 prakash kadel <[email protected]>: > > thank you very much. > > i will try with snappy compression with data_block_encoding > > > > > > > > > > On Wed, Apr 3, 2013 at 11:21 PM, Kevin O'dell <[email protected] > >wrote: > > > >> Prakash, > >> > >> Yes, I would recommend Snappy Compression. > >> > >> On Wed, Apr 3, 2013 at 10:18 AM, Prakash Kadel <[email protected] > > > >> wrote: > >> > Thanks, > >> > is there any specific compression that is recommended of the use > >> case i have? > >> > Since my values are all null will compression help? > >> > > >> > I am thinking of using prefix data_block_encoding.. > >> > Sincerely, > >> > Prakash Kadel > >> > > >> > > >> > On Apr 3, 2013, at 10:55 PM, Ted Yu wrote: > >> > > >> >> You should use data block encoding (in 0.94.x releases only). It is > >> helpful > >> >> for reads. > >> >> > >> >> You can also enable compression. > >> >> > >> >> Cheers > >> >> > >> >> > >> >> On Wed, Apr 3, 2013 at 6:42 AM, Prakash Kadel < > [email protected] > >> >wrote: > >> >> > >> >>> Hello, > >> >>> I have a question. > >> >>> I have a table where i store data in the column qualifiers(the > >> values > >> >>> itself are null). > >> >>> I just have 1 column family. > >> >>> The number of columns per row is variable (1~ few thousands) > >> >>> > >> >>> Currently i don't use compression or the data_block_encoding. > >> >>> > >> >>> Should i? > >> >>> I want to have faster reads. > >> >>> > >> >>> Please suggest. > >> >>> > >> >>> > >> >>> Sincerely, > >> >>> Prakash Kadel > >> > > >> > >> > >> > >> -- > >> Kevin O'Dell > >> Systems Engineer, Cloudera > >> > -- Marcos Ortiz Valmaseda, *Data-Driven Product Manager* at PDVSA *Blog*: http://dataddict.wordpress.com/ *LinkedIn: *http://www.linkedin.com/in/marcosluis2186 *Twitter*: @marcosluis2186 <http://twitter.com/marcosluis2186>
