Re: HFile size writeup in HBase Blog

2014-04-12 Thread Doug Meil

Thanks Ted!

I can add that to the to-do list.  Also have plans for read/write
performance numbers too in a follow-up blog.






On 4/11/14, 6:00 PM, Ted Yu yuzhih...@gmail.com wrote:

Nice writeup, Doug.

Do you have plan to profile Prefix Tree data block encoding ?

Cheers


On Fri, Apr 11, 2014 at 3:14 PM, Doug Meil
doug.m...@explorysmedical.comwrote:

 Hey folks,

 Stack published a writeup I did on the HBase blog on the effects of
rowkey
 size, column-name size, CF compression, data block encoding and KV
storage
 approach on HFile size.  For example, had large row keys vs. small row
 keys, used Snappy vs. LZO vs. etc., used prefix vs. fast-diff, used a KV
 per column vs. a single KV per row.  We tried 'em all... and wrote it
up.

 http://blogs.apache.org/hbase/


 Doug Meil
 Chief Software Architect, Explorys
 doug.m...@explorysmedical.com






Re: HFile size writeup in HBase Blog

2014-04-12 Thread Ted Yu
Looking forward to your next blog. 

Cheers

On Apr 12, 2014, at 5:08 AM, Doug Meil doug.m...@explorysmedical.com wrote:

 
 Thanks Ted!
 
 I can add that to the to-do list.  Also have plans for read/write
 performance numbers too in a follow-up blog.
 
 
 
 
 
 
 On 4/11/14, 6:00 PM, Ted Yu yuzhih...@gmail.com wrote:
 
 Nice writeup, Doug.
 
 Do you have plan to profile Prefix Tree data block encoding ?
 
 Cheers
 
 
 On Fri, Apr 11, 2014 at 3:14 PM, Doug Meil
 doug.m...@explorysmedical.comwrote:
 
 Hey folks,
 
 Stack published a writeup I did on the HBase blog on the effects of
 rowkey
 size, column-name size, CF compression, data block encoding and KV
 storage
 approach on HFile size.  For example, had large row keys vs. small row
 keys, used Snappy vs. LZO vs. etc., used prefix vs. fast-diff, used a KV
 per column vs. a single KV per row.  We tried 'em all... and wrote it
 up.
 
 http://blogs.apache.org/hbase/
 
 
 Doug Meil
 Chief Software Architect, Explorys
 doug.m...@explorysmedical.com
 


HFile size writeup in HBase Blog

2014-04-11 Thread Doug Meil
Hey folks,

Stack published a writeup I did on the HBase blog on the effects of rowkey 
size, column-name size, CF compression, data block encoding and KV storage 
approach on HFile size.  For example, had large row keys vs. small row keys, 
used Snappy vs. LZO vs. etc., used prefix vs. fast-diff, used a KV per column 
vs. a single KV per row.  We tried 'em all... and wrote it up.

http://blogs.apache.org/hbase/


Doug Meil
Chief Software Architect, Explorys
doug.m...@explorysmedical.com




Re: HFile size writeup in HBase Blog

2014-04-11 Thread Ted Yu
Nice writeup, Doug.

Do you have plan to profile Prefix Tree data block encoding ?

Cheers


On Fri, Apr 11, 2014 at 3:14 PM, Doug Meil doug.m...@explorysmedical.comwrote:

 Hey folks,

 Stack published a writeup I did on the HBase blog on the effects of rowkey
 size, column-name size, CF compression, data block encoding and KV storage
 approach on HFile size.  For example, had large row keys vs. small row
 keys, used Snappy vs. LZO vs. etc., used prefix vs. fast-diff, used a KV
 per column vs. a single KV per row.  We tried 'em all... and wrote it up.

 http://blogs.apache.org/hbase/


 Doug Meil
 Chief Software Architect, Explorys
 doug.m...@explorysmedical.com