[ https://issues.apache.org/jira/browse/HBASE-3732?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13016021#comment-13016021 ]
Benoit Sigoure commented on HBASE-3732: --------------------------------------- Oh yeah I forgot that this was in the {{info:regioninfo}} column, my bad. Wouldn't it be awesome if this was actually on a key-per-key basis? Is there a spare bit in {{KeyValue}} we can steal to indicate "this KV is compressed"? We could not only compress the value, but also the column qualifier and/or the key if they're big too (some applications store data in the column qualifier or, less frequently, in the key). > New configuration option for client-side compression > ---------------------------------------------------- > > Key: HBASE-3732 > URL: https://issues.apache.org/jira/browse/HBASE-3732 > Project: HBase > Issue Type: New Feature > Reporter: Jean-Daniel Cryans > Fix For: 0.92.0 > > > We have a case here where we have to store very fat cells (arrays of > integers) which can amount into the hundreds of KBs that we need to read > often, concurrently, and possibly keep in cache. Compressing the values on > the client using java.util.zip's Deflater before sending them to HBase proved > to be in our case almost an order of magnitude faster. > There reasons are evident: less data sent to hbase, memstore contains > compressed data, block cache contains compressed data too, etc. > I was thinking that it might be something useful to add to a family schema, > so that Put/Result do the conversion for you. The actual compression algo > should also be configurable. -- This message is automatically generated by JIRA. For more information on JIRA, see: http://www.atlassian.com/software/jira