[ https://issues.apache.org/jira/browse/HBASE-3732?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13019967#comment-13019967 ]
stack commented on HBASE-3732: ------------------------------ BenoƮt: We can't compress column qualifier because then columns would sort differently. As to adding bit to say KV is compressed, that might be possible. Currently we have a type byte in each KV. The top four bits are unused. I had stared a patch to use the top two for 'version' and had done the work to make sure version was not considered comparing adding proper masks etc. I could revive this work to add in a compression bit. > New configuration option for client-side compression > ---------------------------------------------------- > > Key: HBASE-3732 > URL: https://issues.apache.org/jira/browse/HBASE-3732 > Project: HBase > Issue Type: New Feature > Reporter: Jean-Daniel Cryans > Fix For: 0.92.0 > > > We have a case here where we have to store very fat cells (arrays of > integers) which can amount into the hundreds of KBs that we need to read > often, concurrently, and possibly keep in cache. Compressing the values on > the client using java.util.zip's Deflater before sending them to HBase proved > to be in our case almost an order of magnitude faster. > There reasons are evident: less data sent to hbase, memstore contains > compressed data, block cache contains compressed data too, etc. > I was thinking that it might be something useful to add to a family schema, > so that Put/Result do the conversion for you. The actual compression algo > should also be configurable. -- This message is automatically generated by JIRA. For more information on JIRA, see: http://www.atlassian.com/software/jira