On Thu, Jan 16, 2014 at 1:00 AM, Amit Sela <[email protected]> wrote:
> ...
>
> Could such a compression ratio make sense in case of many qualifiers per
> row in a table (avg is 16 but in practice there are some rows with much
> more and even a small number of rows with hundreds of thousands...)? If
> each KeyValue contains the rowkey, and the rowkeys contain more bytes than
> the qualifiers / values, then the rows repeat themselves in the HFile and
> actually make most of the HFile, right?
>

If lots of repetition, yes.

You could test by counting uncompressed and compressed tables. Do you get
the same answer?

Yours,
St.Ack
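
To get a feel for the repetition effect outside HBase, here is a rough toy sketch in Python. It is not the HFile format: the cell layout (rowkey + qualifier + value, nothing else), the byte sizes, and the use of zlib are all assumptions made for illustration. It only shows that when a long rowkey is repeated once per qualifier, a general-purpose compressor gets a much better ratio than when each row has a single cell.

```python
import random
import zlib


def build_cells(num_rows, quals_per_row, rowkey_len=64, qual_len=4,
                value_len=8, seed=0):
    """Serialize toy cells row by row as rowkey + qualifier + value.

    Hypothetical layout for illustration only; real KeyValues also carry
    lengths, column family, timestamp and type.
    """
    rng = random.Random(seed)
    out = bytearray()
    for _ in range(num_rows):
        # Random rowkey: incompressible on its own, so any gain comes
        # from the key being repeated in every cell of the row.
        rowkey = bytes(rng.getrandbits(8) for _ in range(rowkey_len))
        for q in range(quals_per_row):
            qualifier = q.to_bytes(qual_len, "big")
            value = bytes(rng.getrandbits(8) for _ in range(value_len))
            out += rowkey + qualifier + value
    return bytes(out)


def compression_ratio(data):
    """Uncompressed size divided by zlib-compressed size."""
    return len(data) / len(zlib.compress(data))


if __name__ == "__main__":
    for quals in (1, 16, 256):
        data = build_cells(num_rows=200, quals_per_row=quals)
        print(quals, "qualifiers/row -> ratio",
              round(compression_ratio(data), 2))
```

With these assumed sizes the rowkey is 64 of every 76 bytes per cell, so the more qualifiers a row has, the more of the file is repeated key bytes. The real check is still the one above: compare the uncompressed and compressed tables.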
