Hi

>> Since I'm storing historical data (snapshot data) and changes between
>> adjacent value cells are relatively small.

If the values change at all, even slightly, FASTDIFF will rewrite the whole
value part. It skips the value part only when the value is an exact match
of the previous cell's value. JFYI.
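
To make the point concrete, here is a toy Python sketch of that behaviour.
It is only an illustration under my own assumptions: the "SAME"/"FULL" tags
and the function names are hypothetical, and this is not HBase's actual
on-disk format or API. It just shows that a value differing by one
character is stored in full, while an identical value costs nothing.

```python
def encode_values(values):
    """Toy model: skip a value only if it exactly matches the previous one."""
    encoded = []
    previous = None
    for value in values:
        if previous is not None and value == previous:
            # Exact match: store only a marker, no value bytes.
            encoded.append(("SAME", b""))
        else:
            # Any difference, however small: store the whole value again.
            encoded.append(("FULL", value))
        previous = value
    return encoded


def encoded_size(encoded):
    """Total number of value bytes actually stored."""
    return sum(len(payload) for _, payload in encoded)


snapshots = [
    b'{"a": 0, "b": 1, "c": 0}',
    b'{"a": 0, "b": 1, "c": 0}',  # identical to the previous snapshot
    b'{"a": 0, "b": 1, "c": 1}',  # differs by a single character
]

encoded = encode_values(snapshots)
tags = [tag for tag, _ in encoded]
```

So for snapshot data where adjacent values are similar but not identical,
this kind of encoding saves nothing on the value part; block compression
still has to do that work.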

Regards
Ram

On Thu, Nov 13, 2014 at 11:23 AM, Jianshi Huang <jianshi.hu...@gmail.com>
wrote:

> I thought FASTDIFF was only for rowkey and columns, great if it also works
> in value cell.
>
> And thanks for the bjson link!
>
> Jianshi
>
> On Thu, Nov 13, 2014 at 1:18 PM, Ted Yu <yuzhih...@gmail.com> wrote:
>
> > There is FASTDIFF data block encoding.
> >
> > See also http://bjson.org/
> >
> > Cheers
> >
> > On Nov 12, 2014, at 9:08 PM, Jianshi Huang <jianshi.hu...@gmail.com>
> > wrote:
> >
> > > Hi,
> > >
> > > > I'm currently saving JSON in pure String format in the value cell and
> > > > depend on HBase's block compression to reduce the overhead of JSON.
> > >
> > > I'm wondering if there's a more space efficient way to store JSON?
> > > (there're lots of 0s and 1s, JSON String actually is an OK format)
> > >
> > > I want to keep the value as a Map since the schema of source data might
> > > change over time.
> > >
> > > > Also is there a DIFF based encoding for values? Since I'm storing
> > > > historical data (snapshot data) and changes between adjacent value
> > > > cells are relatively small.
> > >
> > >
> > > Thanks,
> > > --
> > > Jianshi Huang
> > >
> > > LinkedIn: jianshi
> > > Twitter: @jshuang
> > > Github & Blog: http://huangjs.github.com/
> >
>
>
>
> --
> Jianshi Huang
>
> LinkedIn: jianshi
> Twitter: @jshuang
> Github & Blog: http://huangjs.github.com/
>
