Re: Accumulo iterators in HBase

Stack Fri, 04 Jul 2014 14:56:24 -0700

On Wed, Jul 2, 2014 at 12:07 AM, Carl Austin <[email protected]> wrote:


> Thanks for the time to look and comment and glad it sounds interesting,
>
> The reason I started on this was that I'm using Accumulo and want to make
> an application usable on both HBase and Accumulo with the same codebase. I
> do a lot of aggregations on data and I feel the Accumulo iterator mechanism
> is superior for this use case; it's one of the main reasons I went with
> Accumulo and one of the only remaining major differences between the two
> applications now that HBase has implemented cell level ACLs.
> For example, as I am ingesting a main table of data I am creating many
> other question focused tables that keep answers like how many times did I
> see combinations of values, when was the last time I saw combinations
> together, how many distinct values where in this field for each combination
> (using probabilistic counting of course) and many more. All of these things
> are well suited to Accumulo iterators for performance at scale because of
> how they run at compaction time across key/values that are already being
> read at that point, rather than having to update the answers to these
> questions on every single insert.
>
> This use case won't be for everyone, but the iterator mechanism is pretty
> neat, powerful and a real differentiator in Accumulo (of course there are
> many differentiators in HBase too!).
>

Thank you for sharing your experience.  I'm watching your repo.  Feel free
to ping me off-list if you want an opinion on how to hbase it or if you
want a review.

Thanks Carl,
St.Ack

Re: Accumulo iterators in HBase

Reply via email to